Check if past_key_values is provided when using prefix_tuning in peft_model #1942

Nidhogg-lyz · 2024-07-22T15:12:00Z

Fix TypeError object got multiple values for keyword argument 'past_key_values' when past_key_values is provided in base_model's forward call in Prefix Tuning. Resolves #1938 .

Nidhogg-lyz · 2024-07-22T15:19:41Z

I'm not sure if I add the unit test file test_past_kv.py in the correct place, so if there's any problem please let me know.

BenjaminBossan

Thanks for providing this fix so quickly. I have a few comments, please take a look.

tests/test_past_kv.py

src/peft/peft_model.py

merge upstream commits

…nto check_past_kv merge origin commit

HuggingFaceDocBuilderDev · 2024-07-26T09:21:13Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

BenjaminBossan · 2024-07-26T09:21:37Z

Thanks for the updates @Nidhogg-lyz, could you please run make style?

Nidhogg-lyz · 2024-07-26T09:30:04Z

@BenjaminBossan I've already run make style and make quality. It seems the errors raised are not related to the codes I modified, should I edit others' code?

~nidhogg% make style
ruff check --fix src tests examples docs scripts docker
src/peft/tuners/boft/layer.py:94:20: F811 Redefinition of unused `fbd_cuda` from line 87
   |
92 |             )
93 |             # extra_cuda_cflags = ['-std=c++14', '-ccbin=$$(which gcc-7)']) # cuda10.2 is not compatible with gcc9. Specify gcc 7
94 |             import fbd_cuda
   |                    ^^^^^^^^ F811
95 |     except Exception as e:
96 |         warnings.warn(f"Failed to load the CUDA extension: {e}, check if ninja is available.")
   |
   = help: Remove definition: `fbd_cuda`

src/peft/tuners/lora/model.py:556:12: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
    |
554 |             )
555 | 
556 |         if target_module_types[0] == str:
    |            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
557 |             new_target_modules = "|".join(f"({self.peft_config[adapter].target_modules})" for adapter in adapters)
558 |         elif target_module_types[0] == set:
    |

src/peft/tuners/lora/model.py:558:14: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
    |
556 |         if target_module_types[0] == str:
557 |             new_target_modules = "|".join(f"({self.peft_config[adapter].target_modules})" for adapter in adapters)
558 |         elif target_module_types[0] == set:
    |              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
559 |             new_target_modules = reduce(
560 |                 operator.or_, (self.peft_config[adapter].target_modules for adapter in adapters)
    |

src/peft/tuners/tuners_utils.py:574:9: F811 Redefinition of unused `active_adapter` from line 509
    |
573 |     @property
574 |     def active_adapter(self) -> str | list[str]:
    |         ^^^^^^^^^^^^^^ F811
575 |         # use a property to ensure that active_adapter is not set directly, instead use the set_adapter method
576 |         return self._active_adapter
    |
    = help: Remove definition: `active_adapter`

src/peft/tuners/xlora/model.py:49:12: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
   |
47 |     for module in base.modules():
48 |         # Check the exact type because classes like OPTLearnedPositionalEmbedding inherit from nn.Embedding
49 |         if type(module) == lora.Linear:
   |            ^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
50 |             device = module.lora_A[next(iter(module.lora_A))].weight.device
51 |             new_layer = XLoraLinearLayer(
   |

src/peft/tuners/xlora/model.py:61:14: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
   |
59 |             module.forward = new_layer.forward  # type: ignore[method-assign]
60 |             total_swapped += 1
61 |         elif type(module) == lora.Embedding:
   |              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
62 |             device = module.lora_embedding_A[next(iter(module.lora_embedding_A))].device
63 |             new_layer = XLoraEmbeddingLayer(
   |

src/peft/tuners/xlora/model.py:73:14: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
   |
71 |             module.forward = new_layer.forward  # type: ignore[method-assign]
72 |             total_swapped += 1
73 |         elif type(module) == lora.Conv2d:
   |              ^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
74 |             device = module.lora_A[next(iter(module.lora_A))].weight.device
75 |             new_layer = XLoraConv2dLayer(
   |

tests/test_mixed.py:125:12: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
    |
123 |         tuner_layers = [mod for mod in peft_model_01.modules() if isinstance(mod, BaseTunerLayer)]
124 |         tuner_types = {type(tuner_layer) for tuner_layer in tuner_layers}
125 |         if type(config0) == type(config1):
    |            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
126 |             assert len(tuner_types) == 1
127 |         else:
    |

tests/test_mixed.py:150:12: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
    |
148 |         tuner_layers = [mod for mod in peft_model_10.modules() if isinstance(mod, BaseTunerLayer)]
149 |         tuner_types = {type(tuner_layer) for tuner_layer in tuner_layers}
150 |         if type(config0) == type(config1):
    |            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
151 |             assert len(tuner_types) == 1
152 |         else:
    |

tests/test_mixed.py:169:12: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
    |
167 |         tuner_layers = [mod for mod in peft_model_10.modules() if isinstance(mod, BaseTunerLayer)]
168 |         tuner_types = {type(tuner_layer) for tuner_layer in tuner_layers}
169 |         if type(config0) == type(config1):
    |            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
170 |             assert len(tuner_types) == 1
171 |         else:
    |

tests/test_tuners_utils.py:294:20: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
    |
292 |         for name, actual_module in actual_model.named_modules():
293 |             expected_module = expected_model_module_dict[name]
294 |             assert type(actual_module) == type(expected_module)
    |                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
295 | 
296 |     def test_maybe_include_all_linear_layers_ia3_loha(self):
    |

Found 11 errors.
make: *** [style] Error 1

For make quality

~nidhogg% make quality
ruff check src tests examples docs scripts docker
src/peft/tuners/boft/layer.py:94:20: F811 Redefinition of unused `fbd_cuda` from line 87
   |
92 |             )
93 |             # extra_cuda_cflags = ['-std=c++14', '-ccbin=$$(which gcc-7)']) # cuda10.2 is not compatible with gcc9. Specify gcc 7
94 |             import fbd_cuda
   |                    ^^^^^^^^ F811
95 |     except Exception as e:
96 |         warnings.warn(f"Failed to load the CUDA extension: {e}, check if ninja is available.")
   |
   = help: Remove definition: `fbd_cuda`

src/peft/tuners/lora/model.py:556:12: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
    |
554 |             )
555 | 
556 |         if target_module_types[0] == str:
    |            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
557 |             new_target_modules = "|".join(f"({self.peft_config[adapter].target_modules})" for adapter in adapters)
558 |         elif target_module_types[0] == set:
    |

src/peft/tuners/lora/model.py:558:14: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
    |
556 |         if target_module_types[0] == str:
557 |             new_target_modules = "|".join(f"({self.peft_config[adapter].target_modules})" for adapter in adapters)
558 |         elif target_module_types[0] == set:
    |              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
559 |             new_target_modules = reduce(
560 |                 operator.or_, (self.peft_config[adapter].target_modules for adapter in adapters)
    |

src/peft/tuners/tuners_utils.py:574:9: F811 Redefinition of unused `active_adapter` from line 509
    |
573 |     @property
574 |     def active_adapter(self) -> str | list[str]:
    |         ^^^^^^^^^^^^^^ F811
575 |         # use a property to ensure that active_adapter is not set directly, instead use the set_adapter method
576 |         return self._active_adapter
    |
    = help: Remove definition: `active_adapter`

src/peft/tuners/xlora/model.py:49:12: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
   |
47 |     for module in base.modules():
48 |         # Check the exact type because classes like OPTLearnedPositionalEmbedding inherit from nn.Embedding
49 |         if type(module) == lora.Linear:
   |            ^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
50 |             device = module.lora_A[next(iter(module.lora_A))].weight.device
51 |             new_layer = XLoraLinearLayer(
   |

src/peft/tuners/xlora/model.py:61:14: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
   |
59 |             module.forward = new_layer.forward  # type: ignore[method-assign]
60 |             total_swapped += 1
61 |         elif type(module) == lora.Embedding:
   |              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
62 |             device = module.lora_embedding_A[next(iter(module.lora_embedding_A))].device
63 |             new_layer = XLoraEmbeddingLayer(
   |

src/peft/tuners/xlora/model.py:73:14: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
   |
71 |             module.forward = new_layer.forward  # type: ignore[method-assign]
72 |             total_swapped += 1
73 |         elif type(module) == lora.Conv2d:
   |              ^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
74 |             device = module.lora_A[next(iter(module.lora_A))].weight.device
75 |             new_layer = XLoraConv2dLayer(
   |

tests/test_mixed.py:125:12: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
    |
123 |         tuner_layers = [mod for mod in peft_model_01.modules() if isinstance(mod, BaseTunerLayer)]
124 |         tuner_types = {type(tuner_layer) for tuner_layer in tuner_layers}
125 |         if type(config0) == type(config1):
    |            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
126 |             assert len(tuner_types) == 1
127 |         else:
    |

tests/test_mixed.py:150:12: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
    |
148 |         tuner_layers = [mod for mod in peft_model_10.modules() if isinstance(mod, BaseTunerLayer)]
149 |         tuner_types = {type(tuner_layer) for tuner_layer in tuner_layers}
150 |         if type(config0) == type(config1):
    |            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
151 |             assert len(tuner_types) == 1
152 |         else:
    |

tests/test_mixed.py:169:12: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
    |
167 |         tuner_layers = [mod for mod in peft_model_10.modules() if isinstance(mod, BaseTunerLayer)]
168 |         tuner_types = {type(tuner_layer) for tuner_layer in tuner_layers}
169 |         if type(config0) == type(config1):
    |            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
170 |             assert len(tuner_types) == 1
171 |         else:
    |

tests/test_tuners_utils.py:294:20: E721 Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
    |
292 |         for name, actual_module in actual_model.named_modules():
293 |             expected_module = expected_model_module_dict[name]
294 |             assert type(actual_module) == type(expected_module)
    |                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ E721
295 | 
296 |     def test_maybe_include_all_linear_layers_ia3_loha(self):
    |

Found 11 errors.
make: *** [quality] Error 1

BenjaminBossan · 2024-07-26T09:35:13Z

What is your ruff version? Try 0.4.10.

Nidhogg-lyz · 2024-07-26T09:42:16Z

Done! My original version of ruff is 0.5.4 and that's the reason.

BenjaminBossan · 2024-07-26T10:27:47Z

The errors for MacOS on CI seem to be unrelated to this PR:

RuntimeError: "LayerNormKernelImpl" not implemented for 'Half'

I will investigate further and come back to this PR afterwards.

Edit: Sorry, I was mistaken, it's not unrelated. @Nidhogg-lyz could you please remove the usage of float16 from the test?

Nidhogg-lyz · 2024-07-29T01:59:51Z

Done!

BenjaminBossan

Thanks a lot for the fix.

There was an error with prefix tuning when some models like Llava passed past_key_values explicitly, even if it was None, because it resulted in that argument passed twice (once explicit, once via kwargs). This is now fixed.

nidhogg added 2 commits July 22, 2024 21:57

check past_key_values in forward function of peft_model

a4949a6

add unit test test_past_kv.py

f73324c

update code style

6f5115c

Nidhogg-lyz marked this pull request as ready for review July 23, 2024 03:33

BenjaminBossan requested changes Jul 23, 2024

View reviewed changes

liyuzhi and others added 4 commits July 26, 2024 10:21

Merge remote-tracking branch 'origin/main' into check_past_kv

83fb3b5

merge upstream commits

update check_past_kv logic

e532e3a

Merge branch 'huggingface:main' into check_past_kv

b422ec5

Merge branch 'check_past_kv' of https://github.com/Nidhogg-lyz/peft i…

0ce2a61

…nto check_past_kv merge origin commit

Nidhogg-lyz requested a review from BenjaminBossan July 26, 2024 05:09

quality & style check

c2811b6

rm float16

3eca24b

BenjaminBossan approved these changes Jul 29, 2024

View reviewed changes

BenjaminBossan merged commit 296fbcd into huggingface:main Jul 29, 2024
14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Check if past_key_values is provided when using prefix_tuning in peft_model #1942

Check if past_key_values is provided when using prefix_tuning in peft_model #1942

Uh oh!

Nidhogg-lyz commented Jul 22, 2024

Uh oh!

Nidhogg-lyz commented Jul 22, 2024

Uh oh!

BenjaminBossan left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Jul 26, 2024

Uh oh!

BenjaminBossan commented Jul 26, 2024

Uh oh!

Nidhogg-lyz commented Jul 26, 2024

Uh oh!

BenjaminBossan commented Jul 26, 2024

Uh oh!

Nidhogg-lyz commented Jul 26, 2024

Uh oh!

BenjaminBossan commented Jul 26, 2024 •

edited

Loading

Uh oh!

Nidhogg-lyz commented Jul 29, 2024

Uh oh!

BenjaminBossan left a comment

Uh oh!

Uh oh!

Uh oh!

Check if past_key_values is provided when using prefix_tuning in peft_model #1942

Check if past_key_values is provided when using prefix_tuning in peft_model #1942

Uh oh!

Conversation

Nidhogg-lyz commented Jul 22, 2024

Uh oh!

Nidhogg-lyz commented Jul 22, 2024

Uh oh!

BenjaminBossan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Jul 26, 2024

Uh oh!

BenjaminBossan commented Jul 26, 2024

Uh oh!

Nidhogg-lyz commented Jul 26, 2024

Uh oh!

BenjaminBossan commented Jul 26, 2024

Uh oh!

Nidhogg-lyz commented Jul 26, 2024

Uh oh!

BenjaminBossan commented Jul 26, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Nidhogg-lyz commented Jul 29, 2024

Uh oh!

BenjaminBossan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

BenjaminBossan commented Jul 26, 2024 •

edited

Loading