
Fix max allowed lengths for prompts larger than 4096 #74


Open
ani300 wants to merge 6 commits into main

Conversation

ani300 (Contributor) commented Jul 3, 2025

This takes care of an oversight that prevented prompts longer than 4096 tokens from being run with generate().
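
For context, a minimal sketch of the failure mode, using hypothetical names rather than this repo's actual API: with a fixed max_seq_len default of 4096, the sequence budget cannot hold a longer prompt plus its new tokens, so the budget has to be derived from the actual inputs.

# Minimal sketch; required_seq_len is an illustrative helper, not this repo's API.
import torch

def required_seq_len(input_ids: torch.Tensor, max_new_tokens: int) -> int:
    # Total positions the sequence budget must cover: prompt length + generated tokens.
    return input_ids.shape[1] + max_new_tokens

prompt = torch.zeros(1, 6000, dtype=torch.long)  # a prompt longer than 4096 tokens
print(required_seq_len(prompt, max_new_tokens=128))  # 6128 > the old 4096 default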

ani300 added 2 commits July 3, 2025 20:55
Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>
Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>
ani300 requested a review from JRosenkranz July 3, 2025 21:02
@@ -27,6 +27,7 @@ def adjust_inputs_to_batch(input_ids: torch.Tensor, **extra_kwargs):
 def generate(
     model: Union[Callable, torch.nn.Module],
     input_ids: torch.Tensor,
+    max_seq_len: int = 4096,
Contributor
Do we anticipate this getting used, or is it just a placeholder to grab the max_seq_len param if passed to fms generate?

Contributor Author

Yes, this is to match the signature between the two generate() calls and avoid errors.

Contributor

We may just want this to be part of the attention_specific_kwargs in the inference and warmup.

Contributor Author

done
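
For reference, a sketch of the suggested shape of the change, assuming generate() accepts **extra_kwargs and that attention_specific_kwargs is a plain dict the caller forwards; the actual plumbing in this repo may differ.

# Illustrative stand-in for paged generate(); only the kwargs plumbing matters here.
import torch

def generate(model, input_ids, max_new_tokens=8, **extra_kwargs):
    # max_seq_len arrives via **extra_kwargs, so the signature never changes.
    max_seq_len = extra_kwargs.get("max_seq_len", 4096)
    assert input_ids.shape[1] + max_new_tokens <= max_seq_len
    return max_seq_len

input_ids = torch.zeros(1, 6000, dtype=torch.long)
attention_specific_kwargs = {
    "max_seq_len": input_ids.shape[1] + 128,  # prompt + new tokens, not the 4096 default
}
print(generate(None, input_ids, max_new_tokens=128, **attention_specific_kwargs))  # 6128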

@@ -53,6 +53,7 @@ def warmup_model(
     generate(
         model,
         _warmup_input_ids,
+        max_seq_len=_warmup_input_ids.shape[1] + max_new_tokens,
Contributor

Could we just add this to the attention_specific_kwargs in line 32? That way we don't need to change the signature of paged generate.

Contributor Author

done
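
The warmup call then follows the same pattern; a hedged sketch of the wiring (the real warmup_model in this repo is more involved):

# Hypothetical wiring for the warmup path; the budget is sized from the warmup
# inputs rather than the fixed 4096 default, mirroring the inference path.
import torch

_warmup_input_ids = torch.zeros(1, 5000, dtype=torch.long)
max_new_tokens = 96
attention_specific_kwargs = {
    "max_seq_len": _warmup_input_ids.shape[1] + max_new_tokens,
}
print(attention_specific_kwargs["max_seq_len"])  # 5096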

ani300 added 3 commits July 11, 2025 22:15
Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>
Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>
Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>