Packed tokenizer #1473


Merged
@AngledLuffa merged 1 commit into dev from packed_tokenizer on Apr 12, 2025
Conversation

@AngledLuffa (Collaborator) commented on Apr 5, 2025

Use a PackedSequence in the tokenizer. Somehow it's substantially slower...

However, it would address the tokenization not being consistent across batch sizes:

#1472
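
For context, here is a minimal sketch of the packing pattern this PR refers to. The shapes, the LSTM layer, and all variable names are illustrative placeholders, not the tokenizer's actual code:

```python
import torch
from torch.nn.utils.rnn import pack_padded_sequence, pad_packed_sequence

# Illustrative shapes only; the tokenizer's real model differs.
batch = torch.randn(4, 10, 32)          # (batch, max_len, emb_dim), zero-padded
lengths = torch.tensor([10, 7, 5, 3])   # true lengths, already descending

lstm = torch.nn.LSTM(input_size=32, hidden_size=64, batch_first=True)

# Packing makes the LSTM skip the padded positions entirely, so the
# hidden states no longer depend on how much padding the batch carries.
packed = pack_padded_sequence(batch, lengths, batch_first=True)
packed_out, _ = lstm(packed)

# Unpack back to a padded tensor for downstream layers.
out, out_lengths = pad_packed_sequence(packed_out, batch_first=True)
print(out.shape)  # torch.Size([4, 10, 64])
```

Without packing, the recurrent states at padded positions can leak into the outputs, which is how batch size ends up influencing the tokenization.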

@AngledLuffa merged commit 4433e83 into dev on Apr 12, 2025
1 check passed
@AngledLuffa deleted the packed_tokenizer branch on Apr 12, 2025 at 00:08
Sorting in the other direction means we don't need to use enforce_sorted=False.

Things are faster without the packed sequences, unfortunately, but the results wind up unstable:

#1472
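
As a sketch of what the sorting change refers to (again with placeholder names and shapes, not code taken from this PR): when lengths arrive in arbitrary order, `pack_padded_sequence` can reorder them internally with `enforce_sorted=False`, or the caller can sort descending up front so the default `enforce_sorted=True` applies.

```python
import torch
from torch.nn.utils.rnn import pack_padded_sequence

batch = torch.randn(4, 10, 32)         # (batch, max_len, emb_dim), zero-padded
lengths = torch.tensor([5, 9, 3, 7])   # arbitrary order, as batches arrive

# Option A: let pack_padded_sequence reorder internally on every call.
packed_a = pack_padded_sequence(batch, lengths, batch_first=True,
                                enforce_sorted=False)

# Option B: sort descending up front, so the default enforce_sorted=True
# applies and no internal reordering is needed.
order = torch.argsort(lengths, descending=True)
packed_b = pack_padded_sequence(batch[order], lengths[order], batch_first=True)

# After running the model, invert the permutation to restore batch order.
restore = torch.argsort(order)
```

Sorting once and inverting the permutation afterwards keeps the packing call on the fast path, which is presumably the point of the commit title.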