Skip to content

Add WA for exporting bos/eos token ids in rt_info #3289

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

atobiszei
Copy link
Collaborator

@atobiszei atobiszei commented May 13, 2025

Issue was that for:
https://huggingface.co/cross-encoder/msmarco-MiniLM-L6-en-de-v1
in exported reranking model no bos/eos token ids were provided. Workaround is to get this information from transformers and ensure those are available in rt_info.

This only partially resolves:
#3272
As there is still missing support for token_type_ids:
https://github.com/openvinotoolkit/model_server/blob/main/src/rerank/rerank_calculator.cc#L318C90-L318C104

Those fields were not specified in exported .xml
@atobiszei atobiszei added the WIP Do not merge until resolved label May 13, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
WIP Do not merge until resolved
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant