Steve's tokenizer changes #3297

Open
wants to merge 3 commits into
base: main

dkalinowski
Collaborator

🛠 Summary

Taken from PR #3274

stevegrubb and others added 3 commits May 5, 2025 15:32
I was working through a demo where it said to run
"pip install -U optimum[diffusers,openvino]".
When I did that, I noticed that it downloaded:
openvino_tokenizers-2025.1.0.0-py3-none-manylinux2014_x86_64.whl.
However, the tokenizer is built and installed in the image
by the Dockerfile. After more digging, I found that openvino's
python module is not installed in the image. This patch installs
the openvino-tokenizer python module in the image and uses a
symlink to point to the tokenizer shared object to save space.
Only create the convert_tokenizer if python is enabled
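
The space-saving idea in the first commit message (keep one copy of the tokenizer shared object and point the Python package at it) could look roughly like the sketch below. This is not the code from the patch: the helper name `replace_with_symlink` and every path in the usage note are assumptions made for illustration.

```shell
#!/bin/sh
# Hypothetical sketch, not taken from the actual Dockerfile change.
set -eu

# replace_with_symlink DUPLICATE ORIGINAL
# Replaces DUPLICATE (a second copy of a shared object, e.g. the one
# shipped inside a Python wheel) with a symlink to ORIGINAL (the copy
# already built and installed in the image).
replace_with_symlink() {
    duplicate=$1
    original=$2
    # Refuse to create a dangling link.
    if [ ! -f "$original" ]; then
        echo "missing original: $original" >&2
        return 1
    fi
    # -s: symbolic link, -f: replace the existing file at the link path.
    ln -sf "$original" "$duplicate"
}
```

In a Dockerfile `RUN` step this might be called as `replace_with_symlink "$SITE_PACKAGES/openvino_tokenizers/lib/libopenvino_tokenizers.so" /opt/intel/openvino/runtime/lib/intel64/libopenvino_tokenizers.so`, where `SITE_PACKAGES` and the library file name are assumed; only the `/opt/intel/openvino/runtime/lib/intel64/` directory appears in the diff below.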
@dkalinowski dkalinowski requested review from dtrawins, mzegla and michalkulakowski and removed request for michalkulakowski May 19, 2025 12:23
@@ -220,6 +220,17 @@ RUN git clone https://github.com/openvinotoolkit/openvino_tokenizers.git /openvi
WORKDIR /openvino_tokenizers/build
RUN cmake .. -DCMAKE_BUILD_TYPE=Release -DCMAKE_VERBOSE_MAKEFILE="${VERBOSE_LOGS}" && cmake --build . --parallel $JOBS ; cp /openvino_tokenizers/build/src/lib*.so /opt/intel/openvino/runtime/lib/intel64/

WORKDIR /openvino_tokenizers/
Collaborator

Shall we also add it to the Ubuntu Dockerfile?

3 participants