`tokenizer` is still being used in `Trainer` instead of `processing_class` #37734

arjunaskykok · 2025-04-24T06:54:53Z

On the "Fine-tuning a model with the Trainer API" page, whether in the documentation or the notebook, we are told to pass the `tokenizer` parameter when initializing the `Trainer` class.

But the `tokenizer` parameter has been deprecated. We should use the `processing_class` parameter instead.
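
For illustration, the fix amounts to renaming one keyword argument in the `Trainer(...)` call: `tokenizer=tokenizer` becomes `processing_class=tokenizer`. Below is a minimal sketch of that rename. `migrate_trainer_kwargs` is a hypothetical helper (it is not part of transformers), and it operates on a plain dict so it runs without the library installed; the commented usage assumes `model`, `training_args`, the datasets, and `tokenizer` are defined as in the course chapter.

```python
import warnings


def migrate_trainer_kwargs(kwargs):
    """Return a copy of `kwargs` with the deprecated `tokenizer` key
    renamed to `processing_class`. Hypothetical helper for illustration."""
    migrated = dict(kwargs)  # copy so the caller's dict is left untouched
    if "tokenizer" in migrated:
        if "processing_class" in migrated:
            # Refuse to guess which of the two values the caller meant.
            raise ValueError("Pass `processing_class` only, not `tokenizer` as well.")
        warnings.warn(
            "`tokenizer` is deprecated for Trainer; use `processing_class`.",
            FutureWarning,
            stacklevel=2,
        )
        migrated["processing_class"] = migrated.pop("tokenizer")
    return migrated


# Assumed usage with transformers installed (not executed here):
#
#   from transformers import Trainer
#   old_kwargs = dict(model=model, args=training_args,
#                     train_dataset=train_ds, eval_dataset=eval_ds,
#                     tokenizer=tokenizer)
#   trainer = Trainer(**migrate_trainer_kwargs(old_kwargs))
```

In the docs and notebook themselves no helper is needed; editing the keyword name in the `Trainer(...)` call directly is enough.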

@stevhliu

stevhliu · 2025-04-24T16:05:55Z

Good catch! Would you like to open a PR to update the docs and course to use the `processing_class` parameter instead?

arjunaskykok · 2025-04-24T17:03:37Z

Okay, I'll do it.

arjunaskykok · 2025-04-25T08:41:53Z

PRs:

huggingface/course#895
huggingface/notebooks#574

arjunaskykok mentioned this issue Apr 24, 2025

tokenizer is still being used in Trainer instead of processing_class huggingface/hub-docs#1711

Closed

arjunaskykok linked a pull request Apr 25, 2025 that will close this issue

replace deprecated tokenizer with processing_class in chapter 3 huggingface/notebooks#574

Open
