You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello! I truly appreciate the blogs you've posted about LLMs. I've gained a great deal from them. I have a question to ask you. When reading the blog "2024 - 07 - 29_Finetune_Llama31.html", I noticed that when you processed the dataset, you only converted the data into the chat_template format, but there was no process of processing labels. In this case, will the LLM model also learn sentences like <im_start>..<im_end>? And will it reduce the learning accuracy?
The text was updated successfully, but these errors were encountered:
Hey, thanks! Yes, you want the model to learn the chat template, that's one of the main goals of SFT. The SFTTrainer handles these special tokens appropriately with the dataset_text_field parameter.
Hello! I truly appreciate the blogs you've posted about LLMs. I've gained a great deal from them. I have a question to ask you. When reading the blog "2024 - 07 - 29_Finetune_Llama31.html", I noticed that when you processed the dataset, you only converted the data into the chat_template format, but there was no process of processing labels. In this case, will the LLM model also learn sentences like <im_start>..<im_end>? And will it reduce the learning accuracy?
The text was updated successfully, but these errors were encountered: