You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I’m exploring the possibility of using DeepSeek R1 to generate Chain-of-Thought (CoT) data for Supervised Fine-Tuning (SFT). Could we consider adding support or providing guidance on how to leverage DeepSeek R1 for this purpose?
If this is already feasible, it would be great to have some documentation or examples to help users get started. If not, I’d love to discuss the potential challenges and explore whether this could be a feature worth implementing.