Skip to content

Support Generating CoT SFT Data Using DeepSeek R1 #94

@randydl

Description

@randydl

Hi,

I’m exploring the possibility of using DeepSeek R1 to generate Chain-of-Thought (CoT) data for Supervised Fine-Tuning (SFT). Could we consider adding support or providing guidance on how to leverage DeepSeek R1 for this purpose?

If this is already feasible, it would be great to have some documentation or examples to help users get started. If not, I’d love to discuss the potential challenges and explore whether this could be a feature worth implementing.

Looking forward to your thoughts!

Best regards

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions