Scalable data preprocessing and curation toolkit for LLMs
Updated Jun 4, 2026 - Python
Open source project for data preparation for GenAI applications
Manage Chrome bookmarks with this local-first extension to search, organize, backup, and clean your browser links using AI-enhanced syntax.
SpiralDB connectors for NVIDIA NeMo Curator