Awesome Unified Multimodal Models
-
Updated
Aug 17, 2025
Awesome Unified Multimodal Models
A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.
UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation
Planning with unified multimodal models
A curated collection of research papers, models, and resources tracing the evolution from specialized models to unified world models.
A unified multimodal generative AI system designed to learn and adapt across multiple modalities (text, audio, vision, robotics) with minimal data and long-term autonomy through reinforcement learning.
Add a description, image, and links to the unified-multimodal-models topic page so that developers can more easily learn about it.
To associate your repository with the unified-multimodal-models topic, visit your repo's landing page and select "manage topics."