Change the repository type filter
All
Repositories list
31 repositories
- GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
- CogView4, CogView3-Plus and CogView3(ECCV 2024)
- GPT4V-level open-source multi-modal model based on Llama3-8B
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
ComplexFuncBench
Public- Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
- CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
- CodeGeeX2: A More Powerful Multilingual Code Generation Model
CogCoM
Public- ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
- a state-of-the-art-level open visual language model | 多模态预训练模型
- The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]
- Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".