Project: MULTIGPT - Multi-Model RAG with document, image and Audio integration, Python, JavaScript/TypeScript #141

Mokshu3242 · 2025-04-19T17:42:51Z

Project Name

MULTIGPT

Description

MultiGPT - AI Agent with Multi-Modal Capabilities 🚀

An advanced AI agent capable of processing text, audio, images, and documents with visualization support. Built with FastAPI, Cloudflare AI, ElevenLabs, and LangChain.

🌟 Features

1. Core Capabilities

- Conversational AI with persistent chat history
- Multi-language support (English, Hindi, Marathi)
- JWT authentication and self-hosted OTP verification
- Document, image, and audio processing
- Rate-limited API endpoints
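
The post does not say how the endpoints are rate limited. As one possible illustration, here is a minimal sliding-window limiter using only the standard library. The class name `SlidingWindowLimiter` and its parameters `max_requests` and `window_seconds` are hypothetical and not taken from the project.

```python
import time
from collections import defaultdict, deque
from typing import Optional


class SlidingWindowLimiter:
    """Allow at most max_requests per client within window_seconds (hypothetical sketch)."""

    def __init__(self, max_requests: int, window_seconds: float):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        # client id -> timestamps of that client's recent accepted requests
        self._hits = defaultdict(deque)

    def allow(self, client_id: str, now: Optional[float] = None) -> bool:
        """Return True and record the request if the client is under its limit."""
        now = time.monotonic() if now is None else now
        hits = self._hits[client_id]
        # Drop timestamps that have fallen out of the window.
        while hits and now - hits[0] >= self.window_seconds:
            hits.popleft()
        if len(hits) >= self.max_requests:
            return False
        hits.append(now)
        return True
```

In a web backend, `client_id` would typically be a user ID or IP address, and a rejected call would map to an HTTP 429 response. The `now` parameter exists only to make the limiter easy to test.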

2. Input Processing

| Type | Endpoint | Technologies Used |
| --- | --- | --- |
| Text | `/chat` | Cloudflare LLM |
| Voice | `/voice` | ElevenLabs TTS + Whisper |
| Audio | `/audio` | Whisper transcription |
| Images | `/handle_image` | CLIP image analysis |
| Documents | `/upload_doc` | PyPDFium2, docx2txt, msoffcrypto |
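
To show how a client might choose among the upload endpoints in the table, here is a small sketch that routes a file by its extension. The endpoint paths come from the table; the extension lists and the function name `pick_endpoint` are assumptions, since the post does not state which file types each endpoint accepts.

```python
from pathlib import Path

# Endpoint paths are from the table above; the extensions are assumed for illustration.
ENDPOINT_BY_EXTENSION = {
    ".mp3": "/audio",
    ".wav": "/audio",
    ".png": "/handle_image",
    ".jpg": "/handle_image",
    ".jpeg": "/handle_image",
    ".pdf": "/upload_doc",
    ".docx": "/upload_doc",
}


def pick_endpoint(filename: str) -> str:
    """Return the upload endpoint for a file name, or raise ValueError if unsupported."""
    suffix = Path(filename).suffix.lower()
    try:
        return ENDPOINT_BY_EXTENSION[suffix]
    except KeyError:
        raise ValueError(f"unsupported file type: {suffix or filename!r}") from None
```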

3. Advanced Functions

- YouTube transcript extraction
- Data visualization (bar/line/pie charts)
- Auto-expiring file storage (2-day TTL)
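
The 2-day TTL could be enforced in several ways, and the post does not say which one the project uses. Below is a minimal sketch that assumes upload times are tracked in a plain `{name: uploaded_at}` mapping. The names `FILE_TTL`, `is_expired`, and `purge_expired` are hypothetical.

```python
from datetime import datetime, timedelta, timezone
from typing import Dict, List, Optional

FILE_TTL = timedelta(days=2)  # the 2-day TTL stated in the feature list


def is_expired(uploaded_at: datetime, now: Optional[datetime] = None) -> bool:
    """Return True once a file has been stored for at least FILE_TTL."""
    now = now or datetime.now(timezone.utc)
    return now - uploaded_at >= FILE_TTL


def purge_expired(files: Dict[str, datetime], now: Optional[datetime] = None) -> List[str]:
    """Delete expired entries from the mapping in place and return their names."""
    expired = [name for name, uploaded_at in files.items() if is_expired(uploaded_at, now)]
    for name in expired:
        del files[name]
    return expired
```

Since the stack lists MongoDB, a TTL index (`expireAfterSeconds`) on the upload timestamp is another plausible mechanism that would expire records without a purge job.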

🛠️ Tech Stack

Frontend: React.js
Backend: FastAPI
AI Services:
- Cloudflare (LLaMA-2, Whisper, CLIP)
- ElevenLabs (Text-to-Speech)
Database: MongoDB
Data Processing:
- LangChain (Document chunking)
- Pandas/Plotly (Visualizations)
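
Document chunking, which the stack delegates to LangChain, splits extracted text into overlapping pieces so that each piece fits a model's context and passages that straddle a boundary are not lost. The sketch below is a plain-Python stand-in for that idea, not LangChain's API. The function name `chunk_text` and the default sizes are assumptions.

```python
from typing import List


def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> List[str]:
    """Split text into chunks of up to chunk_size characters, each sharing
    `overlap` characters with the previous chunk."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("require chunk_size > 0 and 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        # Stop once this chunk reaches the end, so no redundant tail chunk is emitted.
        if start + chunk_size >= len(text):
            break
        start += step
    return chunks
```

For example, `chunk_text("abcdefghij", chunk_size=4, overlap=2)` returns `["abcd", "cdef", "efgh", "ghij"]`.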

Language & Framework

Team Members

Mokshu3242, bhavya681, Sudeep10

Registration Check

Each of my team members has filled out the registration form

multispark changed the title ~~MULTIGPT: Multi-Model RAG with document, image and Audio integration~~ Project: MULTIGPT - Multi-Model RAG with document, image and Audio integration, Python, JavaScript/TypeScript Apr 25, 2025

cole-g-johnson added JavaScript/TypeScript Python labels May 2, 2025
