Visual dashboard to evaluate multi-agent & RAG-based AI apps. Compare models on accuracy, latency, token usage, and trust metrics - powered by NVIDIA AgentIQ
nvidia multi-agent-systems model-comparison production-ai rag streamlit trustworthy-ai llmops genai enterprise-ai llm-evaluation open-source-ai agent-evaluation agentiq pipeline-evaluation
-
Updated
Apr 10, 2025 - Python