Skip to content

Conversation

jonigl
Copy link
Owner

@jonigl jonigl commented Jun 20, 2025

🚀 v0.2.0 - Refactor: streaming, thinking mode, and direct Ollama API integration

Overview

This PR delivers significant enhancements and architectural changes to the Ollama MCP Bridge, focusing on improved feature set, user experience, and transparency.


✨ What’s New

  • Direct Ollama API Integration:
    Migrated from the Ollama Python library to direct HTTP API calls using httpx, increasing transparency and flexibility.

  • Streaming Support:
    Incremental responses delivered to clients via FastAPI’s StreamingResponse.

  • Thinking Mode:
    Proxies intermediate “thinking” messages from Ollama and MCP tools.

  • Improved Health Check:
    Now checks Ollama service availability and improves error handling.

  • API Endpoint Alignment:
    /query has been replaced with /api/chat to precisely mirror Ollama’s REST API for seamless drop-in compatibility.
    MCP server tools are added transparently—they are invoked only when the model requests them.

  • Documentation & Testing:

    • README intro and usage updated for clarity and new architecture.
    • Updated and fixed tests; manual test instructions improved.
    • Swagger UI request example added.

jonigl added 5 commits June 20, 2025 17:03
…a python library. /query endpoint was changed to /api/chat to be as transparent as possible. Adding swagger ui request example. Updating tests. Improving status error. Checking if ollama is running. Adding utils.
@jonigl jonigl merged commit e447a68 into main Jun 20, 2025
4 checks passed
@jonigl jonigl deleted the feat/refactor branch June 20, 2025 15:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant