π Major Release: OpenAI Responses API Integration
β¨ New Features
π OpenAI Responses API Integration
- Full support for 2025 Responses API with gpt-4o + image_generation tool
- Native multimodal understanding with better context awareness
- Enhanced prompt following and superior text rendering in images
- Real-time streaming with partial previews via Responses API
π Dual API Architecture
- Seamless API switching via
API_MODEenvironment variable - Default to Responses API (recommended) with fallback to Images API
- Backward compatibility with existing Images API implementations
- Unified interface supporting both legacy and modern workflows
πΌοΈ Enhanced Image Editing
- Multi-image input support with proper message formatting
- Context-aware editing with conversation history integration
- Previous response ID linking for multi-turn editing sessions
- Advanced mask support for precise inpainting operations
π§ͺ Comprehensive Test Coverage
- 99+ tests with full API coverage for both implementations
- Complete error handling and edge case coverage
- Real API validation with integration testing
- Mock-based testing with proper dependency injection
π§ Technical Improvements
βοΈ Enhanced Configuration System
API_MODE: Switch between 'responses' (default) and 'images'RESPONSES_MODEL: Model selection for Responses API (default: gpt-4o)- Flexible model switching between dedicated and integrated approaches
ποΈ SOLID Architecture
- Complete refactoring following dependency injection patterns
- Single Responsibility principle with focused services
- Interface-based design for maximum extensibility
- Test-driven development with comprehensive coverage
π Updated Documentation
π Comprehensive Guides
- Dual API documentation with feature comparison table
- Enhanced environment variables documentation
- Updated streaming examples for both API modes
- API mode selection guidance and best practices
π API Comparison
| Feature | Responses API (gpt-4o) | Images API (gpt-image-1) |
|---|---|---|
| Latest Technology | β 2025 Responses API | |
| Text in Images | β Superior | β Good |
| Context Awareness | β Excellent | |
| Streaming | β Partial previews | |
| Multi-turn | β Full support |
π§ Migration Guide
For New Users
- Default configuration uses Responses API automatically
- No action required - just set
OPENAI_API_KEY
For Existing Users
- Backward compatible - existing configurations continue to work
- Optional upgrade: Set
API_MODE=responsesto use latest features - Gradual migration supported with dual-mode architecture
π¦ Installation
# Clone and install
git clone https://github.com/pavelsukhachev/mcp-server-gpt-image.git
cd mcp-server-gpt-image
npm install
# Configure (uses Responses API by default)
echo "OPENAI_API_KEY=your-api-key-here" > .env
# Run
npm run start:httpπ What's Next
This release establishes MCP Server GPT Image as the most advanced OpenAI image generation server available, supporting both cutting-edge Responses API and legacy Images API with seamless switching.
Perfect for:
- π¨ Advanced image generation workflows
- π Multi-turn conversation contexts
- β‘ Real-time streaming applications
- π§ͺ Development and production environments
Full Changelog: https://github.com/pavelsukhachev/mcp-server-gpt-image/blob/main/CHANGELOG.md