Ollama: fix handling of incomplete JSON chunks in stream #1019
Addresses one of the issues raised in #686
Problem
The Ollama LLM can return a single JSON chunk split across multiple parts when the chunk is too long. For example, a response object like `{"response": "Hello"}` may arrive as `{"response": "Hel` in one chunk and `lo"}` in the next, causing a JSON parsing error if each part is parsed on its own.
Solution
Improve the `json_responses_chunk_handler` method to handle incomplete JSON chunks in the stream. If a chunk does not end with `}`, it is considered incomplete and buffered until the next chunk arrives. This prevents JSON parsing errors and ensures all responses are processed correctly.

This PR is heavily inspired by #995 by @berkcaputcu; this implementation doesn't rely on exceptions and also includes additional specs.
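Below is a minimal sketch of the buffering strategy described above. The handler name matches the method mentioned in this PR, but the surrounding structure (the proc interface and the block-based callback) is an assumption for illustration, not the exact code from the diff.

```ruby
require "json"

# Sketch: buffer incomplete JSON chunks until a closing brace arrives.
# The proc signature (chunk, bytes) is assumed for illustration.
def json_responses_chunk_handler(&block)
  buffer = +"" # carries any partial JSON between chunks

  proc do |chunk, _bytes|
    buffer << chunk

    # If the accumulated payload does not end with "}", it is incomplete:
    # keep buffering until a later chunk delivers the closing brace.
    next unless buffer.rstrip.end_with?("}")

    # Ollama streams newline-delimited JSON, so each complete line is
    # parsed and handed to the caller's block.
    buffer.each_line do |line|
      json = line.strip
      block.call(JSON.parse(json)) unless json.empty?
    end
    buffer.clear
  end
end

# Usage: the first chunk is buffered; the second completes the object.
handler = json_responses_chunk_handler { |json| puts json["response"] }
handler.call(%({"response": "Hel), nil)
handler.call(%(lo"}\n), nil) # => prints "Hello"
```

Because incompleteness is detected by inspecting the chunk rather than rescuing `JSON::ParserError`, no exceptions are raised on the happy path, which matches the difference from #995 noted above.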