LLMs Long Context Benchmark Visualization

A visualization website for comparing the performance of various LLMs across different context window sizes based on the Fiction.LiveBench benchmark.

Screenshot: LLMs Long Context Benchmark visualization

Data Source

All data comes from Fiction.LiveBench for Long Context Deep Comprehension (April 6, 2025). The benchmark data is located in src/data/benchmark.ts.

Fiction.LiveBench is a benchmark specifically designed to measure LLMs' deep comprehension abilities across long contexts. Unlike many other benchmarks that test retrieval or basic understanding, Fiction.LiveBench evaluates how well models can:

  • Maintain coherent understanding of complex narratives
  • Track characters, events, and plot developments across very long contexts
  • Accurately answer questions that require deep comprehension of the entire context

The benchmark presents models with progressively longer fictional texts and measures how accurately they answer detailed questions about the content.
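
The file src/data/benchmark.ts holds the scores the site renders. Its exact exports are defined in the repository; the sketch below is only an assumed shape (the names BenchmarkEntry, benchmarkData, and rankAtContext, as well as the sample numbers, are illustrative rather than the file's real contents), showing how per-model scores keyed by context length could be typed and ranked.

// Hypothetical shape for the visualization data; see src/data/benchmark.ts
// in the repository for the actual structure.
export interface BenchmarkEntry {
  model: string;                          // model name, e.g. "example-model-a"
  scores: Record<string, number | null>;  // context size -> accuracy %, null if not tested
}

export const benchmarkData: BenchmarkEntry[] = [
  { model: "example-model-a", scores: { "1k": 92, "16k": 85, "120k": 61 } },
  { model: "example-model-b", scores: { "1k": 88, "16k": 70, "120k": null } },
];

// Rank models by score at a given context size, skipping untested entries.
export function rankAtContext(entries: BenchmarkEntry[], context: string): BenchmarkEntry[] {
  return [...entries]
    .filter((e) => e.scores[context] != null)
    .sort((a, b) => (b.scores[context] ?? 0) - (a.scores[context] ?? 0));
}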

Development

# Install dependencies
bun install

# Run development server
bun dev

# Build for production
bun run build

Author

Created by @leodoan_
