Skip to content

Conversation

oeway
Copy link
Contributor

@oeway oeway commented Sep 9, 2025

Implement optimized S3Vector and Zarr-based vector storage engines with comprehensive benchmarks.

This PR addresses significant performance bottlenecks in the existing S3-backed vector database, including slow search, sequential shard loading, and inefficient HNSW, aiming for competitive performance with pgvector at scale.


Open in Cursor Open in Web

cursoragent and others added 2 commits September 9, 2025 02:50
Co-authored-by: oeway007 <oeway007@gmail.com>
Integrates Zarr for chunked, compressed vector storage in S3Vector. This improves performance and reduces storage costs. Includes a new benchmark suite to compare Zarr with existing S3Vector implementations and PgVector.

Co-authored-by: oeway007 <oeway007@gmail.com>
Copy link

cursor bot commented Sep 9, 2025

Cursor Agent can help with this pull request. Just @cursor in comments and I'll start working on changes in this branch.
Learn more about Cursor Agents

@github-actions github-actions bot added the documentation Improvements or additions to documentation label Sep 9, 2025
cursoragent and others added 2 commits September 9, 2025 03:17
Replaces Parquet shards with Zarr for improved performance and cost savings.

Co-authored-by: oeway007 <oeway007@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants