Skip to content

Conversation

rkuo-danswer
Copy link
Contributor

Description

Fixes https://linear.app/danswer/issue/DAN-1842/need-consistent-tool-to-generate-vespa-schemas-for-the-cloud

How Has This Been Tested?

[Describe the tests you ran to verify your changes]

Backporting (check the box to trigger backport action)

Note: You have to check that the action passes, otherwise resolve the conflicts manually and tag the patches.

  • This PR should be backported (make sure to check that the backport attempt succeeds)
  • [Optional] Override Linear Check

Copy link

vercel bot commented Apr 18, 2025

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Comments Updated (UTC)
internal-search ✅ Ready (Inspect) Visit Preview 💬 Add feedback Apr 18, 2025 4:17pm

@rkuo-danswer rkuo-danswer marked this pull request as ready for review April 18, 2025 00:27
@rkuo-danswer rkuo-danswer requested a review from a team as a code owner April 18, 2025 00:27
Copy link
Contributor

@greptile-apps greptile-apps bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

PR Summary

This PR introduces a new script for generating Vespa schema variations and refactors schema handling functions to improve maintainability and consistency in cloud deployments.

  • Added /backend/scripts/debugging/onyx_vespa_schemas.py to automate schema generation for different embedding models
  • Introduced _replace_tenant_template_value_in_schema in index.py for cleaner tenant ID handling
  • Changed decay_factor input type from float to double in danswer_chunk.sd for improved precision
  • Removed unused random_ rank profile from schema definition
  • Improved error handling and logging in schema generation process

3 file(s) reviewed, 1 comment(s)
Edit PR Review Bot Settings | Greptile

Copy link
Contributor

@evan-onyx evan-onyx left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

just a small question

@rkuo-danswer rkuo-danswer added this pull request to the merge queue Apr 18, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to no response for status checks Apr 18, 2025
@rkuo-danswer rkuo-danswer added this pull request to the merge queue Apr 18, 2025
Merged via the queue into main with commit e5e0944 Apr 18, 2025
10 of 11 checks passed
@rkuo-danswer rkuo-danswer deleted the feature/vespa-schema-generator branch April 18, 2025 21:44
aronszanto pushed a commit to aronszanto/onyx that referenced this pull request Apr 26, 2025
…4556)

* tool to generate vespa schema variations for our cloud

* extraneous assign

* float, not double

* back to double

---------

Co-authored-by: Richard Kuo (Onyx) <rkuo@onyx.app>
AnkitTukatek pushed a commit to TukaTek/onyx that referenced this pull request Sep 23, 2025
…4556)

* tool to generate vespa schema variations for our cloud

* extraneous assign

* float, not double

* back to double

---------

Co-authored-by: Richard Kuo (Onyx) <rkuo@onyx.app>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants