enable and run contextual rag (search untested) #2793
Conversation
PR Summary
This PR introduces a new Contextual RAG feature across the Danswer application, affecting multiple components:
- Added enable_contextual_rag flag to SearchSettings model and database migration
- Implemented context generation for chunks using LLM in chunker.py
- Modified embedding process to include document summary and chunk context
- Updated UI components to display and configure Contextual RAG settings
- Added max_tokens parameter to LLM interfaces for output control
- Introduced new fields (doc_summary, chunk_context) in various models and Vespa indexing
Key concerns:
- Feature is entirely untested, posing significant risks to system stability
- Potential for increased computational costs without verified benefits
- Lack of proper error handling and edge case considerations
- Incomplete implementation of environment variable controls
- Absence of performance impact assessment on existing functionalities
25 file(s) reviewed, 28 comment(s)
ENABLE_MULTIPASS_INDEXING = (
    os.environ.get("ENABLE_MULTIPASS_INDEXING", "").lower() == "true"
)
ENABLE_CONTEXTUAL_RAG = os.environ.get("ENABLE_CONTEXTUAL_RAG", "").lower() == "true"
logic: This new feature is currently untested. Implement and run tests before merging.
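One possible shape for such a test, sketched here for illustration only (the config module path and the use of importlib.reload to re-evaluate the module-level flag are assumptions, not code from this PR):

import importlib

import danswer.configs.app_configs as app_configs  # assumed location of the flag


def test_enable_contextual_rag_flag(monkeypatch):
    # "true" (any casing) should enable the flag
    monkeypatch.setenv("ENABLE_CONTEXTUAL_RAG", "True")
    importlib.reload(app_configs)
    assert app_configs.ENABLE_CONTEXTUAL_RAG is True

    # an unset variable should leave the flag disabled
    monkeypatch.delenv("ENABLE_CONTEXTUAL_RAG", raising=False)
    importlib.reload(app_configs)
    assert app_configs.ENABLE_CONTEXTUAL_RAG is False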
ENABLE_MULTIPASS_INDEXING = (
    os.environ.get("ENABLE_MULTIPASS_INDEXING", "").lower() == "true"
)
ENABLE_CONTEXTUAL_RAG = os.environ.get("ENABLE_CONTEXTUAL_RAG", "").lower() == "true"
style: Consider adding a comment explaining what Contextual RAG is and its implications.
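For illustration, a comment along these lines could work (the wording below is illustrative, not taken from the PR):

# Contextual RAG: when enabled, an LLM generates a short document summary and a
# per-chunk context that are prepended to each chunk before embedding/indexing.
# This can improve retrieval relevance at the cost of extra LLM calls at index time.
ENABLE_CONTEXTUAL_RAG = os.environ.get("ENABLE_CONTEXTUAL_RAG", "").lower() == "true"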
ENABLE_MULTIPASS_INDEXING = (
    os.environ.get("ENABLE_MULTIPASS_INDEXING", "").lower() == "true"
)
ENABLE_CONTEXTUAL_RAG = os.environ.get("ENABLE_CONTEXTUAL_RAG", "").lower() == "true"
logic: The PR description mentions that the environment variable code path might not work. Verify and fix this issue before merging.
def get_content(self) -> str:
    return " ".join([section.text for section in self.sections])
style: Consider adding a docstring to explain the purpose and usage of this method
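A minimal docstring in the spirit of the suggestion (wording is illustrative):

def get_content(self) -> str:
    """Return the full document text by joining the text of every section with spaces."""
    return " ".join([section.text for section in self.sections])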
content=fields[CONTENT],  # Includes extra title prefix and metadata suffix;
# also sometimes context for contextual rag
source_links=source_links_dict or {0: ""},
logic: The comment suggests that the content field now includes context for Contextual RAG. This change might affect existing functionality that relies on the content field. Verify that this doesn't break any existing features.
prompt: LanguageModelInput,
tools: list[dict] | None = None,
tool_choice: ToolChoiceOptions | None = None,
max_tokens: int | None = None,
style: Add documentation for the max_tokens parameter to explain its purpose and usage.
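One way the documentation could look, assuming these arguments belong to an invoke-style method on the LLM interface (the signature framing and docstring wording are illustrative, not from the PR):

def invoke(
    self,
    prompt: LanguageModelInput,
    tools: list[dict] | None = None,
    tool_choice: ToolChoiceOptions | None = None,
    max_tokens: int | None = None,
):
    """Run the model on the given prompt.

    max_tokens: optional hard cap on the number of output tokens for this call;
        None defers to the model/provider default. Useful for keeping generated
        chunk contexts short when building Contextual RAG chunks.
    """
    ...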
def _remove_contextual_rag(chunk: InferenceChunkUncleaned) -> str:
    # remove document summary
    if chunk.content.startswith(chunk.doc_summary):
        return chunk.content[len(chunk.doc_summary) :].lstrip()
    # remove chunk context
    if chunk.content.endswith(chunk.chunk_context):
        return chunk.content[: -len(chunk.chunk_context)].rstrip()
logic: This function modifies chunk content without any safeguards. Consider adding checks to ensure the content is not empty after removal.
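A sketch of the kind of safeguard the reviewer means (not the PR's code; same field names as above):

def _remove_contextual_rag(chunk: InferenceChunkUncleaned) -> str:
    # strip the document summary, but never return an empty string
    if chunk.doc_summary and chunk.content.startswith(chunk.doc_summary):
        stripped = chunk.content[len(chunk.doc_summary) :].lstrip()
        if stripped:
            return stripped
    # strip the chunk context, with the same guard
    if chunk.chunk_context and chunk.content.endswith(chunk.chunk_context):
        stripped = chunk.content[: -len(chunk.chunk_context)].rstrip()
        if stripped:
            return stripped
    return chunk.content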
for chunk in chunks:
    chunk.content = _remove_title(chunk)
    chunk.content = _remove_metadata_suffix(chunk)
    chunk.content = _remove_contextual_rag(chunk)
logic: The new _remove_contextual_rag function is called here, but its effects are untested. This could lead to unexpected behavior in the search pipeline.
def _remove_contextual_rag(chunk: InferenceChunkUncleaned) -> str:
    # remove document summary
    if chunk.content.startswith(chunk.doc_summary):
        return chunk.content[len(chunk.doc_summary) :].lstrip()
    # remove chunk context
    if chunk.content.endswith(chunk.chunk_context):
        return chunk.content[: -len(chunk.chunk_context)].rstrip()
logic: The function assumes that doc_summary and chunk_context are always at the start and end of the content respectively. This might not always be true and could lead to incorrect content removal.
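One way to drop the early returns and handle a chunk that carries both (or neither) of the fields, sketched under the same assumptions:

def _remove_contextual_rag(chunk: InferenceChunkUncleaned) -> str:
    content = chunk.content
    # strip the summary prefix and the context suffix independently; if either
    # string is empty or not in the expected position, the content is left alone
    if chunk.doc_summary and content.startswith(chunk.doc_summary):
        content = content[len(chunk.doc_summary) :].lstrip()
    if chunk.chunk_context and content.endswith(chunk.chunk_context):
        content = content[: -len(chunk.chunk_context)].rstrip()
    return content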
metadata_suffix_keyword="", | ||
mini_chunk_texts=None, | ||
large_chunk_reference_ids=[], | ||
chunk_context="Test chunk context", |
logic: New 'chunk_context' parameter added, but not used in assertions
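A hypothetical assertion that would exercise the new field (assuming the constructed chunk in the test is bound to chunk and the attribute mirrors the constructor argument):

# verify the new field survives chunk construction
assert chunk.chunk_context == "Test chunk context"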
Description
Implements Contextual RAG across chunk sizes, toggled by a UI flag or an environment variable (in theory; I think the env var code path currently doesn't work for either this or multipass indexing).
How Has This Been Tested?
Indexing has been manually tested; search and retrieval have not. Unit tests are in progress.
Accepted Risk
The main risk is that this option adds significant cost for currently unverified benefits.
It could be fine to call it a "beta test" feature and make it available to gather community feedback, but before that I need to verify that it doesn't break the backend.
Related Issue(s)
[If applicable, link to the issue(s) this PR addresses]
Checklist: