fix drive slowness #4668

evan-onyx · 2025-05-07T03:04:10Z

Description

Fixes https://linear.app/danswer/issue/DAN-1945/make-drive-fast-again

We were seeing a lot of slowness with the drive connector, with occasional hangs that completely interrupted indexing. We believe this was due to many duplicate API calls, in some cases leading to some silent rate limiting from the google apis. We were running 50 generator threads in parallel to get 16 documents, returning a checkpoint, then entering with that checkpoint and starting 50 more generators... etc. Now we tie the number of threads to the number of user emails we process per checkpoint, and finish those users before returning a checkpoint. It remains to be seen whether this will work well across the board, but it will certainly greatly improve API call efficiency (and therefore speed).

How Has This Been Tested?

Tested manually

Backporting (check the box to trigger backport action)

Note: You have to check that the action passes, otherwise resolve the conflicts manually and tag the patches.

This PR should be backported (make sure to check that the backport attempt succeeds)
[Optional] Override Linear Check

vercel · 2025-05-07T03:04:14Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
internal-search	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	May 7, 2025 9:53pm

greptile-apps

PR Summary

This PR optimizes the Google Drive connector's performance by reducing API call overhead and improving thread management to prevent rate limiting and hangs during document indexing.

Reduced BATCHES_PER_CHECKPOINT from 10 to 1 in /backend/onyx/connectors/google_drive/connector.py to minimize redundant API calls
Added timeout and RefreshError handling to prevent connector hangs
Optimized thread management by matching thread count to active user emails instead of fixed 50 threads
Added per-user drive ID tracking to prevent duplicate processing
Improved logging for better debugging and monitoring of API interactions

_{1 file(s) reviewed, no comment(s)}
_{Edit PR Review Bot Settings | Greptile}

* fix slowness * no more silent failing for users * nits * no silly info transfer

evan-onyx requested a review from a team as a code owner May 7, 2025 03:04

greptile-apps bot reviewed May 7, 2025

View reviewed changes

vercel bot deployed to Preview May 7, 2025 16:30 View deployment

evan-onyx enabled auto-merge May 7, 2025 17:44

evan-onyx added 3 commits May 7, 2025 12:58

fix slowness

6baec6e

no more silent failing for users

14522be

nits

6827dc3

evan-onyx force-pushed the perf/drive-checkpoint-speedup branch from d8c10ca to 6827dc3 Compare May 7, 2025 20:11

vercel bot deployed to Preview May 7, 2025 20:13 View deployment

no silly info transfer

1ae822c

vercel bot deployed to Preview May 7, 2025 21:53 View deployment

Weves approved these changes May 7, 2025

View reviewed changes

evan-onyx added this pull request to the merge queue May 7, 2025

Merged via the queue into main with commit 0eab6ab May 7, 2025
11 checks passed

evan-onyx deleted the perf/drive-checkpoint-speedup branch May 7, 2025 23:45

ferdinandl007 pushed a commit to ferdinandl007/onyx that referenced this pull request May 8, 2025

fix drive slowness (onyx-dot-app#4668)

874bcd2

* fix slowness * no more silent failing for users * nits * no silly info transfer

ferdinandl007 pushed a commit to ferdinandl007/onyx that referenced this pull request May 9, 2025

fix drive slowness (onyx-dot-app#4668)

5ec8916

* fix slowness * no more silent failing for users * nits * no silly info transfer

ZhipengHe pushed a commit to ZhipengHe/onyx that referenced this pull request Jun 6, 2025

fix drive slowness (onyx-dot-app#4668)

ee0e450

* fix slowness * no more silent failing for users * nits * no silly info transfer

AnkitTukatek pushed a commit to TukaTek/onyx that referenced this pull request Sep 23, 2025

fix drive slowness (onyx-dot-app#4668)

1104ab3

* fix slowness * no more silent failing for users * nits * no silly info transfer

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix drive slowness #4668

fix drive slowness #4668

Uh oh!

evan-onyx commented May 7, 2025 •

edited

Loading

Uh oh!

vercel bot commented May 7, 2025 •

edited

Loading

Uh oh!

greptile-apps bot left a comment

Uh oh!

Uh oh!

Uh oh!

fix drive slowness #4668

fix drive slowness #4668

Uh oh!

Conversation

evan-onyx commented May 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

How Has This Been Tested?

Backporting (check the box to trigger backport action)

Uh oh!

vercel bot commented May 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

PR Summary

Uh oh!

Uh oh!

Uh oh!

evan-onyx commented May 7, 2025 •

edited

Loading

vercel bot commented May 7, 2025 •

edited

Loading