Remove hardcoded image extraction flag for PDFs #4970

emerzon · 2025-06-30T16:57:56Z

Description

PDFs currently always have their images extracted. This will make use of the "Enable Image Extraction and Analysis" workspace configuration instead.

vercel · 2025-06-30T16:58:00Z

@emerzon is attempting to deploy a commit to the Danswer Team on Vercel.

A member of the Team first needs to authorize it.

greptile-apps

PR Summary

Improves PDF processing configuration by removing hardcoded image extraction flag, now respecting the workspace's 'Enable Image Extraction and Analysis' setting.

Modified backend/onyx/file_processing/extract_file_text.py to use dynamic configuration instead of forcing extract_images=True for PDFs
Ensures consistent image extraction behavior across document types based on workspace settings
Provides better resource utilization by only extracting images when explicitly enabled

_{1 file reviewed, no comments}
_{Edit PR Review Bot Settings | Greptile}

Remove hardcoded image extraction flag for PDFs

3970888

PDFs currently always have their images extracted. This will make use of the "Enable Image Extraction and Analysis" workspace configuration instead.

emerzon requested a review from a team as a code owner June 30, 2025 16:57

greptile-apps bot reviewed Jun 30, 2025

View reviewed changes

Weves approved these changes Jul 1, 2025

View reviewed changes

Weves merged commit 8272482 into onyx-dot-app:main Jul 1, 2025
4 of 12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Remove hardcoded image extraction flag for PDFs #4970

Remove hardcoded image extraction flag for PDFs #4970

Uh oh!

emerzon commented Jun 30, 2025

Uh oh!

vercel bot commented Jun 30, 2025

Uh oh!

greptile-apps bot left a comment

Uh oh!

Uh oh!

Uh oh!

Remove hardcoded image extraction flag for PDFs #4970

Remove hardcoded image extraction flag for PDFs #4970

Uh oh!

Conversation

emerzon commented Jun 30, 2025

Description

Uh oh!

vercel bot commented Jun 30, 2025

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

PR Summary

Uh oh!

Uh oh!

Uh oh!