Skip to content

PDF File Ingest Performance prepdocs.ps1 #2516

Open
@PatrickGallucci

Description

@PatrickGallucci

I have 25K 1-5 page PDF files (65kb avg size) that I am uploading and it is taking about 30 seconds per file. At this pace it will take 8 days to process. I have bumped up my search service to standard with 18 search units (3x6) and there was no change in duration. Is there something that I can do to increase the performance of prepdocs.ps1?

Request: Can this be multi-threaded to run in parallel against multiple "data" folders?

This issue is for a: (mark with an x)

  • bug report -> please search issues before submitting
  • feature request
  • documentation issue or request
  • regression (a behavior that used to work and stopped in a new release)

Minimal steps to reproduce
Copy 26K PDF files to .\data\ folder
execute prepdocs.ps1

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions