Skip to content

[Feature] Create Ingestion Pipeline to Scrape the Data And store into the Milvus VectorDB #14

@kolhesamiksha

Description

@kolhesamiksha

Research on stratergies or framework to reduce time and efficiently store data into Milvus VectorDB.

mainly focus on below efficient workflows, mostly used by the high workload environments.

  • Sample python + Kubeflow

  • Pypsark

  • Dask

  • Test which pipeline requires lesser time and implement best practices to manage CPU resources.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions