Skip to content

embedding的推理问题 #3

Open
@dywlegend1002

Description

@dywlegend1002

embedding的推理,根据日志好像是频繁加载再推理的,这个在embedding模型很大的时候,很浪费时间,所以希望优化一下,做到一次加载,多次推理。

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions