We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
embedding的推理,根据日志好像是频繁加载再推理的,这个在embedding模型很大的时候,很浪费时间,所以希望优化一下,做到一次加载,多次推理。