fix: reduce choppy playback in Alibaba Cloud Bailian streaming TTS #3008
Open
dbsd11 wants to merge 1 commit into
Collaborator
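I tried it out. The returned audio is indeed slightly smoother than the original, but when the LLM returns a large amount of text, the TTS step takes a very long time.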
Collaborator
Which model and voice are you using? On my side, with cosyvoice-v2 and longxiaochun_v2, the response does not include the original_text field.
Author
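That should not happen. When the LLM streams its output, Alibaba Cloud Bailian speech synthesis is also called in streaming mode. Bailian detects sentences automatically: as soon as the incoming text forms a sentence (or clause), a sentence synthesis event fires, and at that point the audio data for that one sentence (or clause) is merged and sent to the device in one go. Playback on the device then sounds smooth. This approach is actually friendlier to long LLM output. For example, the reply you are reading now, once split into sentences, comes to only ten to twenty characters per sentence.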
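The buffering described in this reply can be sketched roughly as follows. This is a minimal illustration, not the code in this PR and not the Bailian SDK: the class `SentenceAudioBuffer`, its callback names, and the `("audio", ...)` / `("sentence_end", ...)` event tuples are all hypothetical stand-ins for whatever events the streaming synthesizer actually delivers.

```python
from typing import Callable, Iterable, List, Tuple


class SentenceAudioBuffer:
    """Hypothetical helper: hold audio chunks until a sentence is complete."""

    def __init__(self, send: Callable[[bytes], None]) -> None:
        self._send = send              # callback that delivers audio to the device
        self._chunks: List[bytes] = []  # chunks of the sentence currently in progress

    def on_audio_chunk(self, chunk: bytes) -> None:
        # Accumulate instead of forwarding each small chunk immediately.
        if chunk:
            self._chunks.append(chunk)

    def on_sentence_end(self) -> None:
        # Forward everything buffered for this sentence as one packet.
        if self._chunks:
            self._send(b"".join(self._chunks))
            self._chunks.clear()

    def on_stream_end(self) -> None:
        # Flush any trailing audio that never got a sentence-end event.
        self.on_sentence_end()


def replay(events: Iterable[Tuple[str, bytes]]) -> List[bytes]:
    """Feed assumed (kind, payload) events through the buffer; return sent packets."""
    sent: List[bytes] = []
    buf = SentenceAudioBuffer(send=sent.append)
    for kind, payload in events:
        if kind == "audio":
            buf.on_audio_chunk(payload)
        elif kind == "sentence_end":
            buf.on_sentence_end()
    buf.on_stream_end()
    return sent


packets = replay([
    ("audio", b"ab"), ("audio", b"cd"), ("sentence_end", b""),
    ("audio", b"ef"),
])
```

Here `packets` ends up holding two entries, one per sentence, instead of three per-chunk sends. Plain byte concatenation is only appropriate for raw formats such as PCM; for a framed codec the buffer would likely need to keep a list of frames instead.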
Author
I am using Alibaba Cloud Bailian's streaming cosyvoice-v3-flash model, with the longanhuan voice.
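Previously the audio was played back word by word, which was noticeably choppy. It is now played back sentence by sentence, which sounds much more natural.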