Support SpeechT5 text-to-speech pipeline by OpenVINO #1230
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
@rkazants could you please provide tests?
@rkazants please also update the import structure for the added model classes.
…ech_speecht5_153160
Updated
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Done
@eaidova, @IlyasMoutawwakil, @echarlaix, could you please review the PR? Thanks.
The CI failures are not related to my changes in this PR. For example, I see an issue with the Whisper model. Please correct me if I am wrong.
@IlyasMoutawwakil could you please rerun CI? Thanks.
@eaidova done. Was this issue fixed?
@IlyasMoutawwakil thanks, yes, @rkazants is working on the fix.
Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>
LGTM, great addition!
I think there is still some redundancy and room to make the implementation leaner. For example, with Whisper, which has a custom generate method, we only make sure our class is compliant with its behavior and use the method directly from transformers.
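The reuse pattern described in this comment can be illustrated with a toy sketch. Everything below is hypothetical (`UpstreamSeq2Seq`, `BackendSeq2Seq`, `encode`, `decode_step` are made-up names) and is not the optimum-intel or transformers code; it only shows how a backend class can borrow an upstream `generate` unchanged as long as it provides the methods that `generate` calls.

```python
class UpstreamSeq2Seq:
    """Toy stand-in for a library model that owns a custom generate() loop."""

    def encode(self, text):
        return list(text)

    def decode_step(self, state):
        # Consume one item per step; None signals the end of the sequence.
        return state.pop(0).upper() if state else None

    def generate(self, text, max_steps=10):
        state = self.encode(text)
        out = []
        for _ in range(max_steps):
            token = self.decode_step(state)
            if token is None:
                break
            out.append(token)
        return "".join(out)


class BackendSeq2Seq:
    """Toy stand-in for an accelerated backend: it supplies its own
    encode/decode_step and reuses the upstream generate() as-is."""

    def encode(self, text):
        return list(reversed(text))

    def decode_step(self, state):
        return state.pop() if state else None

    # Borrow the loop instead of re-implementing it; it only relies on
    # encode() and decode_step(), which this class provides.
    generate = UpstreamSeq2Seq.generate
```

The backend class carries no copy of the generation loop, so any upstream change to that loop is picked up automatically, provided the backend keeps honoring the same small interface.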
Responded here #1230 (comment)
Left a couple of minor comments; good to merge once resolved.
Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>
What does this PR do?
This PR adds support for the SpeechT5 text-to-speech pipeline using OpenVINO. Here is demo code:
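The demo snippet is not preserved in this copy of the description. As a rough stand-in, here is a minimal sketch of what such a pipeline might look like. It rests on several assumptions, not confirmed by the text above: that the new class is importable as `OVModelForTextToSpeechSeq2Seq` from `optimum.intel`, that `model_dir` points to a SpeechT5 model already exported to OpenVINO together with its vocoder, and that `generate` mirrors the transformers SpeechT5 interface and returns a waveform tensor. The `write_wav_16k` helper and the zero speaker embedding are illustrative placeholders.

```python
import struct
import wave


def write_wav_16k(path, samples, sample_rate=16000):
    """Write mono float samples in [-1, 1] as 16-bit PCM WAV (stdlib only)."""
    clipped = (max(-1.0, min(1.0, float(s))) for s in samples)
    frames = b"".join(struct.pack("<h", int(round(s * 32767))) for s in clipped)
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wav.setframerate(sample_rate)
        wav.writeframes(frames)
    return len(frames) // 2  # number of samples written


def synthesize(text, model_dir, processor_id="microsoft/speecht5_tts"):
    """Return a list of float samples for `text` (sketch, see assumptions)."""
    # Heavy imports stay local so write_wav_16k works without them.
    import torch
    from transformers import AutoProcessor
    # Assumption: the class added by this PR is exposed under this name.
    from optimum.intel import OVModelForTextToSpeechSeq2Seq

    processor = AutoProcessor.from_pretrained(processor_id)
    # Assumption: model_dir holds an already exported OpenVINO model.
    model = OVModelForTextToSpeechSeq2Seq.from_pretrained(model_dir)
    inputs = processor(text=text, return_tensors="pt")
    # Placeholder speaker: SpeechT5 expects a (1, 512) x-vector; a real
    # speaker embedding should be used for realistic output.
    speaker_embeddings = torch.zeros((1, 512))
    # Assumption: generate() follows the transformers SpeechT5 interface.
    speech = model.generate(inputs["input_ids"], speaker_embeddings=speaker_embeddings)
    return speech.reshape(-1).tolist()


# Hypothetical usage, with a placeholder path to the exported model:
# samples = synthesize("Hello, OpenVINO!", model_dir="path/to/exported_model")
# write_wav_16k("speech.wav", samples)
```

SpeechT5 produces 16 kHz audio, which is why the helper defaults to that sample rate.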
Before submitting