Any suggestions on how to use this with Speech LLMs? #5

mush42 · 2024-11-02T16:05:18Z

Hello

Thanks for making the code available.

Any recommendations regarding using this codec with audio LLMs?

Best
Musharraf

yzGuu830 · 2024-11-28T22:54:17Z

Hi @mush42, like mainstream codecs such as SoundStream and EnCodec, ESC can also serve as an acoustic tokenizer that produces codes across multiple streams. I suggest drawing on recent audio generative models (AudioLM, VALL-E, MusicGen) for ways to model the multi-level discrete codes effectively.
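
To make the "multi-level discrete codes" point concrete, here is a minimal sketch of one common way to feed multi-stream codes to a single-sequence language model: interleave the streams frame by frame and offset each stream's codes into its own vocabulary range. This is only an illustration, not ESC's API. The function names, the nested-list layout `[num_streams][num_frames]`, and the shared `codebook_size` are all assumptions; the models mentioned above each use their own, more elaborate schemes.

```python
from typing import List


def flatten_codes(codes: List[List[int]], codebook_size: int) -> List[int]:
    """Interleave per-stream codes frame by frame into one token sequence.

    `codes[k][t]` is the code of stream k at frame t (hypothetical layout).
    Stream k is shifted by k * codebook_size so streams never share token ids.
    """
    if not codes:
        return []
    num_frames = len(codes[0])
    if any(len(stream) != num_frames for stream in codes):
        raise ValueError("all streams must have the same number of frames")

    tokens: List[int] = []
    for t in range(num_frames):
        for k, stream in enumerate(codes):
            code = stream[t]
            if not 0 <= code < codebook_size:
                raise ValueError(f"code {code} outside [0, {codebook_size})")
            tokens.append(k * codebook_size + code)
    return tokens


def unflatten_tokens(
    tokens: List[int], num_streams: int, codebook_size: int
) -> List[List[int]]:
    """Invert flatten_codes: recover per-stream codes from the flat sequence."""
    if len(tokens) % num_streams != 0:
        raise ValueError("token count must be a multiple of num_streams")
    codes: List[List[int]] = [[] for _ in range(num_streams)]
    for i, token in enumerate(tokens):
        k = i % num_streams  # position within the frame gives the stream index
        codes[k].append(token - k * codebook_size)
    return codes


# Two streams, three frames, codebook size 16 (toy values).
example_codes = [[3, 1, 4], [1, 5, 9]]
flat = flatten_codes(example_codes, codebook_size=16)
restored = unflatten_tokens(flat, num_streams=2, codebook_size=16)
```

With this layout the language model's vocabulary has `num_streams * codebook_size` entries and the sequence is `num_streams` times longer than the frame count, which is the main cost of flattening; the generated tokens can be unflattened and passed back to the codec's decoder.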
