[Interspeech 2025] DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec
-
Updated
Oct 23, 2025 - Jupyter Notebook
[Interspeech 2025] DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec
Neural Audio Codecs implemented in C# - DAC, SNAC, Encodec, Dia
FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates
Implementation of the Descript Audio Codec in MLX
Author's code of "Speaker anonymization using neural audio codec language models" (ICASSP 2024).
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Unofficial PyTorch implementation of Higgs Audio V2 Tokenizer with HuBERT semantic features. Complete training pipeline for semantic-acoustic audio tokenization with 960x downsampling and 8-layer RVQ.
Add a description, image, and links to the neural-audio-codec topic page so that developers can more easily learn about it.
To associate your repository with the neural-audio-codec topic, visit your repo's landing page and select "manage topics."