neural-audio-codec

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

text-to-speech transformers tts speech-synthesis speech-recognition speech-to-text whisper audio-processing mlx multimodal apple-silicon silero-vad neural-audio-codec

Updated Aug 26, 2025

pujariaditya / HiggsAudiov2TokenizerUnofficial

Unofficial PyTorch implementation of Higgs Audio V2 Tokenizer with HuBERT semantic features. Complete training pipeline for semantic-acoustic audio tokenization with 960x downsampling and 8-layer RVQ.

pytorch audio-synthesis speech-processing audio-processing vector-quantization dac semantic-features hubert audio-generation neural-audio-codec rvq audio-tokenizer neural-codec higgs-audio speech-tokenization

Updated Oct 8, 2025
Python


Here are 7 public repositories matching this topic...

jiaqili3 / DualCodec

DillionLowry / NeuralCodecs

amphionspace / FlexiCodec

lucasnewman / descript-mlx

m-pana / spk_anon_nac_lm

Swap98-Coder / mlx-audio

pujariaditya / HiggsAudiov2TokenizerUnofficial
