🗣️ Voice Activity Detection with WebRTC & Silero

A cross-model Voice Activity Detection (VAD) tool with real-time and file-based analysis using both WebRTC VAD and Silero VAD models, built in PyQt6. Designed for visualization, evaluation, and comparison.

🚀 Features

🎙️ Live microphone-based VAD
📂 File-based VAD analysis
📊 Comparison mode: WebRTC vs Silero metrics side by side
📈 Generates:
- Spectrograms
- Waveforms
- Confusion matrices
- Metric charts (Accuracy, Precision, Recall, F1-score, etc.)
🧠 Silero model with intelligent frame-by-frame classification
🌐 WebRTC model integrated for fast binary VAD

🧰 Tech Stack

Python 3.9+
PyQt6
matplotlib, numpy
simpleaudio
wave
WebRTC VAD wrapper
Silero VAD

💻 Installation

Clone the repo

git clone https://github.com/edyamza/Voice-Activity-Detection-WebRTC-Silero.git
cd Voice-Activity-Detection-WebRTC-Silero

Install dependencies

pip install -r requirements.txt

Run the app

python vad_guide.py

🖼️ GUI Preview

🎛️ Main Interface

📊 Confusion Matrices – WebRTC vs Silero

📈 Metric Comparison Bar Chart

🔊 Spectrogram + Waveform View

📁 Project Structure

├── vad_guide.py            # Main GUI application
├── vad_rec.py              # WebRTC file-based VAD
├── vad_rec_silero.py       # Silero VAD interface
├── vad_live.py             # Live audio processing
├── evaluation.py           # Metrics & graph generation
├── output/                 # Saved plots and images
└── requirements.txt

📄 License

This project is licensed under the MIT License.

✨ Author

Eduard Amza — GitHub

🧠 Inspired by

Feel free to ⭐ the project or contribute!

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
AUDIO FILES		AUDIO FILES
generareetichete		generareetichete
output		output
screenshots		screenshots
silero-0.4.1		silero-0.4.1
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
etichete_audio.json		etichete_audio.json
evaluation.py		evaluation.py
images.png		images.png
output.txt		output.txt
requirements.txt		requirements.txt
vad_guide.py		vad_guide.py
vad_live.py		vad_live.py
vad_rec.py		vad_rec.py
vad_rec_silero.py		vad_rec_silero.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🗣️ Voice Activity Detection with WebRTC & Silero

🚀 Features

🧰 Tech Stack

💻 Installation

🖼️ GUI Preview

🎛️ Main Interface

📊 Confusion Matrices – WebRTC vs Silero

📈 Metric Comparison Bar Chart

🔊 Spectrogram + Waveform View

📁 Project Structure

📄 License

✨ Author

🧠 Inspired by

About

Uh oh!

Releases

Packages

Uh oh!

Languages

edyamza/Voice-Activity-Detection-WebRTC-Silero

Folders and files

Latest commit

History

Repository files navigation

🗣️ Voice Activity Detection with WebRTC & Silero

🚀 Features

🧰 Tech Stack

💻 Installation

🖼️ GUI Preview

🎛️ Main Interface

📊 Confusion Matrices – WebRTC vs Silero

📈 Metric Comparison Bar Chart

🔊 Spectrogram + Waveform View

📁 Project Structure

📄 License

✨ Author

🧠 Inspired by

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages