wavekat/wavekat-lab

WaveKat Lab

A research repo for the WaveKat project — interactive tools and Jupyter notebooks for working with audio models (VAD, turn detection, voice datasets, and more).

Warning: Early development. Things may change.

What's In Here

wavekat-lab/
├── tools/
│   ├── audio-lab/     Real-time VAD + Turn Detection comparison app (Rust + React)
│   └── cv-explorer/   Mozilla Common Voice dataset browser (Cloudflare Workers + React)
├── notebooks/         Jupyter notebooks (training, validation, dataset splits)
└── docs/              Plans and design docs

Each tool is self-contained — its own Makefile, lockfiles, and build setup live inside its folder.

Tools

Audio Lab (tools/audio-lab/)

Web app for testing and comparing WaveKat library backends side by side in real time. Live mic capture, WAV upload, multi-config fan-out, VAD-gated pipeline mode, waveform + spectrogram + probability timelines.

Backends: webrtc-vad, silero-vad, ten-vad, firered-vad, pipecat smart-turn. Details →
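To make "VAD-gated pipeline mode" concrete, here is a minimal sketch of the general idea: a VAD decides frame by frame whether speech is present, and only completed speech segments are handed to a downstream stage such as a turn detector. This is not Audio Lab's implementation. The names `energy_vad`, `vad_gated_segments`, and `hangover_frames` are hypothetical, and the energy threshold is a toy stand-in for a real backend such as silero-vad.

```python
from typing import Callable, Iterable, Iterator, List

Frame = List[float]  # one fixed-size chunk of mono samples in [-1.0, 1.0]


def energy_vad(frame: Frame, threshold: float = 0.01) -> bool:
    """Toy stand-in for a real VAD backend: mean-square energy above a threshold."""
    if not frame:
        return False
    return sum(s * s for s in frame) / len(frame) > threshold


def vad_gated_segments(
    frames: Iterable[Frame],
    is_speech: Callable[[Frame], bool] = energy_vad,
    hangover_frames: int = 2,
) -> Iterator[List[Frame]]:
    """Yield speech segments (speech frames only).

    A segment closes once `hangover_frames` consecutive silent frames follow
    speech, so short pauses do not split an utterance.
    """
    segment: List[Frame] = []
    silent_run = 0
    for frame in frames:
        if is_speech(frame):
            segment.append(frame)
            silent_run = 0
        elif segment:
            silent_run += 1
            if silent_run >= hangover_frames:
                yield segment
                segment, silent_run = [], 0
    if segment:  # flush speech still open when the stream ends
        yield segment


loud, quiet = [0.5] * 160, [0.0] * 160
stream = [quiet, loud, loud, quiet, loud, quiet, quiet, loud]
segments = list(vad_gated_segments(stream))
print([len(s) for s in segments])  # → [3, 1]
```

In a gated pipeline, the more expensive model would run once per yielded segment rather than on every frame, which is the usual reason for gating.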

Common Voice Explorer (tools/cv-explorer/)

Web app for browsing and playing audio clips from the Mozilla Common Voice dataset. Filter by locale, split, demographics, and search sentences — with waveform playback powered by WaveSurfer.js. Built on Cloudflare Workers + D1 + R2. Details →

Live: https://commonvoice-explorer.wavekat.com/
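The filtering described above can be approximated offline with a few lines of Python. This is only a sketch under assumptions: the explorer's actual D1 schema and API are not shown in this README, the column names (`path`, `sentence`, `age`, `gender`, `locale`, `split`) and the sample rows are hypothetical, and `filter_clips` is an illustrative helper rather than anything the tool exposes.

```python
import csv
import io
from typing import Dict, Iterator, List, Optional

# Hypothetical sample metadata in tab-separated form (not real dataset rows).
SAMPLE_TSV = (
    "path\tsentence\tage\tgender\tlocale\tsplit\n"
    "clip_001.mp3\tThe cat sat on the mat.\ttwenties\tfemale\ten\ttrain\n"
    "clip_002.mp3\tLe chat dort.\tthirties\tmale\tfr\ttrain\n"
    "clip_003.mp3\tA cat chased the wave.\t\tmale\ten\ttest\n"
)


def load_rows(tsv_text: str) -> List[Dict[str, str]]:
    """Parse tab-separated clip metadata into a list of dicts."""
    return list(csv.DictReader(io.StringIO(tsv_text), delimiter="\t"))


def filter_clips(
    rows: List[Dict[str, str]],
    locale: Optional[str] = None,
    split: Optional[str] = None,
    gender: Optional[str] = None,
    query: Optional[str] = None,
) -> Iterator[Dict[str, str]]:
    """Yield rows matching every filter that is set; unset filters match all rows."""
    needle = query.lower() if query else None
    for row in rows:
        if locale and row["locale"] != locale:
            continue
        if split and row["split"] != split:
            continue
        if gender and row["gender"] != gender:
            continue
        if needle and needle not in row["sentence"].lower():
            continue
        yield row


rows = load_rows(SAMPLE_TSV)
english_cats = [r["path"] for r in filter_clips(rows, locale="en", query="cat")]
print(english_cats)  # → ['clip_001.mp3', 'clip_003.mp3']
```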

Notebooks

notebooks/ is the home for Jupyter notebooks covering training, validation, and dataset-splitting workflows. Python env is managed by uv.

make setup-notebooks   # one-time: uv sync the notebook env
make lab               # start Jupyter Lab on notebooks/

Repo Layout Conventions

  • Per-tool Makefiles: tools/<name>/Makefile owns dev/build/CI for that tool. Run cd tools/<name> && make help to see what's there.
  • Root Makefile: repo-wide only (setup, lab, ci, and per-tool CI delegators).
  • No shared Cargo workspace at root: each Rust tool keeps its own Cargo.toml / Cargo.lock / target/ inside its folder.
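As a small illustration of the per-tool convention, the sketch below lists which folders under tools/ carry their own Makefile and prints the matching `make -C` command for each. The helper name `tools_with_makefile` is hypothetical and the script is not part of the repo; it assumes only the layout described above.

```python
from pathlib import Path
from typing import List


def tools_with_makefile(repo_root: Path) -> List[str]:
    """Return the names of tools/<name>/ folders that contain a Makefile."""
    tools_dir = repo_root / "tools"
    if not tools_dir.is_dir():
        return []
    return sorted(
        entry.name for entry in tools_dir.iterdir() if (entry / "Makefile").is_file()
    )


if __name__ == "__main__":
    # Run from the repo root; `make -C <dir> help` is equivalent to
    # `cd <dir> && make help`.
    for name in tools_with_makefile(Path.cwd()):
        print(f"make -C tools/{name} help")
```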

Videos

  • Common Voice Explorer Demo: "Exploring Mozilla Common Voice with Common Voice Explorer". Introducing Common Voice Explorer: browse and listen to 1.8M+ real voice clips from the Mozilla Common Voice dataset.
  • Pipecat Smart Turn Visual Test: "Testing Pipecat Smart Turn with WaveKat Lab". Visual test of Pipecat Smart Turn v3: live recording and VAD-gated pipeline mode simulating production workflows.
  • FireRed VAD Showdown: "Adding FireRedVAD as the 4th backend". Benchmarking Xiaohongshu's FireRedVAD against Silero, TEN VAD, and WebRTC across accuracy and latency.
  • VAD Lab Demo: "VAD Lab: Real-time multi-backend comparison". Live demo of VAD Lab comparing WebRTC, Silero, and TEN VAD side by side with real-time waveform visualization.

License

Licensed under Apache 2.0.

Copyright 2026 WaveKat.
