Prefer yamnet_edgetpu.tflite when available; fall back to the CPU model otherwise.
Cuts inference from ~50-100 ms to ~2-3 ms per classification.
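The selection logic could be sketched as below. The filenames come from the commit and the download instructions; the helper name and the no-model return value are illustrative, and the actual Edge TPU path would additionally load the libedgetpu delegate when building the interpreter.

```python
from pathlib import Path

def select_model(models_dir="models"):
    """Pick the Edge TPU model when present, else the CPU model.

    Hypothetical helper: returns (path, use_edgetpu_delegate), or
    (None, False) when no model file exists (graceful degradation).
    """
    models = Path(models_dir)
    edgetpu = models / "yamnet_edgetpu.tflite"
    cpu = models / "yamnet.tflite"
    if edgetpu.exists():
        # With tflite_runtime, the Edge TPU interpreter would be built as:
        #   Interpreter(model_path=str(edgetpu),
        #               experimental_delegates=[load_delegate("libedgetpu.so.1")])
        return str(edgetpu), True
    if cpu.exists():
        return str(cpu), False
    return None, False
```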
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Adds voice-based speaker ID triggered by YAMNet speech detection.
New speaker_id.py module with SQLite-backed voice enrollment and
cosine similarity matching. Endpoints: POST /speakers/enroll,
POST /speakers/enroll-from-mic, GET /speakers, DELETE /speakers/{name}.
Orange LED animation during enrollment.
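The matching step might look like this sketch: enrollment stores one embedding per speaker, and identification picks the enrolled speaker with the highest cosine similarity above a threshold. The function names and the 0.75 threshold are illustrative, not taken from speaker_id.py.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    if na == 0 or nb == 0:
        return 0.0
    return dot / (na * nb)

def match_speaker(embedding, enrolled, threshold=0.75):
    """Return the best-matching enrolled speaker name, or None.

    `enrolled` maps name -> stored embedding (in the service these
    would be loaded from the SQLite enrollment table).
    """
    best_name, best_score = None, threshold
    for name, ref in enrolled.items():
        score = cosine_similarity(embedding, ref)
        if score > best_score:
            best_name, best_score = name, score
    return best_name
```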
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
New sound_id.py module with SoundClassifier class that runs YAMNet
(521 audio event categories) on CPU TFLite. Classifies audio every
0.5s from a ring buffer fed by the existing audio stream.
Categories: speech, alert, music, animal, household, environment, silence.
Smoothing via 20-sample history window for stable dominant category.
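The smoothing described above amounts to a majority vote over a sliding window; a minimal sketch (class and method names are illustrative, the window size of 20 is from the commit):

```python
from collections import Counter, deque

class CategorySmoother:
    """Majority vote over the last N classifications.

    With one classification every 0.5 s, a 20-sample window covers
    roughly the last 10 seconds of audio.
    """
    def __init__(self, window=20):
        self.history = deque(maxlen=window)

    def update(self, category):
        """Record a new top category and return the smoothed one."""
        self.history.append(category)
        return self.dominant()

    def dominant(self):
        """Most common category in the window; 'silence' when empty."""
        if not self.history:
            return "silence"
        return Counter(self.history).most_common(1)[0][0]
```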
New endpoints: GET /sounds, GET /sounds/history
Updated: /health (sound_classification_enabled), /status (audio_scene)
Graceful degradation if model files not present.
Model download (not tracked in git):
curl -sL 'https://tfhub.dev/google/lite-model/yamnet/classification/tflite/1?lite-format=tflite' -o models/yamnet.tflite
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Card numbers can shift based on USB enumeration order at boot.
Using 'plughw:ArrayUAC10,0' instead of 'plughw:2,0' ensures
the ReSpeaker is found regardless of when it connects.
Fixed by Vixy after power loss shuffled card order 🦊
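The fix itself just hands ALSA the card id ('plughw:ArrayUAC10,0') and lets ALSA resolve it. To illustrate why the id is stable while the number is not, here is a hypothetical helper that maps a card id to its index by parsing text in the format of /proc/asound/cards:

```python
import re

def card_index(cards_text, name):
    """Find the ALSA card index for a given card id.

    Purely illustrative: the same id can sit at a different index
    on each boot, which is exactly why the service addresses the
    device by name instead.
    """
    for line in cards_text.splitlines():
        m = re.match(r"\s*(\d+)\s+\[(\S+)\s*\]", line)
        if m and m.group(2) == name:
            return int(m.group(1))
    return None
```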
- Replaced PyAudio with direct ALSA (arecord subprocess)
- Single audio stream feeds both Porcupine and recording buffer
- Fixes the "device unavailable" error when recording after the wake word
- Simplified architecture
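The single-stream fan-out could be sketched as follows. The frame size, sample format, and buffer handling are assumptions (Porcupine typically consumes fixed-length 16-bit mono frames); a BytesIO stands in for arecord's stdout so the sketch is self-contained.

```python
import struct

FRAME_SAMPLES = 512   # assumed wake-word frame length
SAMPLE_WIDTH = 2      # S16_LE

def stream_frames(raw, ring):
    """Read fixed-size PCM frames from one stream and fan them out.

    Each frame's raw bytes go into `ring` (the recording buffer)
    and the unpacked samples are yielded for the wake-word engine,
    so both consumers share a single capture stream.
    """
    frame_bytes = FRAME_SAMPLES * SAMPLE_WIDTH
    while True:
        chunk = raw.read(frame_bytes)
        if len(chunk) < frame_bytes:
            break
        ring.append(chunk)                            # recording buffer
        yield struct.unpack(f"<{FRAME_SAMPLES}h", chunk)  # wake-word input

# In the service, `raw` would be arecord's stdout, e.g. (flags assumed):
#   proc = subprocess.Popen(
#       ["arecord", "-D", "plughw:ArrayUAC10,0", "-f", "S16_LE",
#        "-r", "16000", "-t", "raw"],
#       stdout=subprocess.PIPE)
#   for pcm in stream_frames(proc.stdout, ring): ...
```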