The auto-select beam always returns an angle, even for noise, so VAD based on it was always true. The processed_doa value (index 0) is NaN when no speech is present and a real angle when speech is detected.

Now the angle comes from the auto-select beam, and VAD is true only when processed_doa is non-NaN.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
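
A minimal sketch of the logic described above, not the actual code from this change. The names `DoaReading` and `interpret_doa` are hypothetical, and the two inputs are assumed to have already been read from the device as floats. The angle unit is whatever the device reports.

```python
import math
from typing import NamedTuple


class DoaReading(NamedTuple):
    """Hypothetical result type: a direction plus a speech flag."""
    angle: float        # angle reported by the auto-select beam
    voice_active: bool  # True only when processed_doa holds a real angle


def interpret_doa(processed_doa: float, auto_select_angle: float) -> DoaReading:
    """Combine the two values as described in the commit message.

    processed_doa is NaN when no speech is present, so it serves as the VAD
    signal. The auto-select beam always yields an angle, so it is used for
    direction only and never for voice activity.
    """
    voice_active = not math.isnan(processed_doa)
    return DoaReading(angle=auto_select_angle, voice_active=voice_active)
```

With this split, a noise-only frame still reports the auto-select angle but sets voice_active to False, so callers can ignore the angle when no speech is detected.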