- CODE_TOKEN_OFFSET is 10 in decoded text (not 128266 in token ID space) because tokenizer.decode() maps 128266 → <custom_token_10> - Fixed 'SNAC object has no attribute device' — use explicit SNAC_DEVICE - Added debug logging for pipeline visibility - Audio now generates correctly: 442KB for "Hello world" True streaming pipeline verified: text → TextIteratorStreamer → regex extraction → SNAC decode → PCM bytes. The bottleneck is Jetson inference speed (~12s for first 42 tokens on a 3B model), not the streaming infrastructure. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
24 KiB
24 KiB