Incident 1: Streaming ASR Partial Churn Spike
Users complain that live captions keep rewriting earlier words. Final WER is nearly unchanged, but support tickets tripled after a decoder update.
Hidden answer: triage and prevention
Mitigate by rolling back the decoder or raising partial-stability thresholds for affected routes. Slice by utterance duration, endpoint VAD state, beam size, language, device, and network jitter. Look for changed emission timing, punctuation instability, hotword biasing, or aggressive rescoring. Add partial churn, first stable token latency, and finalization delay to release gates; final WER alone is not enough for streaming UX.