Prompt 1: End-To-End Architecture
Give an advanced design answer covering the online serving path, offline eval path, data feedback path, rollout process, and rollback plan.
Hidden answer: strong architecture outline
Prefer a cascaded design (VAD, streaming ASR, retrieval, LLM policy engine, tool router, streaming TTS) unless the product requires direct speech-to-speech research. Separate interactive serving capacity from batch evaluation capacity so that eval jobs cannot starve live traffic. Version every component that can change behavior: ASR, retrieval index, prompt, tools, LLM, TTS voice, and client config. Gate launches on ASR slice WER, retrieval grounding, task success, unsafe action rate, time to first audio byte, end-to-end turn latency, interruption recovery, escalation precision, cost per resolved ticket, and rollback time.
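One way to make the versioning and gating concrete is a pinned release manifest plus a gate check that blocks rollout when any metric is missing or out of bounds. The sketch below is illustrative only: the class and function names, the metric keys, and every threshold value are hypothetical and would need to come from the product's own baselines.

```python
from dataclasses import dataclass, asdict
import hashlib
import json


@dataclass(frozen=True)
class ReleaseManifest:
    """Pins every independently versioned component of one release (hypothetical schema)."""
    asr: str
    retrieval_index: str
    prompt: str
    tools: str
    llm: str
    tts_voice: str
    client_config: str

    def fingerprint(self) -> str:
        # A stable short hash of the pinned versions. Rolling back means
        # redeploying the manifest of the last known-good fingerprint.
        payload = json.dumps(asdict(self), sort_keys=True).encode()
        return hashlib.sha256(payload).hexdigest()[:12]


# metric name -> (limit, direction). "max" means the value must be <= limit,
# "min" means the value must be >= limit. All limits are placeholder assumptions.
GATES = {
    "asr_slice_wer": (0.12, "max"),
    "retrieval_grounding": (0.90, "min"),
    "task_success": (0.80, "min"),
    "unsafe_action_rate": (0.001, "max"),
    "first_audio_byte_ms_p95": (800, "max"),
    "turn_latency_ms_p95": (2500, "max"),
    "interruption_recovery": (0.95, "min"),
    "escalation_precision": (0.85, "min"),
    "cost_per_resolved_ticket_usd": (1.50, "max"),
    "rollback_time_s": (300, "max"),
}


def evaluate_gates(metrics, gates=GATES):
    """Return a list of gate failures; an empty list means the launch may proceed."""
    failures = []
    for name, (limit, direction) in gates.items():
        value = metrics.get(name)
        if value is None:
            # A missing metric blocks the launch rather than passing silently.
            failures.append(f"{name}: missing")
        elif direction == "max" and value > limit:
            failures.append(f"{name}: {value} violates max {limit}")
        elif direction == "min" and value < limit:
            failures.append(f"{name}: {value} violates min {limit}")
    return failures


# Example candidate release and offline eval results (made-up values).
candidate = ReleaseManifest(
    asr="asr-v7", retrieval_index="idx-0412", prompt="p-31", tools="t-9",
    llm="llm-v3", tts_voice="voice-b2", client_config="cfg-18",
)
eval_metrics = {
    "asr_slice_wer": 0.10, "retrieval_grounding": 0.93, "task_success": 0.84,
    "unsafe_action_rate": 0.0005, "first_audio_byte_ms_p95": 650,
    "turn_latency_ms_p95": 2100, "interruption_recovery": 0.97,
    "escalation_precision": 0.88, "cost_per_resolved_ticket_usd": 1.20,
    "rollback_time_s": 120,
}
launch_blockers = evaluate_gates(eval_metrics)
```

Treating a missing metric as a failure is a deliberate choice in this sketch: a gate that passes when the eval pipeline silently drops a metric gives false confidence. Keying rollback on the manifest fingerprint keeps all seven components moving together, which avoids mixing, for example, a new prompt with an old retrieval index.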