Overview
The client operates a telehealth platform serving patients across multiple geographies. Consultations are handled by either a human clinical agent or an AI-assisted workflow depending on triage — requiring a video infrastructure that could support both paths seamlessly.
Challenge
The platform needed real-time video capabilities that could:
- Scale to thousands of simultaneous sessions during peak hours
- Support AI agent integration without disrupting patient-facing audio/video quality
- Provide session recording for compliance and audit purposes
- Operate reliably across variable network conditions
Solution
KubeAce deployed a production LiveKit SFU cluster on Kubernetes (GKE), with:
- LiveKit Agents for server-side AI pipeline integration — handling transcription, triage routing, and AI voice responses without client SDK changes
- Multi-region deployment with latency-based routing for sub-100ms session establishment
- Egress pipelines for composite recording to GCS, triggered per session type
- WebRTC observability — packet loss, jitter, and MOS score dashboards in Grafana with alerting thresholds before quality degradation becomes user-visible
Results
The platform handled peak load from day one, with no unplanned downtime in the first 90 days post-launch. AI-assisted consultations run on the same infrastructure as human-agent sessions with no architectural distinction from the client SDK perspective.