ElevenLabs raises $500M to scale conversational AI for telcos
A fast-growing pioneer in human-like voice and audio AI has secured major new funding and is leaning into telecom use cases that are moving from pilots to production.
Funding round and $11B valuation
ElevenLabs raised $500 million in Series D funding at an $11 billion valuation, led by Sequoia Capital with continued participation from Andreessen Horowitz and ICONIQ, and new backing from Lightspeed, Evantic Capital and BOND alongside existing investors.
The raise more than triples the company’s valuation versus a year ago and pushes total funding to more than $750 million since its 2022 founding, signaling investor conviction that voice-native interfaces will be a core layer of enterprise AI stacks.
Platform scope and roadmap
Originally known for ultra-realistic text-to-speech, ElevenLabs now spans speech-to-text, multilingual dubbing, sound effects, music and conversational agents, wrapping models with orchestration tools and enterprise-grade runtime infrastructure.
The roadmap emphasizes two vectors: ElevenAgents for real-time agents that can speak, type and execute tasks grounded in enterprise data, and ElevenCreative for fusing audio with video and workflow automation, with go-to-market expansion across North America, Europe, Asia-Pacific and Latin America.
ARR milestones and enterprise adoption
The company says it has surpassed key ARR milestones and reported strong enterprise adoption across sectors through 2025, with telecom emerging as a priority vertical as operators seek to modernize legacy IVR and contact center stacks.
A planned path to IPO remains on the horizon, but the near-term focus is scaling production deployments and geographic coverage with locally embedded teams.
Why conversational AI matters for telcos now
Operators face rising service expectations, cost pressure and complex product catalogs, and voice-native AI is maturing to meet those constraints in real time.
From IVR menus to goal-driven conversations
Conversational agents can replace keypad IVRs with natural dialogue that recognizes intent, confirms identity, retrieves context and executes actions across channels, improving containment while easing live-agent load.
OPEX reduction and NPS impact
Early adopters target faster resolution, lower average handle time, higher first-contact resolution and better CSAT, while reducing cost-to-serve and smoothing call arrival variability during outages or promotions.
Multilingual reach and personalization at scale
Multilingual speech, accent robustness and dynamic voice selection enable localized experiences across markets, while grounding in operator data allows personalized offers, proactive care and targeted retention flows.
Telco deployment patterns for ElevenLabs
Recent operator projects illustrate how voice agents integrate with telecom back-ends and contact center tooling at scale.
Scaling customer care with voice agents
Europe’s largest operators are piloting and deploying ElevenLabs-powered voice agents in call centers to handle high-volume intents such as billing, plan changes, SIM activation, device troubleshooting and appointment scheduling, shifting traffic away from legacy IVR trees.
Ecosystem integrations and APIs
Agents connect via SIP or WebRTC into CCaaS platforms and call flows, integrate with CRM, order management, billing and ticketing systems, and can leverage TM Forum Open APIs or operator-specific middleware for orchestration.
Infrastructure alignment with Nvidia GPUs
Backing from Nvidia aligns ElevenLabs with high-performance AI infrastructure and GPU ecosystems, relevant for operators exploring on-prem, edge or telco cloud deployments where latency, concurrency and data residency drive architecture choices.
Operator implementation playbook for conversational AI
Success depends on disciplined scoping, tight integration and rigorous governance, not just model quality.
Start with high-volume, well-defined intents
Prioritize intents with clear workflows and high containment potential, such as payment promises, plan upgrades, move/add/change, porting status and outage triage, and build opt-in fallbacks to live agents.
Engineer for real-time performance and reliability
Target sub-300 ms end-to-end latency with local or edge inference where needed; plan for autoscaling, burst handling, packet loss resilience, retriable actions and telco-grade SLAs across speech, NLU, orchestration and back-end calls.
Data governance, security and compliance
Ground agents on governed knowledge sources and transaction systems with strict role-based access; enforce PII redaction, call recording controls, PCI for payments, GDPR/CCPA compliance and auditable decision logs.
Voice safety and brand protection controls
Mitigate cloning abuse with consent workflows, watermarking or provenance features, restricted voice libraries, liveness checks in authentication and policies aligned to carrier fraud controls and STIR/SHAKEN practices.
Integration, monitoring and operations
Build connectors to CRM/BSS, IAM, device management and field service; use experiment frameworks for prompt, policy and model updates; monitor intent recognition accuracy, interruption handling, barge-in, silence time, and downstream error rates.
What to watch next in telco AI
Key milestones over the next 12 months will signal how fast conversational AI becomes mainstream in telecom.
Benchmarks, SLAs and total cost to serve
Expect sharper benchmarks on word error rate under noise, dialog success rates, latency stability at peak load, and cost-per-contained-interaction versus human and traditional IVR baselines.
Productization and M&A in voice AI
Look for deeper packaged integrations with contact center providers, OSS/BSS vendors and RAN/NOC tooling, along with potential consolidation across voice AI, agent orchestration and observability.
Regulatory and standards evolution (TM Forum, GSMA)
Watch developments around AI safety guidance, synthetic media labeling, biometric consent, and alignment with TM Forum and GSMA initiatives that standardize APIs, policy controls and telco cloud deployment patterns.
The bottom line: ElevenLabs’ funding and momentum validate a shift toward voice-first interfaces, and operators that move now—on the right intents, with the right guardrails—can unlock lower cost-to-serve and better customer experience at scale.









