cartesia

AI Voice Generators

Cartesia

Low-latency Sonic TTS, Ink transcription, voice cloning, and Line agents for real-time voice AI.

Cartesia: Low-latency Sonic TTS, Ink transcription, voice cloning, and Line agents for real-time voice AI. Pricing: From $5/mo + usage. Best for Real-time voice agents and conversational audio products and Low-latency TTS APIs for interactive apps.

Tool score: 8.3 / 10
Self-serve developer plansFrom $5/moLine Voice AgentsUsage-based from $0.06Team and organization plansFrom $49/mo
Free plan available

Tool scorecard

How the tool scores

A standardized tool profile score across usability, pricing value, feature depth, and support quality.

Tool score

8.3

/ 10

Ease of use

8.0

Onboarding, navigation, and day-to-day workflow friction.

Value for money

8.2

How much practical capability the product delivers for the price.

Features

8.8

Breadth, polish, and depth across the core product surface.

Support

8.2

Docs, help channels, and how easily issues get resolved.

Overview

Bottom line

Cartesia is a developer-first voice AI platform for building low-latency speech products around Sonic text-to-speech, Ink speech-to-text, voice cloning, localization, and Line voice agents. Its strongest fit is teams building real-time voice agents, interactive apps, dubbing workflows, narration systems, or product audio where first-byte latency, streaming, concurrency, and API control matter more than a finished creator studio.

The core caveat is that Cartesia is a usage and infrastructure decision as much as a voice-quality decision. Pricing combines monthly credit pools, prepaid agent dollars, metered agent minutes, concurrency limits, overages, and enterprise deployment routes, so teams need to model real scripts, call duration, STT volume, cloned-voice usage, and phone-number costs before scaling. Start with the free or Pro self-serve route for evaluation, move to Startup or Scale when commercial usage, organizations, professional voice cloning, and higher concurrency matter, and use Enterprise when custom deployment, compliance, or volume terms are required.

Pricing

Pricing snapshot

Pricing routes

Use these route cards to separate app subscriptions, API meters, team workspaces, and enterprise purchasing without flattening them into a single plan table. Open the pricing page for detailed plan or API meter facts.

Direct APIPrimary

Self-serve developer plans

From $5/mo

Monthly plans provide credits for Sonic TTS and Ink STT plus prepaid dollars for Line voice agents, making this the default route for developers evaluating and scaling API usage.

Direct API

Line Voice Agents

Usage-based from $0.06

Line agent usage is billed by agent minutes, with a separate per-minute telephony charge when using a Cartesia-provided phone number.

Team workspace

Team and organization plans

From $49/mo

Startup and Scale add organization features, professional voice cloning, larger credit pools, higher concurrency, and priority support for production teams.

Enterprise sales

Enterprise deployments

Custom

Enterprise is the route for custom credits, custom concurrency, SSO, compliance paperwork, shared support channels, on-premise, VPC, OEM, or regulated deployments.

Free

$0/mo + usage

Usage: 20K credits/mo; ~27 TTS min; ~1h51m STT; 1 agent slot; $1 prepaid agents

  • Text to Speech and Speech to Text
  • Instant voice cloning

Pro

$5/mo + usage

Usage: 100K credits/mo; ~133 TTS min; ~9h16m STT; 3 TTS concurrency; $5 prepaid agents

Most popular
  • Commercial use license
  • Instant voice cloning

Startup

$49/mo + usage

Usage: 1.25M credits/mo; ~1,667 TTS min; ~115h42m STT; 5 TTS concurrency; $49 prepaid agents

  • Professional voice cloning
  • Organizations

Scale

$299/mo + usage

Usage: 8M credits/mo; ~10,667 TTS min; ~740h44m STT; 15 TTS concurrency; $299 prepaid agents

  • Priority support
  • High concurrency limits

Line voice agent usage

Usage-based API

Usage: $0.06 per minute for Line calls; $0.014 per minute with a Cartesia phone number; prepaid agent dollars vary by plan

  • Metered voice agent minutes
  • Separate telephony charge for Cartesia phone numbers

Enterprise

Custom quote or bundled pricing

Usage: Custom credits, agent usage, concurrency, deployment, and compliance terms

  • Custom concurrency limits
  • DPAs and BAAs for compliance
  • SSO and shared Slack channel
  • On-premise, VPC, or OEM licensing by agreement

Capabilities

Input types

text, audio

Output types

audio, text

Supported platforms

Web

Delivery modes

Web app, API, SDK, Developer console

Company context

Company

Cartesia

Founded

Not specified

Headquarters

Not specified

Category

AI Voice Generators

FAQ

Common questions

Does Cartesia support real-time voice agents?

Yes. Cartesia positions Sonic TTS, Ink STT, and Line voice agents as a real-time voice stack, with pricing that separates model credits from metered agent minutes and telephony costs.

Can Cartesia clone voices?

Cartesia supports instant voice cloning on its self-serve plans and professional voice cloning on higher plans, with official terms requiring rights and consent for cloned voices.

Is Cartesia only an API?

No. Cartesia is developer-led and API-first, but it also offers a web workspace and developer console for account, voice, and agent workflows.

Does Cartesia allow commercial use?

Cartesia lists commercial use as included on paid self-serve plans. Enterprise use cases that need custom deployment, compliance, or high volume should verify terms with sales.

Recent updates

Changelog snapshot

feature

May 4, 2026sonic-3.5-2026-05-04

Cartesia documented a stable Sonic 3.5 TTS snapshot for production use alongside its broader Sonic 3.5 and Ink 2 real-time voice stack launch.

Decision rail

Keep the key facts, primary action, and section jumps visible while you evaluate the profile.

Visit Official Site

Last verified June 24, 2026

Pricing
From $5/mo + usage
Platforms
Web
Access
Web app, API, SDK, Developer console
Best for
Real-time voice agents and conversational audio products and Low-latency TTS APIs for interactive apps and Consent-based voice cloning and localized speech and Developer-owned voice stacks that combine TTS, STT, and agent minutes
Category
AI Voice Generators

On this page

Share

Pass this page along

Copy the link or send it to the channel where your team compares tools, pricing, and tradeoffs.

Navigation

Tool cluster

Reference links