Portuguese
TTS Voices

Portuguese text-to-speech voices with natural vowel reduction

TelnyxInWorldMiniMaxRimeAzureAWS

Top 7 TTS for Portuguese

Name	Provider
sol	telnyx
Smart Young Girl	minimax
baltasar	telnyx
celso	rime
Antonio	azure
Camila	aws
Maitê	inworld

Test Portuguese voices

[ VOICE AI PLATFORM ]

From text to talk.
Pick your path.

Call our TTS & STT endpoints directly, wire voice into LiveKit rooms with one plug-in, or spin up an AI assistant on a real phone number.

TTS & STT Endpoints

Production-grade streaming and batch TTS/STT. Low latency, 50+ languages, customizable voices, and SDKs for Node/Python/Browser.

›Streaming for live apps
›Multi-speaker diarization & punctuation
›SDKs, code samples, and latency benchmarks

TTS — CURL
$ curl -X POST \
".../v1/tts" \
-H "Authorization: Bearer $API_KEY" \
-d '{
"voice": "alloy_female_v1",
"language": "en-US",
"format": "mp3",
"text": "Hello, welcome..."
} ' --output speech.mp3

Sends text to the TTS endpoint and saves the synthesized audio as an MP3 file.

View TTS docs →

LiveKit Plug-in

Plug our real-time speech pipeline into LiveKit rooms — transcribe live sessions, synthesize responses and stream audio back into the room.

›One-line install, example room demo
›WebRTC + server bridge patterns
›Works in browser & mobile

LIVEKIT — NODE.JS
import { Room } from "livekit-client";
import { TelnyxSpeechPlugin }
from "@telnyx/livekit-plugin";
const room = new Room();
await room.connect(URL, token);
const plugin = new TelnyxSpeechPlugin({
apiKey: process.env.TELNYX_API_KEY,
voice: "alloy_female_v1",
});
plugin.attach(room);

Connects to a LiveKit room and attaches real-time TTS/STT — transcribes audio in, synthesizes audio out.

Try LiveKit demo →

AI-Assistants (Phone)

Deploy a phone-number based AI assistant in minutes — inbound/outbound calls, IVR, call recording, and DTMF support.

›Purchase & map a phone number
›Templates: Support Bot, Sales Assistant, Reminder Bot
›PSTN reliability & compliance tools

AI-ASSISTANT — CURL
$ curl -X POST \
".../v1/assistants" \
-H "Authorization: Bearer $API_KEY" \
-d '{
"name": "Support Bot",
"phone_number": "+18005551234",
"voice": "alloy_female_v1",
"system_prompt": "You are a
helpful support agent.",
"capabilities": ["inbound",
"recording", "dtmf"]
} '

Creates an AI assistant bound to a phone number with inbound call handling, recording, and DTMF support.

Create your assistant →

Spanish voices

294TTS voices

Español

Browse →

French voices

98TTS voices

Français

Browse →

German voices

82TTS voices

Deutsch

Browse →

Indonesian voices

31TTS voices

Bahasa Indonesia

Browse →

Italian voices

51TTS voices

Italiano

Browse →

Japanese voices

85TTS voices

日本語

Browse →

Korean voices

171TTS voices

한국어

Browse →

Portuguese voices

277TTS voices

Português

Browse →

Russian voices

34TTS voices

Русский

Browse →

Chinese voices

189TTS voices

中文

Browse →

Portuguese phonology and prosody

Vowels that vanish between dialects

Portuguese runs two vowel reduction systems under one language. European Portuguese compresses unstressed vowels aggressively: unstressed /e/ often reduces to [ɨ] or disappears entirely in fast speech, giving Lisbon Portuguese its "mumbled" reputation. Brazilian Portuguese keeps unstressed vowels far more intact, producing clearer, open syllables. English reduces unstressed vowels to schwa but never deletes them the way European Portuguese does. A TTS system that handles one dialect correctly sounds wrong in the other. Producing both demands inference that applies the right reduction rules per variant, running where audio is processed: not split across providers.

Open, closed, and the mid-vowel split

Portuguese distinguishes open and closed mid vowels: /ɛ/ vs. /e/, /ɔ/ vs. /o/: a contrast English does not make. The word "avô" (grandfather) carries a closed /o/, while "avó" (grandmother) carries an open /ɔ/; the written accent is the only visible difference, and vowel quality carries the entire meaning. These contrasts hold in stressed syllables but collapse in unstressed positions. Flattening this four-way mid-vowel space into English-style contrasts produces speech that sounds foreign immediately. Accurate Portuguese requires models trained on this stress-conditioned vowel inventory, co-located with telephony so the spectral detail survives without inter-provider degradation.

Two rhythms in one language

European Portuguese patterns as stress-timed: stressed syllables land at regular intervals while unstressed material compresses between them, with heavier reduction than English. Brazilian Portuguese shifts toward syllable-timing, distributing duration more evenly, producing the flowing quality English speakers often call melodic. Brazilian varieties also use wider pitch movements and characteristic final rise-fall patterns. Imposing one rhythmic model on both dialects breaks naturalness. Getting this right requires synthesis co-located with telephony: no inter-provider hops adding latency or flattening the prosodic signal.

Portuguese
TTS Voices

Female Portuguese TTS Voices

Male Portuguese TTS Voices

Brazil Portuguese TTS Voices

Portugal Portuguese TTS Voices

Spanish voices

French voices

German voices

Indonesian voices

Italian voices

Japanese voices

Korean voices

Portuguese voices

Russian voices

Chinese voices

Portuguese phonology and prosody

Vowels that vanish between dialects

Open, closed, and the mid-vowel split

Two rhythms in one language