Amazon Polly
AWSAI & MLFree tier availableNeural text-to-speech service that converts text into lifelike speech in 40+ languages with dozens of voices, including expressive long-form, generative, and newscaster speaking styles plus SSML markup and phoneme control
Attributes
- SLA Uptime
- 99.9%
- Streaming
- Yes
Sub-services (4)
Standard Voices
Concatenative voices optimised for low cost and wide language coverage
Neural Voices
Deep-learning voices with natural intonation and prosody
Generative Voices
Emotionally expressive voices built on a billion-parameter generative model
Long-Form Voices
Audiobook- and podcast-grade voices tuned for sustained narration
Compliance & Certifications
This service is attested for the following frameworks. Always verify with the provider before relying on a specific compliance posture.
Where this runs
Sovereign regions (5)
- AWS GovCloud (US-East) · AshburnAWS GovCloud (US)
- AWS GovCloud (US-West) · HillsboroAWS GovCloud (US)
- AWS European Sovereign Cloud (Brandenburg) · BrandenburgAWS European Sovereign Cloud
- China (Beijing) · BeijingAWS China (Sinnet)
- China (Ningxia) · YinchuanAWS China (NWCD)
Commercial regions (33)
Europe (8)
- Europe (Paris)
- Europe (Frankfurt)
- Europe (Ireland)
- Europe (Milan)
- Europe (Spain)
- Europe (Stockholm)
- Europe (Zurich)
- Europe (London)
North America (7)
- Canada West (Calgary)
- Canada (Central)
- Mexico (Central)
- US East (N. Virginia)
- US West (Oregon)
- US East (Ohio)
- US West (N. California)
South America (1)
- South America (São Paulo)
Asia (11)
- Asia Pacific (Hong Kong)
- Asia Pacific (Hyderabad)
- Asia Pacific (Mumbai)
- Asia Pacific (Jakarta)
- Asia Pacific (Osaka)
- Asia Pacific (Tokyo)
- Asia Pacific (Malaysia)
- Asia Pacific (Singapore)
- Asia Pacific (Seoul)
- Asia Pacific (Taipei)
- Asia Pacific (Thailand)
Oceania (2)
- Asia Pacific (Melbourne)
- Asia Pacific (Sydney)
Middle East (3)
- Middle East (Bahrain)
- Israel (Tel Aviv)
- Middle East (UAE)
Africa (1)
- Africa (Cape Town)
Tags
Equivalent services on other platforms
Pre-built AI APIs for vision, speech, language, and decision
Neural text-to-speech with 380+ voices in 50+ languages, including premium Journey voices, Studio voices for long-form narration, and Custom Voice for cloning an organisation's brand voice, plus full SSML and phoneme control