Polyglot Voice
This is a Polyglot Voice guide. Polyglot Voice is our platform for turning speech into text and translating it.
To transcribe or translate audio or video into text, sign in or create a free account (it only takes a minute). The studio supports 98 languages; you can change direction and models anytime after upload.
Speech-to-Text API
Use the API when transcription and translation should run inside your product, bot, internal tool or automation. Create tasks from files or links, track status, receive webhooks and fetch text results.
- Upload audio/video or create tasks from supported URLs
- Receive completion webhooks and fetch structured results
- Connect transcription to translation, subtitles and automation
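The "track status" step can be sketched as a small polling loop. This is only an illustration under stated assumptions: the status names (`queued`, `processing`, `completed`, `failed`), the `status` field and the `get_status` callable are hypothetical and are not taken from the Polyglot Voice API reference. The transport is passed in as a function so the loop can be tried without any network access.

```python
import time
from typing import Callable, Dict

# Assumption: hypothetical terminal status names; the real API may differ.
TERMINAL_STATUSES = {"completed", "failed"}


def wait_for_task(
    get_status: Callable[[str], Dict],
    task_id: str,
    interval_s: float = 2.0,
    max_attempts: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict:
    """Poll a task until it reaches a terminal status or attempts run out.

    `get_status` is whatever function performs the status request
    (hypothetical here) and returns the decoded task as a dict.
    """
    for _ in range(max_attempts):
        task = get_status(task_id)
        if task.get("status") in TERMINAL_STATUSES:
            return task
        sleep(interval_s)  # wait before asking again
    raise TimeoutError(f"task {task_id} did not finish after {max_attempts} polls")
```

If a completion webhook is configured, polling like this is mainly useful as a fallback, for example when a webhook delivery is missed.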
Try the workflow before paying
From recording to text, translation or voiceover in a few steps
The path is simple: upload a file or record speech, get text, then translate, dub or export the result. The free start is enough to judge the quality.
Quick price estimate
Transcription from: 490 ₽
Voiceover approx.: 10 ₽
This is an estimate: final cost depends on quality mode, pack and tariff.
How it works
- Upload audio/video, paste a link or record speech in the browser.
- Choose language, quality and the target result: text, translation, subtitles or voiceover.
- Get the result in the studio and add minutes, TTS characters or API access when needed.
Example result
Before: a lecture recording, interview or video in another language.
After: structured text, translation, subtitles and a base for voiceover or clips.
Why users can trust it
- Source files are removed from disk about 1 hour after processing.
- Start for free and pay only for the minutes, attempts or TTS characters you need.
- Use the web studio for one-off jobs or API access for automation.
What plans include
- Transcription minutes/attempts
- TTS characters for voiceover
- Task history, export, API keys and webhooks
Honest limitations
- Demo is limited to 60 seconds
- Quality depends on noise and speech clarity
- Long files and heavier models require a plan
How it works
- Upload the source audio or video file that matches your workflow.
- Choose the task, language or translation direction that fits your goal.
- Review the result, export the output and continue with subtitles, clips or API automation.
Supported formats and inputs
Best for
- voice message bots
- call processing
- media archives
- internal automation
FAQ
How do I start using the speech-to-text API?
Create an API key in the dashboard, send an upload request and fetch the result when the task is completed.
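As a rough illustration of the upload-then-fetch flow, the sketch below only builds the two HTTP requests with the Python standard library and does not send them. Everything specific in it is an assumption: the base URL `https://api.example.com/v1`, the `/tasks` and `/tasks/{id}/result` paths, the `filename` query parameter and the Bearer authorization scheme are placeholders, so check the dashboard or API reference for the real values.

```python
import urllib.request
from urllib.parse import quote, urlencode

# Assumption: placeholder base URL, not the real Polyglot Voice endpoint.
API_BASE = "https://api.example.com/v1"


def build_upload_request(api_key: str, filename: str, data: bytes) -> urllib.request.Request:
    """Build (but do not send) a hypothetical upload request for one media file."""
    query = urlencode({"filename": filename})
    return urllib.request.Request(
        url=f"{API_BASE}/tasks?{query}",
        data=data,
        method="POST",
        headers={
            # Assumption: the API key is sent as a Bearer token.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/octet-stream",
        },
    )


def build_result_request(api_key: str, task_id: str) -> urllib.request.Request:
    """Build (but do not send) a hypothetical request for a finished task's result."""
    return urllib.request.Request(
        url=f"{API_BASE}/tasks/{quote(task_id, safe='')}/result",
        method="GET",
        headers={"Authorization": f"Bearer {api_key}"},
    )
```

Either request could then be passed to `urllib.request.urlopen` once the placeholders are replaced with the documented values.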
Does the API support webhooks?
Yes. You can configure a completion webhook to receive events when transcription tasks finish.
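On the receiving side, a webhook handler usually starts by validating the incoming body before acting on it. The sketch below assumes a JSON payload with `task_id` and `status` string fields; both field names are hypothetical, since the actual event format is not described here.

```python
import json
from dataclasses import dataclass
from typing import Optional


@dataclass
class CompletionEvent:
    task_id: str  # assumption: hypothetical field name
    status: str   # assumption: hypothetical field name


def parse_completion_event(body: bytes) -> Optional[CompletionEvent]:
    """Return a CompletionEvent for a well-formed payload, or None otherwise."""
    try:
        payload = json.loads(body)
    except ValueError:  # covers invalid JSON and undecodable bytes
        return None
    if not isinstance(payload, dict):
        return None
    task_id = payload.get("task_id")
    status = payload.get("status")
    if not isinstance(task_id, str) or not isinstance(status, str):
        return None
    return CompletionEvent(task_id=task_id, status=status)
```

Returning `None` for malformed input lets the surrounding web framework answer with an error status instead of raising inside the handler.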
Can I transcribe video through the API?
Yes. The API accepts supported audio and video files, as well as URL-based tasks, subject to your plan and limits.
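For a URL-based task, the request body might look like the sketch below. The `source_url` and `language` field names are assumptions made for illustration, and the scheme check is a local sanity test rather than a statement of which URLs the service supports.

```python
import json
from urllib.parse import urlparse


def build_url_task_body(media_url: str, language: str) -> bytes:
    """Encode a hypothetical JSON body for a task created from a media link."""
    parsed = urlparse(media_url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an http(s) URL: {media_url!r}")
    # Assumption: hypothetical field names, not the documented schema.
    body = {"source_url": media_url, "language": language}
    return json.dumps(body).encode("utf-8")
```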