Polyglot Voice

You are reading a Polyglot Voice guide — our platform for turning speech into text and translating it.

To transcribe or translate audio or video into text, sign in or create a free account (it only takes a minute). The studio supports 98 languages; you can change direction and models anytime after upload.

Speech-to-Text API

Use the API when transcription and translation should run inside your product, bot, internal tool or automation. Create tasks from files or links, track status, receive webhooks and fetch text results.

Upload audio/video or create tasks from supported URLs

Receive completion webhooks and fetch structured results

Connect transcription to translation, subtitles and automation

Try the workflow before paying

From recording to text, translation or voiceover in a few steps

Give visitors a clear path: upload a file or record speech, get text, then translate, dub or export the result. The free start is enough to understand the quality.

AI speechTranslationSubtitles

60 sec demo Try for free View pricing

Quick price estimate

Transcription minutes10 min

Voiceover characters10,000

Transcription from: 490 ₽

Voiceover approx.: 10 ₽

This is an estimate: final cost depends on quality mode, pack and tariff.

How it works

Upload audio/video, paste a link or record speech in the browser.
Choose language, quality and the target result: text, translation, subtitles or voiceover.
Get the result in the studio and add minutes, TTS characters or API access when needed.

Example result

Before

Before: a lecture recording, interview or video in another language.

After

After: structured text, translation, subtitles and a base for voiceover or clips.

Why users can trust it

Source files are removed from disk about 1 hour after processing.
Start for free and pay only for the minutes, attempts or TTS characters you need.
Use the web studio for one-off jobs or API access for automation.

What plans include

Transcription minutes/attempts
TTS characters for voiceover
Task history, export, API keys and webhooks

Honest limitations

Demo is limited to 60 seconds
Quality depends on noise and speech clarity
Long files and heavier models require a plan

How it works

Upload the source audio or video file that matches your workflow.
Choose the task, language or translation direction that fits your goal.
Review the result, export the output and continue with subtitles, clips or API automation.

Supported formats and inputs

REST APImultipart uploadURL taskswebhooks

Best for

voice message bots
call processing
media archives
internal automation

FAQ

How do I start using the speech-to-text API?

Create an API key in the dashboard, send an upload request and fetch the result when the task is completed.

Does the API support webhooks?

Yes. You can configure a completion webhook to receive events when transcription tasks finish.

Can I transcribe video through the API?

Yes. The API accepts supported audio/video files and URL-based tasks depending on your plan and limits.

Audio to Text Video to Text Subtitles FAQ

Speech-to-Text API

From recording to text, translation or voiceover in a few steps

Quick price estimate

How it works

Example result

Why users can trust it

What plans include

Honest limitations

How it works

Supported formats and inputs

Best for

FAQ

How do I start using the speech-to-text API?

Does the API support webhooks?

Can I transcribe video through the API?

Related guides

Related pages