AI platform for speech workflows

Turn audio and video into text with AI — 98+ languages

Online transcription service: speech to text, audio to text and video to text with subtitles and export. Try the free demo without registration.

  • 98+ languages
  • API
  • Subtitles
  • Voiceover
No registration • 60 sec demo • Secure
restaurant-dialogue.mp3 · 586 KB · 0:30 · Russian

00:00:00 Good evening and welcome to our restaurant.

00:00:03 Good evening. We have a dinner reservation for two people today at 7 pm.

00:00:08 Yes, sure. Can I have your name please?

00:00:12 The reservation is under the name of Brian.

00:00:15 Okay, please follow me.

00:00:18 Yes, of course.

00:00:20 Here's your table and the menu.

00:00:22 Your waiter will be with you in a moment.

00:00:25 Thank you very much.

00:00:27 You're very welcome.

00:00:28 Enjoy your evening.

English → Russian · Model: Fast

00:00:00 Добрый вечер, добро пожаловать в наш ресторан.

00:00:03 Добрый вечер. У нас забронирован стол на двоих сегодня в 19:00.

00:00:08 Да, конечно. Могу я узнать ваше имя?

00:00:12 Бронь на имя Брайана.

00:00:15 Хорошо, пройдёмте за мной.

00:00:18 Да, конечно.

00:00:20 Вот ваш стол и меню.

00:00:22 Официант подойдёт к вам через минуту.

00:00:25 Большое спасибо.

00:00:27 Пожалуйста.

00:00:28 Приятного вечера.

Trusted by creators and teams

250 000+

minutes processed

20 000+

files uploaded

98+

supported languages

99.9%

service uptime

24/7

support

60 sec demo

Try Polyglot Voice without registration

Record or upload audio or video, or paste a link (YouTube and RuTube links are supported). The demo processes only the start of the clip: long files are automatically trimmed to about 60 s and to the demo size limit. You get a transcript, a translation, and a voice sample.

The server deletes the upload and temporary processing files after building the response. A browser recording stays only on your device; after a successful demo we clear the selected file in the form. The limits are listed below.

For the full studio without demo limits, sign up or log in (it takes about a minute).

The microphone or headset used for recording is selected in your OS and in the browser’s site settings (lock icon in the address bar). There are no separate “headphones vs mic” buttons here: the page uses a single audio input, and the default device depends on your system.

Demo limits

  • Up to 60 s of audio or video per attempt
  • File size up to 25 MB
  • Uploads up to 150 MB per attempt (the server then trims the file for the demo)
  • Up to 3 requests per IP per hour
  • Voice sample (TTS) uses up to 180 characters of text
  • For a link, only the first ~60 s are downloaded; device uploads are received whole but only the start is processed
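The limits above can be read as a small set of checks. The following is a minimal sketch, not the service's actual logic: the function `plan_demo`, the `DemoPlan` type, and the rejection messages are hypothetical. It also assumes that the 150 MB figure is the hard upload cap and that 25 MB is the size a file is trimmed down to for the demo, which the page does not state explicitly.

```python
from dataclasses import dataclass

# Values copied from the demo limits listed above.
MAX_SECONDS = 60            # audio/video processed per attempt
MAX_DEMO_FILE_MB = 25       # assumed: size the demo works with after trimming
MAX_UPLOAD_MB = 150         # assumed: hard cap on what the server accepts
MAX_REQUESTS_PER_HOUR = 3   # per IP
MAX_TTS_CHARS = 180         # text used for the voice sample


@dataclass
class DemoPlan:
    accepted: bool
    reason: str
    processed_seconds: float
    trimmed: bool
    tts_text: str


def plan_demo(upload_mb: float, duration_s: float,
              requests_last_hour: int, translated_text: str) -> DemoPlan:
    """Decide what a demo attempt would process (hypothetical helper)."""
    if requests_last_hour >= MAX_REQUESTS_PER_HOUR:
        return DemoPlan(False, "rate limit reached", 0.0, False, "")
    if upload_mb > MAX_UPLOAD_MB:
        return DemoPlan(False, "upload too large", 0.0, False, "")
    # Only the start of the clip is processed.
    processed = min(duration_s, MAX_SECONDS)
    trimmed = duration_s > MAX_SECONDS or upload_mb > MAX_DEMO_FILE_MB
    # The voice sample uses only the first 180 characters.
    return DemoPlan(True, "ok", processed, trimmed,
                    translated_text[:MAX_TTS_CHARS])


if __name__ == "__main__":
    # The sample file from this page: 586 KB, 30 s, first request this hour.
    print(plan_demo(0.586, 30, 0, "Добрый вечер, добро пожаловать в наш ресторан."))
```

A ten-minute, 100 MB upload would still be accepted under these assumptions, but only its first 60 s would be processed and the plan would be marked as trimmed.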

How it works

Upload a file

Audio, video, link, or microphone recording.

Choose a workflow

Transcription, translation, subtitles, or voiceover.

AI processing

Speech recognition and translation to your target language.

Export

TXT, SRT, VTT, ASS, CSV, ZIP, or media file.
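To illustrate what the SRT export involves, here is a minimal sketch that turns timestamped lines like those in the demo transcript into SRT cues. It is not Polyglot Voice's exporter: the helpers `srt_timestamp` and `to_srt` are hypothetical, and the sketch assumes each cue ends where the next one starts, with the last cue ending at the clip length.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms_total = round(seconds * 1000)
    hours, rem = divmod(ms_total, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: list[tuple[float, str]], clip_end: float) -> str:
    """Build SRT text from (start_seconds, text) pairs.

    Assumption: a cue lasts until the next cue starts; the final cue
    lasts until clip_end.
    """
    blocks = []
    for i, (start, text) in enumerate(segments):
        end = segments[i + 1][0] if i + 1 < len(segments) else clip_end
        blocks.append(
            f"{i + 1}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}"
        )
    # Cues are separated by a blank line.
    return "\n\n".join(blocks) + "\n"


if __name__ == "__main__":
    # First three lines of the demo transcript on this page (30 s clip).
    demo = [
        (0, "Good evening and welcome to our restaurant."),
        (3, "Good evening. We have a dinner reservation for two people today at 7 pm."),
        (8, "Yes, sure. Can I have your name please?"),
    ]
    print(to_srt(demo, clip_end=30))
```

VTT uses a very similar cue layout, with a `WEBVTT` header line and a period instead of a comma before the milliseconds, so the same segment data could feed both formats.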

What you can do with Polyglot Voice

Accurate transcription

Speech recognition in 98+ languages with timestamps.

AI on your transcript

Summaries, study notes, quizzes, meeting notes, and YouTube descriptions generated from a finished transcript.

Video translation

Translate speech and subtitles in one workspace.

Audio to text

MP3, WAV, podcasts, and recordings into editable text.

Real-time speech

Record from microphone with live processing.

Subtitles

SRT, VTT, ASS — ready files for publishing.

Voiceover (TTS)

Natural voices for translated content.

Clip cutting

Short clips and reels from long videos.

Developer API

REST API, webhooks, and keys in your dashboard.

Who it's for

For students

Lectures → study notes

For creators

Video → subtitles and translation

For business

Meetings → minutes and docs

For developers

API → pipeline automation

Supported formats

Audio & video

MP4, MOV, WEBM, M4V, MP3, WAV, OGG, FLAC, AAC

and other popular formats

Export

TXT, SRT, VTT, ASS, CSV, ZIP

and other popular formats

We support 98+ languages

English

en

Russian

ru

German

de

French

fr

Spanish

es

Chinese (Simplified)

zh


Frequently asked questions

Can I convert audio and video to text online?

Yes. Polyglot Voice is designed for audio-to-text and video-to-text workflows, with multilingual transcription and export to TXT, SRT, VTT, ASS, CSV, or ZIP.

Can I translate speech into another language?

Yes. You can use transcript and translation workflows together to turn spoken content into translated text for subtitles, notes and publishing.

Is it useful for lectures, interviews and podcasts?

Yes. The workflow is especially useful for lectures, interviews, meetings, podcasts and creator content that needs searchability, subtitles or repurposing.

Do you support many languages?

Yes. Polyglot Voice supports 98+ languages, covering many spoken input languages as well as multilingual output workflows.

How do refunds work?

See Payment & refunds for terms. If your quota was not credited after payment, email polyglotvoicehello@gmail.com with the payment date and account email.

Try Polyglot Voice right now

No registration • 60-second demo • All features