Turn audio and video into text with AI — 98+ languages
Online transcription service: speech to text, audio to text and video to text with subtitles and export. Try the free demo without registration.
- 98+ languages
- API
- Subtitles
- Voiceover
00:00:00 Good evening and welcome to our restaurant.
00:00:03 Good evening. We have a dinner reservation for two people today at 7 pm.
00:00:08 Yes, sure. Can I have your name please?
00:00:12 The reservation is under the name of Brian.
00:00:15 Okay, please follow me.
00:00:18 Yes, of course.
00:00:20 Here's your table and the menu.
00:00:22 Your waiter will be with you in a moment.
00:00:25 Thank you very much.
00:00:27 You're very welcome.
00:00:28 Enjoy your evening.
00:00:00 Добрый вечер, добро пожаловать в наш ресторан.
00:00:03 Добрый вечер. У нас забронирован стол на двоих сегодня в 19:00.
00:00:08 Да, конечно. Могу я узнать ваше имя?
00:00:12 Бронь на имя Брайана.
00:00:15 Хорошо, пройдёмте за мной.
00:00:18 Да, конечно.
00:00:20 Вот ваш стол и меню.
00:00:22 Официант подойдёт к вам через минуту.
00:00:25 Большое спасибо.
00:00:27 Пожалуйста.
00:00:28 Приятного вечера.
Trusted by creators and teams
250,000+ minutes processed
20,000+ files uploaded
98+ supported languages
99.9% service uptime
24/7 support
60 sec demo
Try Polyglot Voice without registration
Record or upload audio/video, or paste a link (including YouTube / RuTube). The demo processes only the start of the clip: longer files are automatically trimmed to about 60 s and to the demo size limit. You get a transcript, a translation, and a voice sample.
The server deletes the upload and temporary processing files after building the response. A browser recording stays only on your device; after a successful demo we clear the selected file in the form. See Demo limits below.
Full studio without demo limits — sign up or log in (about a minute).
Your microphone or headset is selected in your OS and in the browser's site settings (lock icon in the address bar). There are no separate "headphones vs mic" buttons here: the page uses a single audio input, and which device that is depends on your default settings.
Demo limits
- Up to 60 s of audio or video per attempt
- File size up to 25 MB
- Upload size up to 150 MB per attempt (the server then trims the file for the demo)
- Up to 3 requests per IP per hour
- Voice sample (TTS) uses up to 180 characters of text
- For a link, only the first ~60 s are downloaded; files uploaded from your device are received in full, but only the start is processed
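The limits above can be pictured as a small set of checks. The following is only a sketch of how such checks might look, not the service's actual implementation: the names `DemoRateLimiter` and `plan_demo` are hypothetical, 1 MB is assumed to mean 1024 * 1024 bytes, and the "3 requests per IP per hour" rule is assumed to be a sliding window.

```python
from collections import deque

# Values taken from the demo limits listed above.
MAX_PROCESSED_SECONDS = 60
MAX_UPLOAD_BYTES = 150 * 1024 * 1024   # assumption: 1 MB = 1024 * 1024 bytes
MAX_REQUESTS_PER_HOUR = 3
MAX_TTS_CHARS = 180
WINDOW_SECONDS = 3600


class DemoRateLimiter:
    """Hypothetical sliding-window limiter: at most 3 requests per IP per hour."""

    def __init__(self):
        self._hits = {}  # ip -> deque of request timestamps (seconds)

    def allow(self, ip, now):
        hits = self._hits.setdefault(ip, deque())
        # Forget requests that are an hour old or older.
        while hits and now - hits[0] >= WINDOW_SECONDS:
            hits.popleft()
        if len(hits) >= MAX_REQUESTS_PER_HOUR:
            return False
        hits.append(now)
        return True


def plan_demo(upload_bytes, duration_s, tts_text):
    """Reject oversized uploads, then trim the clip and the voice-sample text."""
    if upload_bytes > MAX_UPLOAD_BYTES:
        raise ValueError("upload exceeds the 150 MB demo limit")
    return {
        "process_seconds": min(duration_s, MAX_PROCESSED_SECONDS),
        "tts_text": tts_text[:MAX_TTS_CHARS],
    }
```

For example, a 95-second clip would be planned as 60 seconds of processing, and a fourth request from the same IP within an hour would be refused.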
How it works
Upload a file
Audio, video, link, or microphone recording.
Choose a workflow
Transcription, translation, subtitles, or voiceover.
AI processing
Speech recognition and translation to your target language.
Export
TXT, SRT, VTT, ASS, CSV, ZIP, or media file.
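To illustrate the export step, here is a rough sketch of how timestamped transcript lines could be turned into SRT text. It is not Polyglot Voice's own exporter: the function names are hypothetical, each cue is assumed to end where the next one starts, and the last cue is given an assumed fixed duration.

```python
def to_srt_time(seconds):
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms = int(round(seconds * 1000))
    h, rem = divmod(ms, 3_600_000)
    m, rem = divmod(rem, 60_000)
    s, ms = divmod(rem, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def segments_to_srt(segments, last_duration=2.0):
    """Build SRT text from (start_seconds, text) pairs.

    Assumption: a cue ends when the next cue starts; the final cue
    lasts `last_duration` seconds.
    """
    blocks = []
    for i, (start, text) in enumerate(segments):
        if i + 1 < len(segments):
            end = segments[i + 1][0]
        else:
            end = start + last_duration
        blocks.append(
            f"{i + 1}\n{to_srt_time(start)} --> {to_srt_time(end)}\n{text}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(blocks)


if __name__ == "__main__":
    sample = [
        (0, "Good evening and welcome to our restaurant."),
        (3, "Good evening. We have a dinner reservation for two people today at 7 pm."),
        (8, "Yes, sure. Can I have your name please?"),
    ]
    print(segments_to_srt(sample))
```

VTT output would differ mainly in its `WEBVTT` header line and in using a period instead of a comma before the milliseconds.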
What you can do with Polyglot Voice
Accurate transcription
Speech recognition in 98+ languages with timestamps.
AI on your transcript
Summaries, study notes, quizzes, meeting notes, and YouTube descriptions from ready text.
Video translation
Translate speech and subtitles in one workspace.
Audio to text
MP3, WAV, podcasts, and recordings into editable text.
Real-time speech
Record from microphone with live processing.
Subtitles
SRT, VTT, ASS — ready files for publishing.
Voiceover (TTS)
Natural voices for translated content.
Clip cutting
Short clips and reels from long videos.
Developer API
REST API, webhooks, and keys in your dashboard.
Who it's for
For students
Lectures → study notes
For creators
Video → subtitles and translation
For business
Meetings → minutes and docs
For developers
API → pipeline automation
Supported formats
Audio & video
MP3, WAV, and other popular formats
Export
TXT, SRT, VTT, ASS, CSV, ZIP, and other popular formats
We support 98+ languages
English (en)
Russian (ru)
German (de)
French (fr)
Spanish (es)
Chinese (Simplified) (zh)
Frequently asked questions
Can I convert audio and video to text online?
Yes. Polyglot Voice is designed for audio-to-text and video-to-text workflows with support for multilingual transcription and export-friendly results.
Can I translate speech into another language?
Yes. You can combine the transcription and translation workflows to turn spoken content into translated text for subtitles, notes, and publishing.
Is it useful for lectures, interviews and podcasts?
Yes. The workflow is especially useful for lectures, interviews, meetings, podcasts and creator content that needs searchability, subtitles or repurposing.
Do you support many languages?
Yes. Polyglot Voice recognizes speech in 98+ languages and can translate the result into your target language for multilingual output.
How do refunds work?
See Payment & refunds for terms. If your quota was not credited after payment, email polyglotvoicehello@gmail.com with the payment date and account email.