Translate Audio to Any Language
Upload any audio file — Voxlated's AI engine transcribes, translates, and synthesises natural speech in 20+ languages. Free, no sign-up, runs in your browser.
1. Upload Audio File
Drop audio file here or click to browse
MP3, WAV, OGG, M4A, AAC, FLAC · Max 10 MB
2. Source Language (audio / video)
Use Auto-detect to detect the language from the file, or choose a language to force it.
3. Select Target Language
4. Select Voice
Your translated audio will appear here
Upload an audio file, pick a language, and start translation.
How Voxlated Works
Translate audio or video to any language in three simple steps — powered by state-of-the-art AI models.
Upload Your File
Drop an audio file (MP3, WAV, OGG, M4A, AAC, FLAC) or video file (MP4, MOV, WEBM, AVI, MKV). Processing starts instantly in your browser.
Choose Language & Voice
Select from 20+ target languages and preview natural-sounding voices. Auto-detect or manually set the source language.
Download Translation
Whisper transcribes → LLM translates → Edge TTS synthesises. Download your translated audio (MP3) or video (MP4) instantly.
20+ Supported Languages
Translate audio and video between any of these languages with natural-sounding AI voices.
Frequently Asked Questions
Everything you need to know about using Voxlated to translate your audio and video files.
Is Voxlated free to use?
Yes, Voxlated is completely free. There are no subscriptions, hidden fees, or usage limits. Simply upload your audio or video file and translate it to any supported language.
Which languages does Voxlated support?
Voxlated supports 20+ languages including Hindi, Spanish, French, German, Japanese, Korean, Arabic, Portuguese, Russian, Italian, Dutch, Indonesian, Polish, Bangla, Gujarati, Hebrew, Malayalam, Punjabi, Tamil, Telugu, and Ukrainian.
How does the AI translation work?
Voxlated uses a three-step AI pipeline: (1) OpenAI Whisper transcribes the speech from your file, (2) a large language model (Mistral or Qwen) translates the text, and (3) Microsoft Edge TTS synthesizes natural-sounding speech in the target language. For video, the translated audio replaces the original track.
Is my data private and secure?
Yes. Audio and video processing (FFmpeg) runs entirely in your browser using WebAssembly. Only the audio data is sent to the server for transcription and translation. Output files are stored temporarily (4-hour TTL) and then automatically deleted.
What file formats are supported?
For audio: MP3, WAV, OGG, M4A, AAC, and FLAC (up to 10 MB). For video: MP4, MOV, WEBM, AVI, and MKV (up to 50 MB).
Do I need to install anything?
No. Voxlated runs entirely in your browser — no downloads, no sign-ups, no extensions. Just open the website and start translating.