Question 1

Is my audio uploaded to a server?

Accepted Answer

Transcription runs 100% locally in your browser. After completing, you can optionally share the audio and transcript with us (consent checkbox) to help improve the service — but it's entirely optional and never happens without your consent.

Question 2

Which audio formats are supported?

Accepted Answer

MP3, WAV, M4A, OGG, FLAC, WEBM, MP4, MOV and any format your browser can decode. Video containers are auto-converted.

Question 3

How long can my audio file be?

Accepted Answer

Up to 10 minutes per file in free mode. For longer audio, our Premium plan supports up to 10 hours.

Question 4

Which languages are supported?

Accepted Answer

The Whisper model supports 99 languages including English, Portuguese, Spanish, French, German, Japanese, Arabic and many more. Detection is automatic.

Question 5

Do I need to install anything?

Accepted Answer

No. It works directly in your browser. The AI model (~40MB) downloads once and stays cached for future visits.

Question 6

Is the transcription saved anywhere?

Accepted Answer

By default, no — results stay only in your browser. If you check the consent box, the audio and transcript are sent to our servers and deleted after 7 days. You can decline consent at any time.

Question 7

What is the difference vs. Premium?

Accepted Answer

The free mode uses the VoxScriber Nano model (4-bit quantized, q4) running locally: 10-min limit per file, ~85% accuracy, no speaker diarization, and timestamps only at segment level (~30s chunks — not per word). Premium uses cloud models (AssemblyAI + Whisper Large float32): >95% accuracy, diarization up to 30 speakers, word-level timestamps, files up to 10h, MP4/MOV/MKV video support, and exports to DOCX, PDF, and JSON. Speed: 1h of audio takes ~20min on your local CPU vs ~2min on Premium's dedicated GPU.

Question 8

Does it work on mobile?

Accepted Answer

Yes, but performance depends on your device. On low-RAM smartphones, transcription may be slower.

Question 9

Is it really free?

Accepted Answer

Yes. The browser transcriber is genuinely free with no trial period, no watermark and no signup. We make money from the Premium cloud plans, not from the free tool.

Question 10

Does my audio leave my device?

Accepted Answer

No — transcription runs locally via WebAssembly. The only exception is if you explicitly tick the optional consent checkbox to share a recording with us.

Question 11

Is there a file size limit?

Accepted Answer

The practical limit is duration (10 minutes per file) and your device's memory, not megabytes. A 10-minute MP3 is typically 10-20MB and works fine on most devices.

Question 12

How long does transcription take?

Accepted Answer

With the Nano model, expect roughly 1-2x the audio duration on a modern laptop — a 5-minute file takes about 5-10 minutes. The first run adds a one-time model download of ~40MB.

Question 13

Can I export subtitles (SRT)?

Accepted Answer

Yes — free exports include .txt, .srt and .vtt with segment timestamps. For word-level timestamp precision and DOCX/PDF/JSON exports, see Premium.

Question 14

Can I transcribe several files at once?

Accepted Answer

Yes — you can queue up to 5 files and they are processed one after another in your browser. Premium removes the queue limit and processes files in parallel in the cloud.

Question 15

Why does the first transcription take longer?

Accepted Answer

On your first visit the AI model is downloaded and compiled by your browser. It is then cached, so every later transcription starts immediately.

Question 16

Does it work offline?

Accepted Answer

Partially — once the model is cached, the transcription itself needs no connection. You still need to be online to load the page itself.

	Free (browser)	Premium (cloud)
File limit	10 min	10 horas
Accuracy	~85%	>95%
Speaker diarization	❌	✅
Word-level timestamps	❌	✅
Video support (MP4/MOV)	❌	✅
Export formats	TXT, SRT, VTT	DOCX, PDF, JSON…
Speed (1h of audio)	~2 min / 1h	~2 min / 1h
Privacy	100% local	☁️ + 🔒

Free audio transcription — right in your browser

Local AI

Fast and local

99 languages

No signup needed

How it works

Upload or record audio

AI runs on your device

Copy or download the text

How accurate is browser transcription?

Browser vs cloud transcription: which one do you need?

Supported audio formats

Need more? Try Premium

Speaker diarization

Files up to 10 hours

Summary, sentiment & topics

Full export options

Frequently asked questions

Free transcription in 20 languages

Free transcription by language

Convert by format