Back
3 min read
Getting Started

How to Make Your First Audio Transcription - VoxScriber Tutorial

Learn step by step how to make your first audio-to-text transcription using AI. Complete guide with tips and best practices.

How to Make Your First Transcription

Transform your first audio file into text using our advanced artificial intelligence. A step-by-step guide for perfect results.

Overview: Transcription in 4 Simple Steps

  1. Upload Audio - Send your file
  2. Settings - Adjust the options
  3. AI Processing - Wait for processing
  4. Download Text - Download the result

Before You Start - Checklist

You should have:

  • An account created and verified on VoxScriber
  • An audio file saved on your computer
  • Credits available in your account
  • A stable internet connection

Quality Recommendations

  • Clear audio with minimal background noise
  • Speech in a supported language
  • MP3, WAV, M4A, or FLAC format
  • Maximum duration of 2 hours for testing

Step 1: Upload Your Audio File

Upload Your Audio File

The first step is to send your audio file to our platform. There are two main ways to do this:

Method 1: Drag and Drop

  1. Log in to your VoxScriber account
  2. Go to the "New Transcription" area
  3. Drag the file from your computer
  4. Drop it onto the indicated area on the screen
  5. Wait for the upload to complete (progress bar)

Method 2: Upload Button

  1. Click the "Select File" button
  2. Navigate to the folder with your audio
  3. Select the desired file
  4. Click "Open" to confirm
  5. The upload will start automatically

Important Upload Tips:

  • File name: Use descriptive names (e.g., "project_meeting_2024.mp3")
  • Upload time: Depends on file size and internet speed
  • Do not close the tab: Wait for the upload to finish completely
  • Large files: May take several minutes to upload

Step 2: Configure Transcription Options

Configure Transcription Options

After uploading, you can adjust some settings to optimize the transcription result:

Transcription Name

The name that will appear in your history to identify this transcription.

  • Use descriptive names
  • Include dates if relevant
  • Avoid special characters

Processing Quality

Standard (Recommended) - Best value for money

  • Speed: Fast
  • Accuracy: Good (90-95%)
  • Use: Ideal for most cases

High Quality - Maximum accuracy

  • Speed: Moderate
  • Accuracy: Excellent (95-98%)
  • Use: Important audio files

Quick - Fast processing

  • Speed: Very fast
  • Accuracy: Good (85-92%)
  • Use: Tests and drafts
For your first transcription, we recommend using Standard Quality with an audio file of up to 5 minutes. This lets you get familiar with the system quickly and see the results without using too many credits.

Step 3: Start AI Processing

Start AI Processing

Now is when the magic happens! Our artificial intelligence will process your audio and convert it to text:

Starting the Processing

  1. Review your settings
  2. Click "Start Transcription"
  3. Confirm the credit usage
  4. Wait for the AI to process the audio
  5. Track progress in real time

Processing Times

  • 1 minute of audio: ~15-30 seconds
  • 5 minutes of audio: ~1-2 minutes
  • 30 minutes of audio: ~5-10 minutes
  • 1 hour of audio: ~10-20 minutes

Approximate times. May vary depending on quality selected and server load.

What Happens During Processing?

AI Stages:

  1. Audio file analysis
  2. Segmentation into smaller parts
  3. AI speech recognition
  4. Text correction and refinement
  5. Final result formatting

While Waiting:

  • Keep the browser tab open
  • You will receive a notification when it is done
  • You do not need to stay on the page

Step 4: Download and Use the Result

Download and Use the Result

When processing is complete, you will have access to the full text. Review, edit, and export in the formats you need.

Next Steps