Back
4 min read
Transcription

Large File Processing | VoxScriber - Transcribe Long Audio Files

Learn how to process large audio files. Size limits, splitting strategies, and upload optimization tips.

Large File Processing

Complete guide for processing long and large audio files. Learn about limits, optimization strategies, cost management, and best practices for extensive transcriptions.

Limits and Specifications

Individual File

Up to 8 hours - Limit per single upload. Ideal for most cases.

Daily Processing

Up to 50 hours - Total per user per day. For intensive use.

Simultaneous Upload

10 files - Multiple files at the same time. For efficiency.

Processing Strategies

Strategy 1: Manual Splitting

Difficulty: Easy to Medium

Split the file using editing software.

Step by Step:

  1. Use Audacity (free) or similar software
  2. Split into segments of 1-2 hours
  3. Maintain 10-30 seconds of overlap
  4. Export with the same original quality
  5. Name sequentially (part1, part2...)

Advantages:

  • Full control over splitting
  • Can split at logical points
  • Works with any format
  • Allows pre-processing

Disadvantages:

  • Requires additional software
  • Time-consuming manual process
  • Need to combine parts afterward

Strategy 2: Sequential Upload

Difficulty: Easy

Upload the complete file and split if necessary.

Step by Step:

  1. Try a direct upload first
  2. If it fails, check your connection
  3. Compress the file if possible
  4. Use off-peak hours
  5. Monitor upload progress

Advantages:

  • Simpler if it works
  • Less editing work
  • Continuous transcription
  • No loss of context

Disadvantages:

  • May fail with a slow connection
  • Higher bandwidth usage
  • Risk of timeout

Strategy 3: Smart Compression

Difficulty: Medium

Reduce file size while maintaining adequate quality.

Step by Step:

  1. Convert to MP3 at 192kbps or higher
  2. Use mono if you do not need stereo
  3. Remove long silences if appropriate
  4. Normalize volume if needed
  5. Test quality before uploading

Advantages:

  • Significantly reduces file size
  • Faster upload
  • Maintains quality for transcription
  • Saves bandwidth and time

Disadvantages:

  • May reduce audio quality
  • Requires technical knowledge
  • Does not always solve the problem

Optimization Tips

File Preparation

Choose an efficient format - MP3 at 320kbps offers excellent quality with reduced file size

Remove unnecessary parts - Cut musical introductions, long silences, or irrelevant sections

Normalize the audio - Adjusting volume to a consistent level improves efficiency

Upload and Connection

Use a wired connection - Ethernet is more stable than Wi-Fi for long uploads

Avoid peak hours - Uploads are faster during off-peak times

Do not close the browser during upload - Keep the tab open until the upload finishes completely.

Costs for Large Files

Remember: cost is based on audio duration, not file size.

  • 1 hour of audio = 240 cycles (with AssemblyAI at 15 cycles/min)
  • 2 hours = 480 cycles
  • 4 hours = 960 cycles
  • 8 hours = 1,920 cycles
Use the AssemblyAI engine (default) to maximize savings - it costs 15 cycles/min vs 30 cycles/min with Whisper/ElevenLabs.

Continue learning