Audio Import


Import audio files by drag-and-drop or file picker. Supports 10 formats including MP4, WAV, MP3, and FLAC with automatic chunked transcription.

Last updated: March 25, 2026

TL;DR

Meetily can import existing audio files and transcribe them automatically. Drag an audio file onto the app window, or use the file picker, and Meetily handles decoding, voice detection, and transcription. It supports 10 audio and video formats, with files up to 20 GB. Once imported, the recording appears in your meeting list just like a live recording. To start, click the Import Audio button or drag a file onto the app.

Audio Import

Already have a recorded meeting sitting on your hard drive? Meetily can import it and produce a full transcript, with no re-recording needed.

Think of it like adding photos to a library: you drop the file in, Meetily processes it in the background, and when it's done you have a fully searchable, editable transcript sitting alongside your live recordings.

Everything starts from the main window: drag a file onto Meetily or click the Import Audio button.


Quick Start

  1. Drag an audio file onto the Meetily window (or click Import Audio)
  2. Review the file info (duration, size, and format are shown automatically)
  3. Enter a meeting title (auto-populated from the filename)
  4. Click Import
  5. Watch the progress bar move through each stage
  6. When it finishes, you're taken directly to the new meeting's details page
Have an audio file?
  → Drag it onto Meetily
  → Review file info (duration, size, format)
  → Enter meeting title
  → Click Import
  → Transcript appears in ~30 seconds to a few minutes (depends on file length)

Supported Formats

Meetily supports 10 audio and video formats:

Native Formats

These are decoded directly by Meetily's built-in audio engine:

Format | Extension | Typical use
------ | --------- | -----------
WAV | .wav | Studio recordings, voice memos
MP3 | .mp3 | Most common audio format
MP4 | .mp4 | Video recordings with audio
M4A | .m4a | Apple voice memos, iTunes
FLAC | .flac | Lossless audio archives
OGG | .ogg | Open-source recordings
AAC | .aac | Streaming audio, mobile recordings

FFmpeg-Backed Formats

These require FFmpeg for decoding (included with Meetily):

Format | Extension | Typical use
------ | --------- | -----------
MKV | .mkv | Video containers with meeting recordings
WebM | .webm | Browser-based recordings (Google Meet, Zoom web)
WMA | .wma | Windows Media recordings

File Size Limit

The maximum supported file size is 20 GB. Files must be larger than 0 bytes. Duration is detected automatically from file metadata.
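The format and size rules above can be sketched as a small pre-import check. This is an illustration only, not Meetily's actual code: the function name, the case-insensitive extension match, and the reading of "20 GB" as decimal gigabytes are all assumptions.

```python
from pathlib import PurePath

# Extension lists copied from the two format tables above.
NATIVE_EXTENSIONS = {".wav", ".mp3", ".mp4", ".m4a", ".flac", ".ogg", ".aac"}
FFMPEG_EXTENSIONS = {".mkv", ".webm", ".wma"}

# Assumption: "20 GB" means decimal gigabytes; the app may use binary units.
MAX_SIZE_BYTES = 20 * 1000**3


def check_import_candidate(filename: str, size_bytes: int) -> str:
    """Return "native" or "ffmpeg", or raise ValueError if the file cannot be imported.

    Hypothetical helper for illustration; it only mirrors the documented rules.
    """
    # Assumption: extensions are matched case-insensitively.
    ext = PurePath(filename).suffix.lower()
    if ext in NATIVE_EXTENSIONS:
        decoder = "native"
    elif ext in FFMPEG_EXTENSIONS:
        decoder = "ffmpeg"
    else:
        raise ValueError(f"unsupported format: {ext or '(no extension)'}")

    if size_bytes <= 0:
        raise ValueError("file must be larger than 0 bytes")
    if size_bytes > MAX_SIZE_BYTES:
        raise ValueError("file exceeds the 20 GB limit")
    return decoder
```

For example, `check_import_candidate("standup.webm", 5_000_000)` returns `"ffmpeg"`, while a `.txt` file or an empty file raises `ValueError`.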


The Import Dialog

When you drag a file onto Meetily or click Import Audio, the Import Dialog opens.

File Preview Card

At the top, you'll see a card showing:

  • Filename: the original file name
  • Duration: auto-detected from metadata (e.g., "1h 23m 45s")
  • File Size: in human-readable format (e.g., "245 MB")
  • Format: the audio format (e.g., "MP3")
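The duration and size strings shown on the card can be produced with a few lines of arithmetic. The sketch below is one plausible way to do it, not the app's implementation; both function names are hypothetical, and decimal units (1 MB = 1,000,000 bytes) are an assumption.

```python
def format_duration(total_seconds: int) -> str:
    """Format a duration in whole seconds, e.g. 5025 -> "1h 23m 45s"."""
    hours, remainder = divmod(total_seconds, 3600)
    minutes, seconds = divmod(remainder, 60)
    if hours:
        return f"{hours}h {minutes}m {seconds}s"
    if minutes:
        return f"{minutes}m {seconds}s"
    return f"{seconds}s"


def format_size(size_bytes: int) -> str:
    """Format a byte count using decimal units (an assumption), e.g. "245 MB"."""
    value = float(size_bytes)
    unit = "B"
    for next_unit in ("KB", "MB", "GB"):
        if value < 1000:
            break
        value /= 1000
        unit = next_unit
    return f"{value:.0f} {unit}"


print(format_duration(5025))       # 1h 23m 45s
print(format_size(245_000_000))    # 245 MB
```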

Meeting Title

Below the preview, there's an editable Meeting Title field. Meetily auto-fills this from the filename. For example, team-standup-2026-03-15.mp3 becomes "team-standup-2026-03-15". Edit it to whatever you'd like.
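Judging by the example above, the default title looks like the filename with its extension removed. A minimal sketch of that behavior, assuming nothing else is stripped or reformatted (the function name is hypothetical):

```python
from pathlib import PurePath


def default_meeting_title(file_path: str) -> str:
    """Return the filename without its final extension (assumed default title)."""
    return PurePath(file_path).stem


print(default_meeting_title("team-standup-2026-03-15.mp3"))  # team-standup-2026-03-15
```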

Advanced Options

Click Advanced Options to expand additional settings:

Option | What it does | Default
------ | ------------ | -------
Provider | Which transcription engine to use | Your current default provider
Model | Which model within that provider | Your current default model
Language | Language hint for transcription | Auto-detect

Provider Selection

The provider dropdown shows all configured transcription providers: Whisper, Parakeet, Groq, OpenAI, and OpenAI-Compatible. If no providers are configured, Meetily shows an alert with a link to Settings to set one up first.

Language Selection

Most providers support an explicit language hint. When set to Auto-detect, the transcription engine guesses the spoken language. Setting a specific language can improve accuracy for non-English recordings.

Note: Parakeet always uses auto-detection, so the language dropdown is disabled when Parakeet is selected.
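The language rule can be summed up in a few lines. This is a sketch under assumptions: the provider identifier strings and the use of `None` to mean Auto-detect are illustrative and not taken from Meetily's code.

```python
from typing import Optional

# Hypothetical identifier; per the note above, Parakeet ignores language hints.
PROVIDERS_WITHOUT_LANGUAGE_HINT = {"parakeet"}


def effective_language(provider: str, requested: Optional[str]) -> Optional[str]:
    """Return the language hint to pass on, or None for Auto-detect."""
    if provider.lower() in PROVIDERS_WITHOUT_LANGUAGE_HINT:
        return None  # dropdown is disabled, so any earlier choice is ignored
    return requested


print(effective_language("whisper", "de"))   # de
print(effective_language("parakeet", "de"))  # None
```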


How Import Works

When you click Import, Meetily processes the file through a multi-stage pipeline:

Audio File → Copy → Decode → Resample → VAD → Transcribe → Save

Processing Stages

Stage | Progress | What happens
----- | -------- | ------------
Copying | 5–10% | File is copied to Meetily's data directory
Decoding | 15–20% | Audio is extracted and decoded to raw PCM
Resampling | 20–25% | Audio is resampled to 16 kHz mono (required by transcription engines)
VAD | 25–30% | Voice Activity Detection filters out silence, noise, and non-speech
Transcribing | 30–80% | Speech segments are sent to your chosen transcription provider
Saving | 85–100% | Transcript segments are saved to the database
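To see how a per-stage completion fraction could translate into the single percentage on the progress bar, here is a small sketch. The ranges come from the table above; the linear interpolation within each stage and the function name are assumptions, since the document does not say how Meetily computes the value.

```python
# Progress ranges (percent) copied from the Processing Stages table.
STAGE_RANGES = {
    "Copying": (5, 10),
    "Decoding": (15, 20),
    "Resampling": (20, 25),
    "VAD": (25, 30),
    "Transcribing": (30, 80),
    "Saving": (85, 100),
}


def overall_progress(stage: str, fraction_done: float) -> float:
    """Map a stage and its completion fraction (0.0-1.0) to an overall percentage.

    Assumption: progress advances linearly within each stage's range.
    """
    start, end = STAGE_RANGES[stage]
    fraction = min(max(fraction_done, 0.0), 1.0)  # clamp out-of-range input
    return start + (end - start) * fraction
```

Under these assumptions, being 60% of the way through Transcribing gives `overall_progress("Transcribing", 0.6)`, which is about 60 on the bar, because Transcribing covers the widest band (30–80%).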

What you see during processing

  • A progress bar shows the current percentage and stage name
  • The stage name updates as processing moves through each phase
  • The Import button is disabled, and you cannot close the dialog or dismiss it with Escape while processing
  • A Cancel button lets you abort the import at any time

When it finishes

  • A success toast appears showing the number of transcript segments created
  • Meetily automatically navigates to the new meeting's details page
  • The meeting appears in your sidebar like any other recording

Drag-and-Drop

The fastest way to import is drag-and-drop:

  1. Find your audio file in Finder (macOS) or Explorer (Windows)
  2. Drag it onto the Meetily window
  3. A blue overlay appears covering the app with the message: "Drop audio file to import"
  4. Drop the file
  5. The Import Dialog opens with the file pre-loaded

The overlay lists all supported formats so you can quickly check if your file type is compatible.

Recording in Progress

You cannot import audio files while a live recording is in progress. If you try, Meetily shows an alert asking you to stop the current recording first.
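Together with the no-provider alert described under Provider Selection, this gives two conditions that block an import. A minimal sketch of such a guard follows; the function name, the message wording, and the order of the checks are assumptions made for illustration.

```python
from typing import List, Optional


def import_blocker(recording_in_progress: bool,
                   configured_providers: List[str]) -> Optional[str]:
    """Return a reason the import is blocked, or None if it may proceed.

    The messages are placeholders, not Meetily's actual alert text.
    """
    if recording_in_progress:
        return "Stop the current recording before importing."
    if not configured_providers:
        return "Set up a transcription provider in Settings first."
    return None


print(import_blocker(False, ["whisper"]))  # None
```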


Real-World Examples

Example 1: Importing a Zoom recording

"I have an hour-long Zoom recording as an MP4 file. I want to get a transcript without re-playing the whole thing."

Setup: Drag the .mp4 file onto Meetily → title it "Q1 Planning Call" → click Import. Meetily extracts the audio track from the video container, runs VAD to skip silence, and transcribes the speech. In a few minutes you have a full, searchable transcript.

Example 2: Processing voice memos from your phone

"I record voice memos on my iPhone during client visits. I want them transcribed and summarized in Meetily."

Setup: Transfer the .m4a files from your phone to your Mac → drag each one onto Meetily → they're imported and transcribed one at a time. Each appears as a separate meeting in your sidebar with the memo filename as the title.

Example 3: Importing older recordings with a specific language

"I have meeting recordings in German. I want to make sure the transcription knows it's German, not English."

Setup: Drag the .wav file onto Meetily → click Advanced Options → set Language to German → click Import. The language hint tells the transcription engine to expect German, which can improve accuracy for non-English recordings. Repeat for each recording, since files are imported one at a time.

Example 4: Retranscribing with a different provider

"I imported a file using Whisper but the quality wasn't great. I want to try Groq instead."

Setup: The original meeting is already in Meetily. Use the Retranscribe feature (see Retranscription) to re-process the same audio file with Groq. There is no need to re-import.


Screenshots Guide

These are the key screens worth capturing for visual reference:

# | What to capture | Why it helps
- | --------------- | ------------
1 | Drag-and-drop overlay: blue fullscreen overlay with "Drop audio file to import" and format list | Shows users what to expect when dragging a file
2 | Import dialog with file preview: showing filename, duration, size, and format card | Shows the first thing users see after dropping a file
3 | Advanced Options expanded: provider, model, and language dropdowns visible | Shows power-user options that are hidden by default
4 | Progress bar mid-import: showing a stage name like "Transcribing" and ~60% | Shows what users see during the wait
5 | Success toast: green notification showing segment count | Confirms the import worked
6 | Meeting details after import: the resulting transcript in the Transcript tab | Shows the end result of the import workflow
7 | No provider alert: the alert shown when no transcription providers are configured | Helps users understand why Import is blocked
8 | Recording-in-progress alert: the alert shown when trying to import during a live recording | Prevents confusion about why drag-drop isn't working

Frequently Asked Questions

What file formats does Meetily support?
Meetily supports 10 formats: WAV, MP3, MP4, M4A, FLAC, OGG, AAC (native), and MKV, WebM, WMA (via FFmpeg). These cover the vast majority of recording formats from phones, video conferencing tools, and professional audio equipment.

Is there a file size limit?
Yes, the maximum file size is 20 GB. Files must be larger than 0 bytes. For reference, a 1-hour WAV recording is typically around 300-600 MB, and an MP3 version of the same recording is around 50-100 MB.

Can I import video files?
Yes. MP4, MKV, and WebM are video container formats, and Meetily extracts the audio track automatically. The video itself is not stored; only the audio is processed and transcribed.

How long does an import take?
It depends on the file length and your transcription provider. A 1-hour recording typically takes 2-5 minutes with a local provider like Whisper (on Apple Silicon), or 30-60 seconds with a cloud provider like Groq. The progress bar shows real-time status so you always know where things stand.

Can I import multiple files at once?
Currently, Meetily imports one file at a time. To import multiple recordings, drop them one after another. Each creates a separate meeting entry.

How do I cancel an import?
Click the Cancel button during processing. The import stops, partial data is cleaned up, and no meeting entry is created. Your original audio file is never modified.

Do I need a Pro license to import audio?
No. Audio Import is available in both Community and Pro editions. You can import and transcribe audio files regardless of your license tier.

Can I choose which transcription provider is used for an import?
Yes. Click Advanced Options in the Import dialog to select a specific provider, model, and language. By default, Meetily uses your currently configured transcription provider.

What is VAD?
VAD stands for Voice Activity Detection. It analyzes the audio to find segments where someone is actually speaking, filtering out silence, background noise, and non-speech sounds. This stage runs quickly (it occupies only the 25-30% band of the progress bar) and makes transcription much faster and more accurate by sending only real speech to the transcription engine.

What should I do if an import fails?
Check the error message in the dialog. Common causes: the file might be corrupted, the format might not be supported, or the transcription provider might be unreachable. Try a different audio format or switch to a local provider like Whisper, which doesn't require internet. Your original file is never modified.
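The file sizes quoted in the FAQ follow from simple arithmetic: uncompressed PCM size is sample rate × channels × bytes per sample × duration, and a constant-bitrate file is bitrate × duration. The sketch below works a few one-hour cases. The specific sample rates, the 16-bit depth, and the 128 kbps bitrate are illustrative assumptions (the page only states that audio is resampled to 16 kHz mono), and the small WAV header is ignored.

```python
def pcm_size_bytes(sample_rate_hz: int, channels: int,
                   bits_per_sample: int, seconds: int) -> int:
    """Size of uncompressed PCM audio, ignoring any container header."""
    return sample_rate_hz * channels * (bits_per_sample // 8) * seconds


def cbr_size_bytes(bitrate_kbps: int, seconds: int) -> int:
    """Size of a constant-bitrate stream such as a typical MP3."""
    return bitrate_kbps * 1000 // 8 * seconds


ONE_HOUR = 3600

print(pcm_size_bytes(44_100, 1, 16, ONE_HOUR))  # 317520000  (~318 MB, mono WAV)
print(pcm_size_bytes(44_100, 2, 16, ONE_HOUR))  # 635040000  (~635 MB, stereo WAV)
print(cbr_size_bytes(128, ONE_HOUR))            # 57600000   (~58 MB, 128 kbps MP3)
print(pcm_size_bytes(16_000, 1, 16, ONE_HOUR))  # 115200000  (~115 MB, 16 kHz mono)
```

The last line suggests why resampling to 16 kHz mono helps: under these assumptions, an hour of audio shrinks to roughly 115 MB of raw samples before it reaches VAD and transcription.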

Ready to get started?

Download Meetily and start transcribing your meetings locally with full privacy.

Have questions? Join our GitHub community.