How is this different from the Voice Enhancer?

Voice Enhancer runs a single denoise pass and keeps the natural feel of the recording — best for cleaning up steady noise like fans or AC hum. Voice Isolator stacks multiple passes and adds a voice-activity gate that silences anything outside the speaker — best for stripping music, crowds, or another conversation from the background.

Can it remove background music from a voice recording?

Yes, when the music is clearly behind the voice in level. The gate silences frames with no detected speech, and the multi-pass denoise pulls down music bleeding through during words. Heavily mastered music at the same loudness as the voice is harder; try strength 90–100 and three passes for those cases.

What does the isolation strength slider do?

It sets how aggressively non-voice frames are attenuated. At 0 the gate is loose and you'll hear faint background; at 100 any frame the model isn't confident contains voice is silenced. 70–80 is a good starting point for podcasts, 90–100 for music or crowd removal.
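To make the slider behaviour concrete, here is a minimal sketch of a strength-controlled gate. It is not the tool's actual code: the names gate_gain and apply_gate are hypothetical, and the mapping from strength to threshold and residual gain is an assumption chosen only to match the description above (faint background at 0, silence at 100). The 480-sample frame is the 10 ms frame RNNoise works on at 48 kHz.

```python
FRAME_SIZE = 480  # 10 ms at 48 kHz, the frame length RNNoise processes


def gate_gain(vad_prob: float, strength: int) -> float:
    """Gain for one frame: 1.0 leaves it untouched, 0.0 silences it.

    Assumed mapping (illustrative only): a higher strength demands more
    voice confidence and attenuates rejected frames more deeply.
    """
    if not 0 <= strength <= 100:
        raise ValueError("strength must be between 0 and 100")
    s = strength / 100.0
    threshold = 0.2 + 0.6 * s  # assumed confidence needed to count as voice
    floor = 0.5 * (1.0 - s)    # assumed gain left on rejected frames
    return 1.0 if vad_prob >= threshold else floor


def apply_gate(samples, vad_probs, strength):
    """Scale each 480-sample frame by the gain chosen for its VAD probability."""
    out = []
    for i, prob in enumerate(vad_probs):
        gain = gate_gain(prob, strength)
        frame = samples[i * FRAME_SIZE:(i + 1) * FRAME_SIZE]
        out.extend(x * gain for x in frame)
    return out
```

With this mapping, a frame scored 0.1 by the voice detector keeps half its level at strength 0 and drops to silence at strength 100. A production gate would also smooth the gain between neighbouring frames to avoid clicks, which this sketch leaves out.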

Is the audio uploaded to a server?

No. The model and your file stay in your browser. The pipeline is a small WebAssembly module that runs locally on your CPU, so nothing leaves your computer.

What output format do I get?

A mono 48 kHz WAV file in 16-bit PCM. WAV is uncompressed and works in every audio editor and podcast host. Use the Convert audio tool to export an MP3 if you need a smaller file.
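For a sense of what that format means in bytes, the sketch below encodes samples as mono 48 kHz 16-bit PCM using Python's standard wave module and estimates the resulting file size. The function names encode_wav and wav_size_bytes are illustrative and not part of the tool; the 44-byte figure is the size of the plain PCM header that the wave module writes.

```python
import io
import struct
import wave

SAMPLE_RATE = 48_000  # Hz
SAMPLE_WIDTH = 2      # bytes per sample (16-bit PCM)
CHANNELS = 1          # mono
HEADER_BYTES = 44     # plain PCM WAV header as written by the wave module


def encode_wav(samples):
    """Encode floats in [-1.0, 1.0] as mono 48 kHz 16-bit PCM WAV bytes."""
    pcm = bytearray()
    for x in samples:
        clipped = max(-1.0, min(1.0, x))  # keep values inside the int16 range
        pcm += struct.pack("<h", round(clipped * 32767))
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(CHANNELS)
        wav.setsampwidth(SAMPLE_WIDTH)
        wav.setframerate(SAMPLE_RATE)
        wav.writeframes(bytes(pcm))
    return buf.getvalue()


def wav_size_bytes(seconds):
    """Approximate size of a recording of the given length in this format."""
    return HEADER_BYTES + int(seconds * SAMPLE_RATE) * SAMPLE_WIDTH * CHANNELS
```

At 96,000 bytes per second, a ten-minute recording comes to about 57.6 MB, which is why converting to MP3 is worth it when file size matters.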

How long can the recording be?

Files up to 200 MB are accepted. Two passes process at roughly 3–5× real-time on a modern laptop, so a 10-minute recording isolates in about two to three and a half minutes.
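The estimate is the recording length divided by the speed factor. A small helper, with the hypothetical name processing_minutes, shows the arithmetic for the 3–5× range quoted above; actual speed depends on your machine and the number of passes.

```python
from typing import Tuple


def processing_minutes(recording_minutes: float,
                       slowest_speed: float = 3.0,
                       fastest_speed: float = 5.0) -> Tuple[float, float]:
    """Return (best case, worst case) processing time in minutes.

    Speeds are multiples of real time: 3.0 means three minutes of audio
    are processed per minute of wall-clock time.
    """
    if slowest_speed <= 0 or fastest_speed <= 0:
        raise ValueError("speed factors must be positive")
    return recording_minutes / fastest_speed, recording_minutes / slowest_speed


best, worst = processing_minutes(10)
print(f"{best:.1f} to {worst:.1f} minutes")  # prints "2.0 to 3.3 minutes"
```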

Audio · Free · Runs locally

Voice isolator

Isolate the speaker from music, crowds, and background noise in your browser.

.mp3 · .wav · .ogg · .m4a · .aac · .flac · .webm · .opus

Runs entirely in your browser.

MP3 · WAV · OGG · M4A · FLAC · WebM · max 200 MB

First run loads a small (~125 KB) neural network; cached afterwards.

About the Voice isolator

Voice isolator pulls the speaker out of recordings drowning in background music, crowd chatter, traffic, or room noise — perfect for rescuing podcast guests recorded in cafes, lifting interview audio off a noisy street, or stripping music beds from a vocal stem. Drop in an MP3, WAV, M4A, OGG, or FLAC and Handytool runs a stacked RNNoise pipeline with a voice-activity-driven gate, entirely inside your browser, so your audio never leaves your device. Two controls — isolation strength and the number of cleanup passes — let you choose between a soft cleanup and a hard isolation that silences anything outside the speaker. The result downloads as a 48 kHz mono WAV.

Voice isolator features

01
Two-stage isolation, not just denoise
Multi-pass neural-network denoising tightens the noise floor on each pass. A voice-activity-driven gate then silences frames the model is confident contain no speech — so background music, applause, and conversation drop out entirely between phrases.
02
Tunable for podcast or rescue work
Isolation strength controls how aggressively non-voice frames are gated. Lower it for natural-sounding podcasts; push it up to fully strip a music bed or crowd from a noisy recording.
03
Runs locally, no upload
The whole pipeline is a 125 KB WebAssembly module that loads once and stays cached. Audio is decoded, isolated, and downloaded entirely on your machine — no server round-trip, no account, no length limits beyond the 200 MB file cap.
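The two-stage design can be sketched as a loop of denoise passes followed by a gate. This is an illustration under stated assumptions, not the tool's implementation: isolate, toy_denoiser, and the vad_threshold parameter are hypothetical names, the denoiser is a stand-in callable that mimics RNNoise's shape (one 480-sample frame in, a cleaned frame and a voice probability out), and using the last pass's voice probabilities for the gate is an assumption.

```python
from typing import Callable, List, Sequence, Tuple

FRAME_SIZE = 480  # 10 ms at 48 kHz, the frame length RNNoise processes
Frame = List[float]
DenoiseFn = Callable[[Sequence[float]], Tuple[Frame, float]]


def isolate(samples: Sequence[float], denoise_frame: DenoiseFn,
            passes: int = 2, vad_threshold: float = 0.5) -> List[float]:
    """Run stacked denoise passes, then silence frames judged to be non-voice."""
    if passes < 1:
        raise ValueError("passes must be at least 1")
    # For simplicity, a trailing partial frame is dropped.
    usable = len(samples) - len(samples) % FRAME_SIZE
    frames = [list(samples[i:i + FRAME_SIZE]) for i in range(0, usable, FRAME_SIZE)]
    vad = [0.0] * len(frames)

    # Stage 1: each pass feeds the previous pass's output back into the denoiser.
    for _ in range(passes):
        results = [denoise_frame(frame) for frame in frames]
        frames = [cleaned for cleaned, _ in results]
        vad = [prob for _, prob in results]  # assumed: gate uses the last pass

    # Stage 2: hard gate driven by the voice-activity probability.
    out: List[float] = []
    for frame, prob in zip(frames, vad):
        out.extend(frame if prob >= vad_threshold else [0.0] * len(frame))
    return out


def toy_denoiser(frame: Sequence[float]) -> Tuple[Frame, float]:
    """Stand-in for a real model: halves the signal, flags loud frames as voice."""
    peak = max((abs(x) for x in frame), default=0.0)
    return [x * 0.5 for x in frame], (1.0 if peak > 0.1 else 0.0)
```

Calling isolate(samples, toy_denoiser, passes=2) on one loud frame followed by one quiet frame returns the loud frame at reduced level and the quiet frame as silence, which is the "drops out entirely between phrases" behaviour described in feature 01.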

Guides

5 min
Audio guide
How to Isolate Voice From Background Noise Free Online
Strip music, crowds, and traffic from any recording. Multi-pass neural isolation runs entirely in your browser — no upload, no server, nothing stored.
Updated Mar 30, 2026
