Edit audio and video like a document — remove filler words, clone your voice, and auto-generate clips
Descript AI: The Document-First Media Editor
Why Descript is Different
Descript treats your recording as a text document. Edit the transcript, and the audio/video changes automatically. This fundamental shift makes complex editing accessible to anyone who can use a word processor.
Core AI Features
Overdub: Voice Cloning
How it works:
Record 10 minutes of clear speech for training
Wait 24 hours for voice model
Type corrections in transcript → audio plays in your voiceUse cases:
Fix mispronunciations post-recording
Add clarifications without re-recording
Re-record sections when environment changed
Create multiple language versionsQuality note: Overdub sounds 90% like you. Listeners rarely notice on podcasts.
AI Filler Word Removal
Descript detects and removes:
"um", "uh", "like"
False starts and repetitions
Extended silences (configurable threshold)
Mouth clicks and breath soundsBefore/after timing: A 45-minute interview often becomes 32 minutes after cleanup.
Settings: Conservative mode (removes only clear filler) vs. Aggressive (also removes pauses).
Studio Sound
One-click audio enhancement:
Background noise suppression
Room tone equalization
Dynamic range compression
Volume normalizationTransforms kitchen table recording into broadcast-quality audio.
AI Highlights Generation
For long-form content:
Run AI analysis on full transcript
AI identifies most quotable moments
Review 3-5 suggested clips
Export as social media cuts with captionsPodcast Production Workflow
Recording
Record each participant in separate tracks
Descript handles multi-track automaticallyEditing (45 min for 1-hour episode)
Cleanup (10 min): Apply Studio Sound, run filler word removal
Structure edit (20 min): Delete sections in transcript view
Transitions (10 min): Add room tone fills between cuts
Highlights (5 min): Approve AI-suggested clipsDistribution
Export MP3 for RSS feed
Export 3-5 short clips with auto-captions
Generate chapter markers from transcript
Create blog post from AI summaryVideo Production Workflow
Interview/Talk Content
Import video
Auto-transcription (2-3 min per hour of content)
Script-based editing
Auto-generate square/vertical crops for social
Add B-roll from Descript's stock libraryScreen Recording / Tutorial Content
Screen record with Descript's built-in recorder
Auto-transcription of narration
Remove mistakes by deleting text
Add webcam overlay and graphics
Export with chapter markersPricing
| Plan | Features | Price |
| Free | 1hr/mo transcription | $0 |
| Hobbyist | 10hr/mo, Overdub | $12/mo |
| Creator | 30hr/mo, all features | $24/mo |
| Business | Unlimited, team features | $40/mo |