How to Estimate How Long Your Script Will Take to Read Aloud
Quick Answer
Divide your word count by your speaking pace in words per minute to get runtime in minutes. Most on-camera presenters speak at 120–150 words per minute for recorded video, meaning a 750-word script runs approximately 5 to 6 minutes. Add 10–15% for natural pauses, breath breaks, and emphasis moments not captured in the raw word count.
“I always ran over time at conferences until I started using the 15% pause buffer. Now I hit my slot within 30 seconds every time. Measuring my own wpm instead of using the benchmark was the key — I speak faster than average.”
James T. — Conference Speaker, Chicago IL
Why Script Timing Estimations Are Almost Always Wrong
After coaching hundreds of creators through script development, the most common surprise is how different a script feels to read aloud versus silently. Silent reading happens at 200–400 words per minute for most adults. We internalize that pace and assume our scripts are shorter than they are. Then we stand in front of a camera and discover the 600-word script we thought was three minutes is actually four and a half — after accounting for pauses, breath patterns, and the natural slowing that occurs with emphasis.
Getting timing right matters practically. A YouTube video that runs long loses viewers before the call to action. An online course module that runs short may not justify its billing. A conference talk that overruns is career-defining for the wrong reasons.
The Core Formula
The calculation is simple:
Runtime (minutes) = Word Count ÷ Words Per Minute
The variable that most people get wrong is the denominator. Here are accurate speaking pace benchmarks for different recording contexts:
- Conversational, relaxed delivery: 120–130 wpm
- Standard on-camera talking head: 130–150 wpm
- Fast-paced YouTube / social content: 150–175 wpm
- Academic lecture or instructional content: 110–130 wpm
- Non-native speaker delivery: 100–120 wpm
- Formal presentation or keynote: 120–140 wpm
Quick Reference: Word Count to Video Length
- 150 words → approximately 1 minute at 140 wpm
- 450 words → approximately 3 minutes at 140 wpm
- 750 words → approximately 5 minutes at 140 wpm
- 1,200 words → approximately 8 minutes at 140 wpm
- 2,000 words → approximately 14 minutes at 140 wpm
The 15% Pause Adjustment
Raw word count does not include the pauses that make speech intelligible. A well-delivered video includes:
- Breath pauses between paragraphs (1–2 seconds each)
- Emphasis pauses before key points (1–3 seconds)
- Transition pauses between sections (2–4 seconds)
- Reaction or rhetorical pauses that invite viewer reflection
In practice, these pauses add roughly 10–20% to calculated runtime. For a 5-minute script estimate, plan for 5.5 to 6 minutes of actual delivered content. For a conference talk with a hard 18-minute limit, your script should estimate to no more than 15 minutes on paper.
How to Measure Your Own Speaking Pace
The benchmarks above are starting points. Your actual pace varies with subject matter confidence, nerves, and delivery style. Here is how to calibrate to your specific pace:
- Write a 300-word passage on a topic you know well.
- Read it aloud at your natural on-camera pace — not rushed, not slowed.
- Time the reading with a stopwatch.
- Divide 300 by the number of minutes elapsed. This is your natural wpm.
Repeat this test three times and average the results. Most creators discover their natural pace is between 130 and 155 wpm. Use your personal number rather than a benchmark for all future estimates.
Using Telepront's Word Count and Duration Features
When you load a script into Telepront, the voice-scroll engine tracks your actual speaking pace during each take and displays your real words-per-minute rate alongside the elapsed time. After your first take, you have precise data rather than an estimate — and you can compare it to your target runtime. If your actual pace is consistently faster than planned, use the [SLOW] cue in your script to add deliberate pause checkpoints. If you are running long, identify which sections have the most natural cuts.
Scripts With Variable Density
Not all words take the same time to speak. Technical vocabulary, long numerical values, and unfamiliar proper nouns create verbal speed bumps that your formula will not capture. A 500-word medical explainer script will run longer than a 500-word personal story, even at the same wpm. When your script contains dense technical content, add a 20–25% buffer rather than the standard 15%.
Fitting Scripts to Specific Time Targets
Social Media (under 60 seconds)
Target 130–150 words for a 60-second clip at normal conversational pace. Eliminate any sentence that does not directly support the single main point. Social video has no room for preamble — the hook must be in the first 5 words.
YouTube (8–12 minutes)
Target 1,000–1,500 words for standard YouTube content at 130–140 wpm with normal pause density. Structure the script with an introduction hook (150 words), three to four main sections (250–300 words each), and a conclusion with call to action (100 words).
Online Course Modules (5–9 minutes)
Target 650–1,100 words depending on content density. Educational content should be at the slower end of the pace range — 120–130 wpm — to allow processing time. Budget extra pause time after every new concept introduction.
Keynote Talk (15–18 minutes)
Target 1,700–2,000 words. Keynote delivery is typically the slowest of all formats, with the longest pauses, because the speaker must also allow time for audience reaction, laughter, and absorption of visual aids. A 15-minute keynote rarely exceeds 1,800 words of actual scripted content.
A Note on Filler Words
If your draft script includes many filler words — um, uh, you know, kind of — estimate that these will be replaced by 0.5–1.5 second pauses in delivery (if you remove them consciously) or will add time if they appear in the final recording. Clean scripts with clean pauses are more predictable to time than scripts with filler words you plan to edit out.
“The word count to runtime table is something I reference every time I'm writing a module. I used to wing it and always ended up with segments that were either way too short or needed to be cut down in post.”
Hannah B. — Online Course Creator, Austin TX

Use this script in Telepront
Paste any script and it auto-scrolls as you speak. AI voice tracking follows your pace — the floating overlay sits on top of Zoom, FaceTime, OBS, or any app.
Your Script — Ready to Go
Script Timing Explainer for Creators · 140 words · ~1 min · 131 WPM
Creators Love It
“The technical vocabulary density point was a revelation. I kept wondering why my technical explainer scripts ran 20% longer than my formula predicted. The extra buffer for dense content solved it.”
Marcus D.
Marketing Video Producer, Seattle WA
See It in Action
Watch how Telepront follows your voice and scrolls the script in real time.
Every Question Answered
5 expert answers on this topic
How many words are in a 1-minute video script?
At a typical on-camera pace of 130–150 words per minute, a one-minute video requires approximately 130–150 words of scripted content, not counting pauses. Allow for 10–15% fewer words if your delivery style includes deliberate pausing and emphasis moments.
How many words are in a 5-minute video script?
A five-minute video at 130–150 wpm requires approximately 650–750 words before pause adjustment. After adding 15% for pauses, target 560–650 actual scripted words. For dense instructional content at 120 wpm, target around 540–560 words.
Does speaking faster make my video shorter in practice?
Yes, but with diminishing returns. Speaking above 160 wpm on camera often becomes difficult for audiences to follow, and many platforms' auto-captioning accuracy drops at higher speech rates. A faster pace that loses audience comprehension creates more problems than the time it saves.
How do I time my script before recording?
Read your script aloud once at your intended delivery pace and time it with your phone's stopwatch. This is more accurate than any formula because it captures your natural pauses, the actual length of your delivery, and any verbal stumbles that indicate sections needing revision.
Why does my video always run longer than my script estimate?
The most common causes are: silent reading is faster than spoken delivery (creating an underestimate baseline), pauses are not counted in word count, and performance nerves slow pace below your rehearsal baseline. Adding 15–20% buffer to your calculated estimate and measuring your pace during a calm rehearsal read-through corrects most of this drift.