Put complex or central narration into on-screen captions

For ordered-time narrated visuals, use captions for complex or important spoken points on dynamic displays to improve fidelity and address losses from speech-only delivery for viewers following unfamiliar content.

purpose:refine
basis:empirical
time:ordered-time
temporal-pattern:dynamic
quality:fidelity:use
lever:text-annotation
component:caption:use

advice

Add captions for key spoken points

Show a caption at the same time as any spoken point that is complex or central to the message. For example, place the process name, the initial problem state, or the key intermediate relation on screen while the narration says it.

reason

Why captions help here

Some spoken material is too important to leave in audio alone. A matching caption gives viewers a second channel for the same point and makes the concept more available during the visual sequence.

Mechanism: Captions reinforce the spoken message at the moment it must be understood, which improves retention of central terms and relations that are easy to miss in audio alone.

Evidence: The paper reports that propositions conveyed only in speech were generally poorly recalled, and it recommends concurrent captions for speech information that is complex or important; recall improved after the redesign added captions for key points (Faraday & Sutcliffe, 1997).

context

Use when the narration carries hard or central concepts

User Goal: Remember the main process, relation, or problem state after one viewing.
Data: Some steps or relations are not obvious from the image alone.
Chart Setting: A narrated dynamic display has room for short on-screen captions.
Audience: Viewers are learning content they may not already understand.
Success Criterion: Viewers can recall the key named concept or relation, not just the moving objects.

exceptions

Do not caption every spoken point the same way

Break it when: The spoken point is neither complex nor important to the explanation. Why: The source limits this recommendation to information that needs special emphasis.

costs

What captions cost

Sacrifice: You give up some visual simplicity. Risk: A caption can compete with a moving event if both start together without reading time. Mitigation: Reveal the caption with the speech cue and leave it on screen long enough to read.

mistakes

Common caption misuse

Mistake: Leaving the main process name or relation only in speech. Why it fails: Viewers hear it once but do not retain it reliably.

check

How to test caption coverage

Failure Sign: Viewers remember concrete objects and motions but miss the named process or relation. Quick Check: Inspect the script for central terms that have no matching on-screen caption. Stronger Test: Show the sequence once and ask viewers to free-recall the main process name and key relation.

fix

What to change

Add a brief caption for the central process term.
Add a caption for the initial state or key relation when narration introduces it.
Time the caption reveal to the matching speech cue.
Keep the caption visible long enough to be read before the next competing event.

References

Faraday, P., & Sutcliffe, A. (1997). Designing effective multimedia presentations. Proceedings of the ACM SIGCHI Conference on Human Factors in Computing Systems, 272–278. https://doi.org/10.1145/258549.258753