Guidelines
Suggest edit

Recruit diverse viewers for visualization testing

For interpretation testing, use a diverse participant sample in visualization review sessions to improve fidelity and address audience-specific interpretation gaps for viewers with mixed ages, education, and visualization experience.

  • purpose:refine
  • basis:rhetorical
  • quality:fidelity
  • lever:interaction-access
  • communication:workflow
  • knowledge:mixed
  • audience:designer

advice

Diversify test participants

Recruit test viewers who differ in age, education, and visualization experience when you evaluate how a visualization is interpreted. For example, include both older and younger viewers, include people with non-academic as well as student backgrounds, and collect critique from viewers with varying levels of visualization experience instead of testing only one homogeneous group.

reason

Why diverse testing reveals gaps

Different viewer groups notice different interpretation problems. A more varied test group is more likely to reveal gaps that stay hidden when all feedback comes from people with similar age, education, or experience.

Mechanism: Diverse testing exposes interpretation differences across audience groups, which helps designers find misunderstandings that a single-group test can miss.

Evidence: Workshops and interviews that included older adults, students, people with non-academic educational backgrounds, and viewers with varying visualization experience surfaced insights that could have been missed in a more homogeneous group. A representative survey of adults aged 18 to 74 also gathered 377 voluntary critique comments on visualizations, showing the value of feedback from a broad audience range (Knoll et al., 2025; Saske et al., 2025).

context

Use when testing across audience groups

  • User Goal: Check whether different kinds of viewers interpret the visualization well.
  • Task: Gather critique or feedback that can reveal interpretation gaps.
  • Chart Setting: Formative review through workshops, interviews, or surveys that can recruit more than one viewer profile.
  • Audience: Intended viewers vary in age, education, or visualization experience.
  • Success Criterion: Feedback surfaces issues that a single homogeneous test group would likely miss.

exceptions

Do not use when the target audience is narrowly defined

Break it when: the visualization is intentionally being tested for one narrowly defined audience group rather than across age, education, or expertise differences. Why: this guideline is for surfacing differences between viewer groups, so its main benefit drops when cross-group variation is not the testing target.

costs

Tradeoffs of broader recruitment

Sacrifice: You give up the speed and convenience of testing with one readily available participant group. Risk: A sample can appear varied while still missing important audience differences. Mitigation: Vary age, education background, and visualization experience rather than changing only one of those dimensions.

mistakes

Common testing mistake

Mistake: Recruiting only one age, education, or experience group and treating that as audience testing. Why it fails: a homogeneous group can miss interpretation problems that appear in other viewer groups.

check

Check participant diversity

Failure Sign: Most feedback comes from viewers with similar backgrounds, and critique does not reflect clearly different perspectives. Quick Check: List participants by age range, education background, and visualization experience and see whether one profile dominates the sample. Stronger Test: Compare feedback from clearly different viewer groups and check whether they raise different interpretation issues.

fix

Fix the recruitment plan

  • Expand the participant pool beyond one convenience group.
  • Set recruitment targets across age ranges, education backgrounds, and visualization experience levels.
  • Add both older and younger viewers when the intended audience spans generations.
  • Collect open critique from the mixed group through workshops, interviews, or surveys.

References

Knoll, C., Möller, T., Gregory, K., & Koesten, L. (2025). The Gulf of Interpretation: From Chart to Message and Back Again. Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems, 1–17. https://doi.org/10.1145/3706598.3713413
Saske, A., Koesten, L., Möller, T., Staudner, J., & Kritzinger, S. (2025). A Multidimensional Assessment Method for Situated Visualization Understanding (MdamV). arXiv. https://doi.org/10.48550/arXiv.2410.23807