Guidelines
Suggest edit

Use a parallel coordinates plot instead of a scatter plot for anomaly detection

For anomaly-detection tasks on sparse multivariate quantitative data, prefer a parallel coordinates plot over a scatter plot to improve fidelity and mitigate missed anomalous records in static views.

  • purpose:select
  • basis:empirical
  • chart:parallel:use
  • chart:scatter:avoid
  • data:quantitative
  • quality:fidelity:use
  • lever:chart-family
  • measure:multi
  • density:sparse

advice

Choose the anomaly-finding view

Use a parallel coordinates plot when the job is to detect anomalous records in multivariate quantitative data. For example, replace a scatter plot view with a parallel coordinates plot when readers must choose one or two anomalous records in a 4-attribute, 8-record display.

reason

Why parallel coordinates help anomaly detection

A parallel coordinates plot can show a record separating from the overall multivariate pattern across several attributes at once, while a scatter plot view spreads the evidence across pairwise plots.

Mechanism: Parallel coordinates keep each record together as one line across all shown attributes. This helps readers spot records that depart from the rest by value range or by relationship pattern.

Evidence: The collated record ranks the parallel coordinates plot above the scatter plot on anomaly-detection accuracy, and the original experiment concluded that parallel coordinates plots generally outperformed the other tested representations on outlier detection (Zeng & Battle, 2023; Kanjanabose et al., 2015).

Notes: Do not assume a large improvement on every performance measure; the clearest supported gain is better anomaly-detection performance.

context

Use when anomaly finding is the job

  • User Goal: Choose one or two anomalous records.
  • Data: 8 records with 4 quantitative attributes.
  • Chart Setting: Static representations of the same records and attributes with no interaction.
  • Success Criterion: More correct anomaly choices.

exceptions

Do not use this choice for exact lookup

Break it when: The user goal is exact value retrieval rather than anomaly detection. Why: In the same study, the table was faster for direct value lookup.

costs

Tradeoffs of replacing a scatter plot with parallel coordinates

Sacrifice: You should not expect the switch to solve every speed problem by itself. Risk: Changing to parallel coordinates only for speed may not deliver a strong payoff. Mitigation: Use this change when anomaly-detection accuracy is the main success criterion.

mistakes

Common anomaly-finding failure

Mistake: Keeping a scatter plot view when anomaly-detection accuracy is the main success criterion. Why it fails: The scatter plot view underperformed the parallel coordinates plot for this task in the studied multivariate setup.

check

Check the anomaly-detection choice

Failure Sign: Reviewers miss the anomalous record or choose the wrong pair in the scatter plot view. Quick Check: Show the same anomaly-detection prompt in a scatter plot view and in a parallel coordinates plot, and compare which version produces the correct anomaly choice more often. Stronger Test: Repeat matched anomaly prompts and compare accuracy first, then completion time.

fix

Fix the anomaly-finding view

  • Replace the scatter plot view with a parallel coordinates plot for anomaly-detection tasks.
  • Show each record as one polyline across shared attribute axes instead of distributing the evidence across scatter plots.
  • Switch back to a table only when the task changes to exact value retrieval.

References

Kanjanabose, R., Abdul-Rahman, A., & Chen, M. (2015). A Multi-task Comparative Study on Scatter Plots and Parallel Coordinates Plots. Computer Graphics Forum, 34(3), 261–270. https://doi.org/10.1111/cgf.12638
Zeng, Z., & Battle, L. (2023). A Review and Collation of Graphical Perception Knowledge for Visualization Recommendation. Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 1–16. https://doi.org/10.1145/3544548.3581349