Prefer natural interpolation when clustered values and outliers need different treatment

For regional comparison on outlier-rich value maps, prefer natural interpolation on choropleth color scales to improve fidelity and mitigate the tradeoff between washed-out linear scales and overstated equal-count scales for readers scanning geographic patterns.

purpose:refine
basis:heuristic
task:compare
chart:choropleth
data:geospatial
quality:fidelity
lever:encoding
shape:outlier-rich

advice

Distribution-aware interpolation

Prefer natural interpolation when values cluster tightly in one part of the range but a few outliers sit far away. For example, use Natural breaks for classed scales or Natural interpolation for continuous scales so clustered midrange regions separate into several colors while only a few true outliers get the darkest shade.

reason

Why natural interpolation balances pattern and rarity

Natural interpolation follows the actual grouping in the data instead of forcing equal numeric spacing or equal counts. That makes it useful when the map needs to show both internal variation and true outliers.

Mechanism: Closely packed values are grouped together while far-apart outliers are separated more aggressively, so the map shows meaningful regional variation without pretending extreme values are common.

Evidence: The article presents natural interpolation as the compromise that respects the distribution, reveals differences around the middle of the data, and still leaves only a few regions as clear outliers (Muth, 2022).

Notes: The article discusses this logic for both classed and unclassed color scales.

context

Use when the data are clustered with a few far-apart extremes

User Goal: Show geographic patterns across the common values while still signaling that only a few regions are extreme.
Task: Compare regions across an uneven distribution.
Data: Quantitative regional values with dense clustering and a small number of far-apart outliers.
Chart Setting: A choropleth with a classed or unclassed sequential color scale.
Success Criterion: Several colors appear across the main cluster, but the darkest shade is still reserved for only a few regions.

exceptions

Do not use natural interpolation when the simple scale already fits the story

Break it when: The values are fairly even, or the main point is to draw immediate attention to outliers alone. Why: Linear interpolation already serves those cases more directly.

costs

Tradeoffs of natural interpolation

Sacrifice: You give up simple evenly spaced break values. Risk: The resulting thresholds can be awkward decimals that are hard to read in the legend. Mitigation: Round the thresholds afterward with custom breaks if the map impression stays the same.

mistakes

Common failure with natural interpolation decisions

Mistake: Staying with linear or equal-count interpolation after they clearly force the map to one side of the tradeoff. Why it fails: Linear can hide the main cluster, while equal-count can make extremes look too common.

check

How to test whether natural interpolation is the better compromise

Failure Sign: Linear leaves most regions near one light color, while equal-count interpolation makes dark colors too common. Quick Check: Inspect the distribution; if many values are tightly packed and a few are far away, natural interpolation is a better match. Stronger Test: Compare linear, equal-count, and natural versions and keep natural if it reveals midrange patterns while reserving the darkest color for only a few regions.

fix

What to change when linear and equal-count both mislead

Change the interpolation to Natural breaks on a classed scale.
Change the interpolation to Natural on a continuous scale.
Keep the same palette and let the breakpoints follow the clusters in the data.
If the legend becomes hard to read, round the resulting thresholds and compare the map again.

References

Muth, L. C. (2022). How to choose an interpolation for your color scale. https://www.datawrapper.de/blog/interpolation-for-color-scales-and-maps