Twenty-six measures sounds like twenty-six different problems: diabetes here, smoking there, food insecurity somewhere else. It isn't. Standardize all 26 measures and ask the data for its principal axes of variation, and the answer is blunt: a single component explains 57% of the variance between ZIP codes. A second adds 20%. Together, two numbers carry 77% of everything these 26 measures can say about how places differ.
Two components carry most of the signal
Share of total variance explained by each principal component
The first axis is deprivation wearing a hospital gown
What is this dominant axis? Look at how every measure “loads” onto it: nearly everything points the same way. ZIP codes high on PC1 have more diabetes and more smoking and more disability and more food insecurity, all at once. And the axis is barely about health care at all — across ZIP codes it correlates at ρ = -0.78 with median household income and ρ = +0.72 with the Area Deprivation Index. If you know how poor a neighborhood is, you already know most of what this axis knows.
The exceptions are the interesting part. Binge drinking loads negative — it is the one behavior that rises with affluence. Cancer prevalence and skipped checkups barely load at all, because they answer to a different master: age.
How each measure loads on the two axes
PC1 = overall burden · PC2 = the age-and-place axis
Every ZIP code, on two axes
Plot every analyzed ZIP code in this two-dimensional space and color it by income, and the gradient is unmistakable — blue (higher-income) places pile up on the left of the burden axis, red (lower-income) places stretch right. The vertical spread at any income level is the age-and-place axis doing its separate work.
The health plane of America's ZIP codes
Each dot is one ZIP/ZCTA area, positioned by its two principal component scores
The burden axis, on the map
ZIP centroids colored by PC1 percentile (deeper red = higher combined burden)