US Population by ZIP code

US population by gender and race for each US ZIP code, sourced from the 2000 and 2010 Decennial Census.

This dataset is sourced from the United States Census Bureau’s Decennial Census Dataset APIs. Review the Terms of Service and the Policies and Notices for the terms and conditions related to the use of this dataset.

Note

Microsoft provides Azure Open Datasets on an “as is” basis. Microsoft makes no warranties, express or implied, guarantees or conditions with respect to your use of the datasets. To the extent permitted under your local law, Microsoft disclaims all liability for any damages or losses, including direct, consequential, special, indirect, incidental or punitive, resulting from your use of the datasets.

This dataset is provided under the original terms under which Microsoft received the source data. The dataset may include data sourced from Microsoft.

Volume and retention

This dataset is stored in Parquet format and has data for the year 2010.

Storage location

This dataset is stored in the East US Azure region. Allocating compute resources in East US is recommended for affinity.

Columns

| Name | Data type | Unique | Values (sample) | Description |
| --- | --- | --- | --- | --- |
| decennialTime | string | 1 | 2010 | The time when the decennial census took place, for example, 2010, 2000. |
| maxAge | int | 23 | 54, 21 | Max of the age range. If it’s null, it’s across all ages or the age range has no upper bound, for example, age > 85. |
| minAge | int | 23 | 45, 30 | Min of the age range. If it’s null, it’s across all ages. |
| population | int | 29,274 | 1, 2 | Population of this segment. |
| race | string | 8 | SOME OTHER RACE ALONE, BLACK OR AFRICAN AMERICAN ALONE | Race category in Census data. If it’s null, it’s across all races. |
| sex | string | 3 | Female, Male | Male or female. If it’s null, it’s across both sexes. |
| year | int | 1 | 2010 | Year (as an integer) of the decennial time. |
| zipCode | string | 33,120 | 39218, 87420 | 5-Digit ZIP Code Tabulation Area (ZCTA5). |
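
The null conventions for minAge and maxAge can be easier to follow in code. The following is a minimal sketch, not part of the dataset's SDK: `describe_age_range` is a hypothetical helper, nulls are modeled as Python `None`, and the case where only minAge is null is not described in the column table, so it is handled defensively.

```python
from typing import Optional


def describe_age_range(min_age: Optional[int], max_age: Optional[int]) -> str:
    """Turn a (minAge, maxAge) pair into a readable label.

    Hypothetical helper for illustration; null values are modeled as None.
    """
    if min_age is None and max_age is None:
        # Both bounds null: the row covers all ages.
        return "all ages"
    if max_age is None:
        # Only the upper bound is null: the age range is open-ended.
        return f"from {min_age}, no upper bound"
    if min_age is None:
        # Not described in the column table; handled defensively.
        return f"up to {max_age}"
    return f"{min_age} to {max_age}"


print(describe_age_range(15, 17))
print(describe_age_range(85, None))
print(describe_age_range(None, None))
```

In a pandas DataFrame, nulls may surface as `NaN` or `pd.NA` rather than `None`, so the same checks would likely need `pd.isna` instead of `is None`.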

Preview

| decennialTime | zipCode | population | race | sex | minAge | maxAge | year |
| --- | --- | --- | --- | --- | --- | --- | --- |
| 2010 | 77477 | 265 | WHITE ALONE | Female | 15 | 17 | 2010 |
| 2010 | 77477 | 107 | SOME OTHER RACE ALONE | Female | 15 | 17 | 2010 |
| 2010 | 77477 | 12 | SOME OTHER RACE ALONE | Female | 65 | 66 | 2010 |
| 2010 | 77477 | 101 | ASIAN ALONE | Female | 60 | 61 | 2010 |
| 2010 | 77477 | 221 | ASIAN ALONE | Male | 10 | 14 | 2010 |
| 2010 | 77478 | 256 | WHITE ALONE | Female | 15 | 17 | 2010 |
| 2010 | 77478 | 17 | SOME OTHER RACE ALONE | Female | 15 | 17 | 2010 |
| 2010 | 77478 | 3 | SOME OTHER RACE ALONE | Female | 65 | 66 | 2010 |
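
Because a null in race, sex, or minAge marks a row that aggregates across that dimension, summing the population column over every row could double count. The sketch below shows one cautious way to total population per ZIP code, using only the eight preview rows as plain Python tuples. The helper name `sum_detailed_segments` is hypothetical, and the approach assumes that the fully detailed segments do not overlap, which the column descriptions do not state.

```python
from collections import defaultdict

# The eight preview rows: (zipCode, population, race, sex, minAge, maxAge).
preview_rows = [
    ("77477", 265, "WHITE ALONE", "Female", 15, 17),
    ("77477", 107, "SOME OTHER RACE ALONE", "Female", 15, 17),
    ("77477", 12, "SOME OTHER RACE ALONE", "Female", 65, 66),
    ("77477", 101, "ASIAN ALONE", "Female", 60, 61),
    ("77477", 221, "ASIAN ALONE", "Male", 10, 14),
    ("77478", 256, "WHITE ALONE", "Female", 15, 17),
    ("77478", 17, "SOME OTHER RACE ALONE", "Female", 15, 17),
    ("77478", 3, "SOME OTHER RACE ALONE", "Female", 65, 66),
]


def sum_detailed_segments(rows):
    """Sum population per ZIP code over fully detailed segments only."""
    totals = defaultdict(int)
    for zip_code, population, race, sex, min_age, _max_age in rows:
        # Skip aggregate rows, where a null (None) means "across all".
        # maxAge is not checked: it is also null for open-ended age ranges.
        if race is None or sex is None or min_age is None:
            continue
        totals[zip_code] += population
    return dict(totals)


totals = sum_detailed_segments(preview_rows)
print(totals)  # {'77477': 706, '77478': 276}
```

These totals cover only the rows shown in the preview, so they are not the full populations of the two ZIP Code Tabulation Areas.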

Data access

Azure Notebooks

# This is a package in preview.
from azureml.opendatasets import UsPopulationZip

population = UsPopulationZip()
population_df = population.to_pandas_dataframe()
population_df.info()

Azure Databricks

# This is a package in preview.
from azureml.opendatasets import UsPopulationZip

population = UsPopulationZip()
population_df = population.to_spark_dataframe()
display(population_df.limit(5))

Azure Synapse

# This is a package in preview.
from azureml.opendatasets import UsPopulationZip

population = UsPopulationZip()
population_df = population.to_spark_dataframe()
# Display top 5 rows
display(population_df.limit(5))

Next steps

View the rest of the datasets in the Open Datasets catalog.