Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
US population by gender and race for each US ZIP code sourced from 2000 and 2010 Decennial Census.
This dataset is sourced from United States Census Bureau’s Decennial Census Dataset APIs. Review Terms of Service and Policies and Notices for the terms and conditions related to the use this dataset.
Note
Microsoft provides Azure Open Datasets on an “as is” basis. Microsoft makes no warranties, express or implied, guarantees or conditions with respect to your use of the datasets. To the extent permitted under your local law, Microsoft disclaims all liability for any damages or losses, including direct, consequential, special, indirect, incidental or punitive, resulting from your use of the datasets.
This dataset is provided under the original terms that Microsoft received source data. The dataset may include data sourced from Microsoft.
Volume and retention
This dataset is stored in Parquet format and has data for the year 2010.
Storage location
This dataset is stored in the East US Azure region. Allocating compute resources in East US is recommended for affinity.
Related datasets
Columns
Name | Data type | Unique | Values (sample) | Description |
---|---|---|---|---|
decennialTime | string | 1 | 2010 | The time of the decennial census happened, for example, 2010, 2000. |
maxAge | int | 23 | 54 21 | Max of the age range. If it’s null, it’s across all ages or the age range has no upper bound, for example, age > 85. |
minAge | int | 23 | 45 30 | Min of the age range. If it’s null, it’s across all ages. |
population | int | 29,274 | 1 2 | Population of this segment. |
race | string | 8 | SOME OTHER RACE ALONE BLACK OR AFRICAN AMERICAN ALONE | Race category in Census data. If it’s null, it’s across all races. |
sex | string | 3 | Female Male | Male or female. If it’s null, it’s across both sexes. |
year | int | 1 | 2010 | Year (in integer) of the decennial time. |
zipCode | string | 33,120 | 39218 87420 | 5-Digit ZIP Code Tabulation Area (ZCTA5). |
Preview
decennialTime | zipCode | population | race | sex | minAge | maxAge | year |
---|---|---|---|---|---|---|---|
2010 | 77477 | 265 | WHITE ALONE | Female | 15 | 17 | 2010 |
2010 | 77477 | 107 | SOME OTHER RACE ALONE | Female | 15 | 17 | 2010 |
2010 | 77477 | 12 | SOME OTHER RACE ALONE | Female | 65 | 66 | 2010 |
2010 | 77477 | 101 | ASIAN ALONE | Female | 60 | 61 | 2010 |
2010 | 77477 | 221 | ASIAN ALONE | Male | 10 | 14 | 2010 |
2010 | 77478 | 256 | WHITE ALONE | Female | 15 | 17 | 2010 |
2010 | 77478 | 17 | SOME OTHER RACE ALONE | Female | 15 | 17 | 2010 |
2010 | 77478 | 3 | SOME OTHER RACE ALONE | Female | 65 | 66 | 2010 |
Data access
Azure Notebooks
# This is a package in preview.
from azureml.opendatasets import UsPopulationZip
population = UsPopulationZip()
population_df = population.to_pandas_dataframe()
population_df.info()
Azure Databricks
# This is a package in preview.
from azureml.opendatasets import UsPopulationZip
population = UsPopulationZip()
population_df = population.to_spark_dataframe()
display(population_df.limit(5))
Azure Synapse
# This is a package in preview.
from azureml.opendatasets import UsPopulationZip
population = UsPopulationZip()
population_df = population.to_spark_dataframe()
# Display top 5 rows
display(population_df.limit(5))
Next steps
View the rest of the datasets in the Open Datasets catalog.