USA_1900_PHC_v01_M_v7.5_A_IPUMS
United States Census of 1900 - IPUMS Subset
PHC United States 1900 (5%) (IPUMS Harmonized Subset)
| Name | Country code | 
|---|---|
| United States | USA | 
Population and Housing Census [hh/popcen] IPUMS International
DOI:10.18128/D020.V7.5
Population and Housing Census [hh/popcen]
Persons, households, and dwellings
UNITS IDENTIFIED:
UNIT DESCRIPTIONS:
Version 7.5. The datasets contain selected variables from the original census microdata plus harmonized variables from the IPUMS-International database.
2024-10-05
Additional notes on a sample that is part of this study: United States 1900 (5%)
| Topic | Vocabulary | 
|---|---|
| Demographic Variables -- PERSON | IPUMS | 
| Nativity and Birthplace Variables -- PERSON | IPUMS | 
| Fertility and Mortality Variables -- PERSON | IPUMS | 
| Geography: Global Variables -- HOUSEHOLD | IPUMS | 
| Technical Household Variables -- HOUSEHOLD | IPUMS | 
| Geography: O-Z Variables -- HOUSEHOLD | IPUMS | 
| Constructed Family Interrelationship Variables -- PERSON | IPUMS | 
| Household Economic Variables -- HOUSEHOLD | IPUMS | 
| Group Quarters Variables -- HOUSEHOLD | IPUMS | 
| Constructed Household Variables -- HOUSEHOLD | IPUMS | 
| Ethnicity and Language Variables -- PERSON | IPUMS | 
| Work Variables -- PERSON | IPUMS | 
| Education Variables -- PERSON | IPUMS | 
| Other Person Variables -- PERSON | IPUMS | 
| Technical Person Variables -- PERSON | IPUMS | 
| Other Household Variables -- HOUSEHOLD | IPUMS | 
County
All persons living in the United States including temporarily absent residents and sailors at sea. Native Americans living on reservations or under tribal rule were enumerated using a separate schedule.
| Name | Affiliation | 
|---|---|
| Department of the Interior | |
| IPUMS | University of Minnesota | 
MICRODATA SOURCE: Department of the Interior
SAMPLE SIZE (person records): 3852852.
SAMPLE DESIGN: 1-in-20 national random sample of the population. Alaska and Hawaii are not included in this dataset.
Self-weighting (expansion factor=20)
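Because the sample is self-weighting, a population estimate follows from multiplying a sample count by the expansion factor. The following is a minimal sketch of that arithmetic in Python, using only the figures stated above; the function name `estimate_population` is illustrative and not part of any IPUMS tool.

```python
# Figures taken from the sample description above.
SAMPLE_PERSON_RECORDS = 3852852
EXPANSION_FACTOR = 20  # 1-in-20 self-weighting sample


def estimate_population(sample_count, expansion_factor=EXPANSION_FACTOR):
    """Scale a count of sample records to an estimated population count."""
    return sample_count * expansion_factor


estimated_total = estimate_population(SAMPLE_PERSON_RECORDS)
print(estimated_total)  # 77057040
```

The result approximates only the population covered by the sample, which excludes Alaska and Hawaii.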
The census operation involved four schedules. Schedule 1 was used to enumerate households and collected information on individual characteristics. Other schedules were used to enumerate the Native American population, and record information on agriculture and livestock.
| Start | End | 
|---|---|
| 1900-06-01 | 1900-06-01 | 
| Start date | End date | 
|---|---|
| 1900-06-01 | 1900-06-01 | 
de jure, CENSUS DAY: June 1, 1900
| Name | 
|---|
| Department of the Interior | 
| Is signing of a confidentiality declaration required? | Confidentiality declaration text | 
|---|---|
| yes | IPUMS International distributes integrated microdata of individuals and households only by agreement of collaborating national statistical offices and under the strictest of confidence. Before data may be distributed to an individual researcher, an electronic license agreement must be signed and approved. To gain access to the data, a researcher must agree to the following: (1) Implement security measures to prevent unauthorized access to census microdata. Under IPUMS International agreements with collaborating agencies, redistribution of the data to third parties is prohibited. (2) Use the microdata for the exclusive purposes of scholarly research and education. Researchers must explicitly agree to not use microdata acquired for any commercial or income-generating venture. (3) Maintain the confidentiality of persons, households, and other entities. Any attempt to ascertain the identity of persons or households from the microdata is prohibited. Alleging that a person or household has been identified is also prohibited. (4) Report all publications based on these data to IPUMS International, which will in turn pass the information on to the relevant national statistical agencies. Once a project is approved, a password is issued and data may be acquired through the Internet. Penalties for violating the license include: revocation of the license, recall of all microdata acquired, filing of a motion of censure to the appropriate professional organizations, and civil prosecution under the relevant national or international statutes. These safeguards mirror the principles from the Joint ECE/Eurostat Work Session on Statistical Data Confidentiality. Employees of the Minnesota Population Center who work with the census microdata to produce the harmonized database also sign agreements to respect the confidentiality of the data. IPUMS International works with each country's statistical office to minimize the risk of disclosure of respondent information. The details of the confidentiality protections vary across countries, but in all cases, names and detailed geographic information are suppressed and top-codes are imposed on variables such as income that might identify specific persons. In addition, IPUMS International uses a variety of technical procedures to enhance confidentiality protection. These include the following: (1) Swapping an undisclosed fraction of records from one administrative district to another to make positive identification of individuals impossible. (2) Randomizing the placement of households within districts to disguise the order in which individuals were enumerated or the data processed. (3) Aggregating codes of sensitive characteristics (e.g., grouping together very small ethnic categories). (4) Top- and bottom-coding continuous variables to prevent identification of extreme cases. The safety record for public-use census microdata is apparently perfect. In almost four decades of use, there has not been a single verified breach of statistical confidentiality. The measures implemented by IPUMS International are designed to extend this record. |
An adapted version of the dataset, harmonized for international comparability, is available from IPUMS International (https://international.ipums.org/international/) under the following conditions:
IPUMS International distributes integrated microdata of individuals and households only by agreement of collaborating national statistical offices and under the strictest of confidence. Before data may be distributed to an individual researcher, an electronic license agreement must be signed and approved. To gain access to the data, a researcher must agree to the following:
(1) Implement security measures to prevent unauthorized access to census microdata. Under IPUMS International agreements with collaborating agencies, redistribution of the data to third parties is prohibited.
(2) Use the microdata for the exclusive purposes of scholarly research and education. Researchers must explicitly agree to not use microdata acquired for any commercial or income-generating venture.
(3) Maintain the confidentiality of persons, households, and other entities. Any attempt to ascertain the identity of persons or households from the microdata is prohibited. Alleging that a person or household has been identified is also prohibited.
(4) Report all publications based on these data to IPUMS International, which will in turn pass the information on to the relevant national statistical agencies.
Once a project is approved, a password is issued and data may be acquired through the Internet. Penalties for violating the license include: revocation of the license, recall of all microdata acquired, filing of a motion of censure to the appropriate professional organizations, and civil prosecution under the relevant national or international statutes.
These safeguards mirror the principles from the Joint ECE/Eurostat Work Session on Statistical Data Confidentiality. Employees of the Minnesota Population Center who work with the census microdata to produce the harmonized database also sign agreements to respect the confidentiality of the data.
Steven Ruggles, Lara Cleveland, Rodrigo Lovaton, Sula Sarkar, Matthew Sobek, Derek Burk, Dan Ehrlich, Quinn Heimann, Jane Lee. Integrated Public Use Microdata Series, International: Version 7.5 [dataset]. Minneapolis, MN: IPUMS, 2024. https://doi.org/10.18128/D020.V7.5
Researchers should also acknowledge the statistical agency that originally produced the data: United States, Department of the Interior. United States Census of 1900.
The licensing agreement for use of IPUMS International data requires that users supply IPUMS International with the title and full citation for any publications, research reports, or educational materials making use of the data or documentation.
Copies of such materials are also gratefully received at ipums@umn.edu.
Printed matter should be sent to:
IPUMS International
Minnesota Population Center
University of Minnesota
50 Willey Hall
225 19th Avenue South
Minneapolis, MN 55455
The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.
(c) Copyright 1900, Department of the Interior and Minnesota Population Center
| Name | 
|---|
| Department of the Interior | 
DDI_USA_1900_PHC_v01_M_v7.5_A_IPUMS
| Name | Abbreviation | Affiliation | Role | 
|---|---|---|---|
| IPUMS | IPUMS | University of Minnesota | Integration Harmonization Documentation | 
May 21, 2024
Version 7.5 October 2024. NEW FEATURES.
--Historical data from NAPP project now available from IPUMS-International.
--Historical census data from Canada, Denmark, the United Kingdom, Germany, Iceland, Norway, Sweden, and the United States for the period 1703 to 1911 are now available from IPUMS-International. The complete count and sample datasets were previously disseminated by the North Atlantic Population Project (NAPP). Where possible, the data have been integrated into existing IPUMS-International variable coding schema. Some new variables have been created that are available only for these pre-1960 datasets. NAPP data users should note that many NAPP variables are available from IPUMS-International under different names. For a complete list of NAPP variables that have been renamed in IPUMS-International, refer to the crosswalk.
--Individual country shapefiles for the third-level administrative level of geography are now available for a few IPUMS samples.
--New spatially harmonized previous-residence variables at the second administrative level of geography are available for several samples in this data release. More information is available here. Users should note that many older migration variables are available by different names. Refer to this table for a crosswalk of old and corresponding new migration variables.
--IPUMS now hosts the Census Mosaic data collection. Census Mosaic identifies, gathers, harmonizes, and distributes surviving historical census microdata from regions of Continental Europe where complete centralized records are not available. The Mosaic project was founded by a consortium of historical social scientists in Europe. Data can be downloaded as static files from the Census Mosaic website. Although the data are not yet integrated fully into IPUMS International, variables have been standardized and harmonized to be roughly compatible with IPUMS coding structures.
NEW SAMPLES.
--Full-count datasets for Great Britain 1851, 1861, 1871 (Scotland only), 1891, and 1901.
--Full-count datasets for Sweden 1910 and Denmark (1845, 1880, and 1885).
--Labor force surveys from Spain and eight new labor force surveys from Italy added to the series.
Newly added countries:
Benin, Cote d'Ivoire, Finland, Guatemala, Honduras, Laos, Lesotho, Mauritius, Myanmar, Papua New Guinea, Russia, Slovak Republic, Suriname, Togo, and Zimbabwe
New samples for:
Bolivia, Cambodia, Chile, Cuba, Cote d'Ivoire, Egypt (1848 and 1868, historical samples), Fiji, Guinea, Ireland, Israel, Italy, Lao PDR, Mexico, Morocco, Nepal, Netherlands, Palestine, Peru, Philippines, Puerto Rico, Rwanda, Senegal, Sierra Leone, South Africa, Switzerland, Uganda, United Kingdom, United States, Vietnam, and Zimbabwe
SUPPLEMENTAL DATA.
Data from censuses from Benin and Lesotho that record individual fertility and/or mortality events were made available in IPUMS-International. These files can be downloaded and linked to data produced by the extract system.
NEW VARIABLES.
--IPUMS-International now provides harmonized and year-specific geography variables for all countries including 13 new samples from Dominican Republic, Germany, Indonesia, Israel, Malaysia, Mongolia, Nicaragua, Nigeria, Palestine, Paraguay, Thailand, United Kingdom, and Uruguay. First-level and second-level year specific geography variables are also available for all countries. IPUMS provides corresponding, downloadable GIS boundary files for all harmonized and year specific geography variables. More information about IPUMS geography variables is available here.
--IPUMS International now provides spatially harmonized previous-residence variables at the first administrative level of geography. The codes for the spatially harmonized previous-residence variables match the spatially harmonized place of current residence. More information is available here.
--IPUMS International provides spatially harmonized previous-residence variables at the first administrative level of geography for all samples; previously available country-specific migration variables at the first administrative level that were not fully harmonized spatially have been phased out. Spatially harmonized previous-residence variables at the second administrative level of geography are available for selected samples. More information is available here. Users should note that many older migration variables are available by different names. Refer to this table for a crosswalk of old and corresponding new migration variables.
--IPUMS International now provides spatially harmonized previous-residence variables at the first administrative level of geography for all samples. Spatially harmonized previous-residence variables at the second administrative level of geography are available for several samples in this data release. More information is available here. Users should note that many older migration variables are available by different names. Refer to this table for a crosswalk of old and corresponding new migration variables.
--Lower (third) level geography codes and GIS files have been added for Bangladesh, China, Ethiopia, Mali, Rwanda, and Zimbabwe. Some geography codes and labels might have changed for these countries to accommodate the newer lower level geography.
--Added more detailed 3-digit industry and occupation variables for China 2000.
EDITED SAMPLES.
--Revised full-count data for Great Britain 1881
--Revised full-count datasets for Sweden 1890 and 1900. The revision includes the following changes that improve comparability across Sweden datasets:
--Revisions to certain ethnicity and work variables (and the underlying source data): ORIGIN, LABFORCE, OCCHISCO, OCRELATE, OCSTATUS.
--Revisions to unharmonized source variables: SE1890A_HISCOSE, SE1890A_HISCRELSE, SE1890A_HISCSTATSE, SE1890A_OCCMULTISE, SE1900A_HISCOSE, SE1900A_HISCRELSE, SE1900A_HISCSTATSE, SE1900A_OCCMULTISE.
--A new United States 1850 full-count dataset now matches the corresponding dataset distributed by the USA IPUMS data project. The source variable US1850A_0502 (HISTID) provides a linking key to match person records to the USA version of the data. The IPUMS International version of the data contains names, which the USA version cannot distribute.
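The linking key described above can be used to join person records across the two versions of the United States 1850 data. The following is a minimal sketch using only the Python standard library. It assumes both files have already been read into lists of dictionaries and that the key column is named `HISTID` in the USA file; the helper `link_by_histid`, the toy records, and the fields `name` and `extra` are hypothetical.

```python
def link_by_histid(intl_records, usa_records,
                   intl_key="US1850A_0502", usa_key="HISTID"):
    """Join International person records to USA records sharing a key value."""
    # Index the USA records by their key for constant-time lookup.
    usa_index = {rec[usa_key]: rec for rec in usa_records}
    linked = []
    for rec in intl_records:
        match = usa_index.get(rec[intl_key])
        if match is not None:
            # Combine both records; International fields win on name clashes.
            linked.append({**match, **rec})
    return linked


# Toy records with made-up key values, for illustration only.
intl = [
    {"US1850A_0502": "A1", "name": "Example Person"},
    {"US1850A_0502": "B2", "name": "Another Person"},
]
usa = [
    {"HISTID": "A1", "extra": 1},
    {"HISTID": "C3", "extra": 2},
]

linked = link_by_histid(intl, usa)
print(len(linked))  # 1
```

Records without a counterpart in the other file are dropped here, which corresponds to an inner join; keeping unmatched records would require a different merge rule.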
EDITED VARIABLES.
An error affecting HHWT for South Africa 2007 was corrected. The existing values were adjusted by a factor of 0.01.
AGEMARR was edited to add data for Hungary 1980 and 1990.
Harmonized and year-specific geography variables for Brazil and Colombia have been edited to accommodate refined municipal boundaries. Users should be aware that codes and labels have changed in all harmonized and year-specific geography variables for these two countries.
Errors affecting BPLSE2 (formerly BPLPARSE) for Sweden 1890 and the underlying source variable were corrected. Several thousand cases were incorrectly coded as 258101000. These cases have been updated with the correct code: 258171000.
Harmonized geography variables for Italy, Philippines, Rwanda, and United States have been edited to accommodate new samples. Users should be aware that codes and labels have changed in all harmonized and year-specific geography variables for these countries. More information about IPUMS geography variables is available here.
The codes for the source variable RW2002A_0419 were corrected to include 0 and 8 as possible responses, which were previously identified as 'unknown years' within primary education.
Errors affecting EDUCFJ for Fiji 2006 were corrected.
A problem with PERWT for Tanzania 2012 was corrected. The previous weights were adjusted to properly reflect population totals.
MOMLOC, POPLOC, and PARRULE were updated for the United States 2010 and 2015 samples to include additional information on subfamilies. Prior to this correction, persons above age 17 were not receiving links to their co-resident mothers and fathers.
An error affecting codes for the URBAN variable in Egypt 1986 for Cairo, Alexandria, Port-Said, and Suez was corrected.
An error in INCEARN affecting Venezuela 2001 was corrected. Earned income in the source variable VE2001A_0440 is interpreted as a monthly amount, so the adjustments previously applied to convert data from daily or weekly income were suppressed.
All six Brazil samples in IPUMS International were replaced with higher-density samples.
An edited version of the Chile 2017 sample was introduced to correct an error in household breaks.
Errors affecting codes for GEO1_ZA in South Africa 2011 and ENUTS1 in United Kingdom 1991 were corrected.
Harmonized geography variables for Cambodia, Fiji, and Nepal have been edited to accommodate new samples. Users should be aware that codes and labels have changed in all harmonized and year-specific geography variables for these countries. More information about IPUMS geography variables is available here.
An error in PERWT affecting Nepal 2001 was corrected.
Errors affecting a code in GQ for Brazil 2010 and Indonesia 2010 were corrected. Both census samples now identify 1-person units created by splitting a large household.
An error in MARRNUM affecting Indonesia 1976 was corrected.
Some codes for GEO1_EG2006 and GEO2_EG2006 were edited.
Harmonized geography variables for Bolivia, Cuba, Guinea, Ireland, Morocco, Palestine, Senegal, South Africa, and Uganda have been edited to accommodate new samples. Users should be aware that codes and labels have changed in all harmonized and year-specific geography variables for these countries. More information about IPUMS geography variables is available here.
An error in INCEARN affecting Brazil 1980 was corrected.
An error in EDATTAIN affecting Ireland 1971 and 1981 was corrected.
A small proportion of person records in Mexico 1960 were re-classified in MIGRATEP based on information about their current and previous residence. These were previously coded to 'different major administrative unit', even though their place of residence suggests that their last move was within the same major administrative unit.
The second-level technician (higher) degrees for Spain 1991, 2001, and 2011 were re-classified into post-secondary technical education in EDATTAIN.
An error affecting codes for SEX for Egypt 1848 and 1868 was corrected. The values for male and female had been reversed.
A problem with HHWT and PERWT for Canada 2011 was corrected. The previous weights were adjusted to properly reflect population totals.
Harmonized geography variables for Cambodia, Lao PDR, Mexico, Peru, Switzerland, Vietnam, Puerto Rico, United Kingdom, and United States have been edited to accommodate new samples. Users should be aware that codes and labels have changed in all harmonized and year-specific geography variables for these countries. More information about IPUMS geography variables is available here.
Harmonized geography variables for Chile and Sierra Leone have been edited to accommodate new samples. Users should be aware that codes and labels have changed in all harmonized and year-specific geography variables for these countries. More information about IPUMS geography variables is available here.
An error affecting codes for COMPUTER for Senegal 2013 was corrected.
An error affecting labels available in IND for Peru 1993 was corrected.
An error affecting codes for persons previously residing abroad for MIG1_5_BO in Bolivia 2001 and 2012 was corrected.
EDUCAR, EDATTAIN, and YRSCHOOL were adjusted in the Argentina samples to incorporate information on completion of education levels in the data harmonization.
HHWT and PERWT were calibrated in Kenya 1979 to properly reflect the population distribution by province.
In GQ (group quarters status), persons residing in hospitals of all types were reclassified to 'institutional group quarters' from 'other group quarters,' making their treatment consistent with GQTYPE.
Errors affecting codes for BPLBJ2 in Benin 1979, 1992, and 2002 were corrected.
Errors affecting codes for GEO2_BR1970 in Brazil 1970 were corrected.