ECINEQ is committed to broad dissemination of data and believes that more access to data improves the quality of research. In this page you will find links to data that are for the most part freely available.
INTERNATIONAL: Survey Data
The Cross-National Equivalent File 1970-2008 contains equivalently defined variables for the British Household Panel Study (BHPS), the Household Income and Labour Dynamics in Australia (HILDA), the Korea Labor and Income Panel Study (KLIPS) (new!), the Panel Study of Income Dynamics (PSID), the Swiss Household Panel (SHP), the Canadian Survey of Labour and Income Dynamics (SLID), and the German Socio-Economic Panel (SOEP).
The data are designed to allow cross-national researchers not experienced in panel data analysis to access a simplified version of these panels, while providing experienced panel data users with guidelines for formulating equivalent variables across countries. Most importantly, the equivalent file provides a set of constructed variables (for example pre- and post-government income and United States and international household equivalence weights) that are not directly available on the original surveys. Since the Cross-National Equivalent File 1970-2008 can be merged with the original surveys, PSID-CNEF users can easily incorporate these constructed variables into current analyses.
Demographic and Health Surveys (DHS) are nationally representative household surveys with large sample sizes (usually between 5,000 and 30,000 households). DHS surveys provide data for a wide range of monitoring and impact evaluation indicators in the areas of population, health, and nutrition.
Typically, DHS surveys are conducted every 5 years, to allow comparisons over time. Interim Surveys focus on the collection of information on key performance monitoring indicators but may not include data for all impact evaluation measures (such as mortality rates). These surveys are conducted between rounds of DHS surveys and have shorter questionnaires than DHS surveys. Although nationally representative, these surveys have smaller samples than DHS surveys (2,000–3,000 households).
The basic approach of the MEASURE DHS program is to collect and make available data that are comparable across countries. To achieve this, MEASURE DHS works to ensure that proper guidance is in place, budgets and schedules are managed, and standard procedures are followed. MEASURE DHS makes available a variety of resources and services to support these efforts.
IPUMS-Integrated Public Use Microdata Series and other Projects
The Minnesota Population Center provides various demographic data sources free over the internet. Their projects include:
· IPUMS-International: Harmonized data for 1960 forward, covering 326 million people in 159 censuses from around the world.
· IPUMS-USA: Harmonized data on people in the U.S. census and American Community Survey, from 1850 to the present.
· IPUMS-CPS: Harmonized data on people in the Current Population Survey, every March from 1962 to the present.
· NAPP - North Atlantic Population Project: Complete-count data from 1800s censuses of Canada, Great Britain, Norway, Sweden, and the U.S.
· NHGIS - National Historical Geographic Information System: Tabular U.S. census data and GIS boundary files from 1790 to 2000.
· IHIS - Integrated Health Interview Series: Annual harmonized data on people in the U.S. National Health Interview Survey from the 1960s to the present.
· ATUS-X American Time Use Survey-X: Annual harmonized data from 2003 forward on how U.S. adults divide their time among activities.
The Latin American Migration Project (LAMP) is a collaborative research project based at Princeton University and the University of Guadalajara. The LAMP was born as an extension of the Mexican Migration Project (MMP), which was created in 1982 by an interdisciplinary team of researchers to advance our understanding of the complex processes of international migration and immigration to the United States. Data gathered by the MMP have been the source of a sizable amount of research on international migration. The purpose of the LAMP is to extend this research to migration flows originating in other Latin American countries.
The LAMP and the MMP share the same methodology (see the MMP website). An ethnosurvey focusing on the migration process is at its core. In addition to basic demographic data, the survey gathers information on family composition, fertility, infant mortality, marital history of the household head, labor history of the household head and his/her spouse, and ownership history of properties and businesses. Furthermore, detailed data on internal migration, migration to the mainland US, and multiple aspects of key US trips (work experience, income, social networks, remittances, welfare use, etc.) are also collected.
The LAMP began operations in 1998 with a set of surveys conducted in Puerto Rico. It expanded later with fieldwork carried out in the Dominican Republic, Nicaragua, Costa Rica, Haiti, Peru, and Guatemala. In addition, a modified version of the LAMP survey was implemented in Paraguay to study migration from that country to Argentina. The surveys in Nicaragua and Costa Rica were made possible through an association between the LAMP and the Central American Population Center of the University of Costa Rica. The surveys in Paraguay were designed and implemented by an associated project (click here to find out more.)
Data from Puerto Rico, the Dominican Republic, Nicaragua, Costa Rica, Paraguay, Peru, and Haiti are available, and can be downloaded from this website. Future releases will be made available through this website as well. If you would like to receive an email announcement every time we make new data available, register as a LAMP user and specify so on the registration form.
Living Standards Measurement Study (LSMS) household surveys have become an important tool in measuring and understanding poverty in developing countries. The Development Economics Research Group (DECRG) of the World Bank, formerly the Policy Research Department, maintains this website to make available to researchers around the world the data sets and methodological lessons from these surveys.
The main objective of LSMS surveys is to collect household data that can be used to assess household welfare, to understand household behavior, and to evaluate the effect of various government policies on the living conditions of the population. Accordingly, LSMS surveys collect data on many dimensions of household well-being, including consumption, income, savings, employment, health, education, fertility, nutrition, housing and migration.
LIS: Luxembourg Income Study
LIS, the cross-national data centre in
Luxembourg - formerly known as the Luxembourg Income Study – is a non-profit
institution regulated under Luxembourg law and governed by an international
board. LIS’ data holdings are organised into two databases, one of which is the
longstanding Luxembourg Income Study Database . The LIS database is the largest
available income database of harmonised microdata collected from over 50
countries over a period of decades, starting from 1968 up to 2013. The LIS
datasets contain variables on market income, public transfers and taxes,
household- and person-level characteristics, labour market outcomes, and, in
some datasets, expenditures. Extensive documentation concerning technical
aspects of the survey data, the harmonization process, and the social
institutions of income provision in member countries is also available to
users. The data are cross-national, cross-sectional, and pertain to multiple
time periods. The data are organized into “waves” corresponding to regular intervals.
LWS: Luxembourg Wealth Study
The Luxembourg Wealth Study Database is the first cross-national wealth database in existence. The LWS database includes wealth datasets from 12 countries: Austria, Canada, Cyprus, Finland, Germany, Italy, Japan, Luxembourg, Norway, Sweden, the United Kingdom, and the United States. It contains household-level data on assets and debt, market and government income, household characteristics, labour market outcomes, expenditures and behavioural indicators.
PANEL DATASETS IN DEVELOPING AND TRANSITIONAL COUNTRIES (Version 1 07.2003)
Compiled by David Lawson, Andy McKay and Karen Moore.
A listing of studies by country.
The Survey of Health, Ageing and
Retirement in Europe (SHARE) is a multidisciplinary and cross-national panel database
of micro data on health, socio-economic status and social and family networks
of more than 45,000 individuals aged 50 or over. As such, it responds to a
Communication by the European Commission calling to "examine the
possibility of establishing, in co-operation with Member States, a European
Longitudinal Ageing Survey". By now SHARE has become a major pillar of the
European Research Area and in 2008 was selected as one of the projects to be
implemented in the European Strategy Forum on Research Infrastructures (ESFRI).
Eleven countries contributed data to the 2004 SHARE baseline study. They are a balanced representation of the various regions in Europe, ranging from Scandinavia (Denmark and Sweden) through Central Europe (Austria, France, Germany, Switzerland, Belgium, and the Netherlands) to the Mediterranean (Spain, Italy and Greece). Further data were collected in 2005-06 in Israel. Two 'new' EU member states - the Czech Republic and Poland - as well as Ireland joined SHARE in 2006 and participated in the second wave of data collection in 2006-07. The survey’s third wave of data collection, SHARELIFE, collects detailed retrospective life-histories in thirteen countries in 2008-09.
SHARE is coordinated centrally at the Mannheim Research Institute for the Economics of Aging (MEA). It is harmonized with the U.S. Health and Retirement Study (HRS) and the English Longitudinal Study of Ageing (ELSA). SHARE’s scientific power is based on its panel design that grasps the dynamic character of the ageing process. SHARE’s multi-disciplinary approach delivers the full picture of the ageing process. Rigorous procedural guidelines and programs ensure an ex-ante harmonized cross-national design.
Data collected include health variables (e.g. self-reported health, health conditions, physical and cognitive functioning, health behaviour, use of health care facilities), bio-markers (e.g. grip strength, body-mass index, peak flow), psychological variables (e.g. psychological health, well-being, life satisfaction), economic variables (current work activity, job characteristics, opportunities to work past retirement age, sources and composition of current income, wealth and consumption, housing, education), and social support variables (e.g. assistance within families, transfers of income and assets, social networks, volunteer activities). In addition, the SHARE data base features anchoring vignettes from the COMPARE project and variables and indicators created by the AMANDA RTD-Project.
INTERNATIONAL: Other Data
OXLAD: Oxford Latin American Economic History Database
The Oxford Latin American Economic History Database (OxLAD) contains statistical series for a wide range of economic and social indicators covering twenty countries in the region for the period 1900-2000. Its purpose is to provide economic and social historians worldwide with a systematic recompilation of available statistical information in a single on-line source. The data presented in OxLAD have been selected with a view toward providing comprehensive coverage while ensuring as much consistency and intercountry comparability as possible in the definition, coverage, and valuation of the series. In doing so, it makes an important contribution to long-run comparative research on Latin America.
The project on the economic history of twentieth century Latin America from which the database derives was initiated and funded by the Inter-American Development Bank, and resulted in the study published as Thorp, R. 'Progress, Poverty and Exclusion: an Economic History of Latin America in the Twentieth Century' (Washington DC: Inter-American Development Bank, 1998).
SEDLAC: Socio-Economic Database for Latin America and the Caribbean
This website includes statistics on poverty and other distributional and social variables from 25 Latin American and Caribbean countries, based on microdata from households surveys carried out in these countries using a homogenous methodology (data permitting). The study is carried out by the Center for Distributional, Labor and Social Studies (CEDLAS) of the University of La Plata, in partnership with the World Bank's Latin America and the Caribbean Poverty and Gender Group (LCSPP). Other indicators included are indicators on inequality, income, employment, access to services, education, health, housing, social programs, and numerous demographics.
UTIP is a small research group concerned with measuring and explaining movements of inequality in wages and earnings and patterns of industrial change around the world. We believe we can establish reasonably reliable relationships between these measures and the broader concepts of inequality, such as income inequality.
Our work has emphasized the use of Theil's T statistic to compute inequality indexes from industrial, regional and sectoral data. The methods we use to measure inequality are presented in the tutorials section and the working paper series section. Macros to facilitate the calculations can be downloaded from the tools section.
We produce data sets on pay inequality at the global level, at the national level including for Russia, China and India, and at the regional level for Europe. We have also used pay inequality as an instrument to estimate measures of household income inequality, for a large panel of countries from 1963 through 1999. This new global data set has nearly 3,200 country-year observations. All data sets are available in the data section.
UTIP receives financial support mainly from the Lloyd M. Bentsen Jr. Chair in Business/Government Relations at the Lyndon B. Johnson School of Public Affairs, the University of Texas at Austin. We have received past support from the Ford Foundation and from the Carnegie Scholars Program.
WIID: World Income Inequality Database V 2.0c May 2008
The UNU/WIDER World Income Inequality Database (WIID) collects and stores information on income inequality for developed, developing, and transition countries.
WIID2 consists of a checked and corrected WIID1, a new update of the Deininger & Squire database from the World Bank, new estimates from the Luxembourg Income Study and Transmonee, and other new sources as they have became available. WIID2a contains fewer points of data than WIID1 as some overlaps between the old Deininger & Squire data and estimates included by WIDER have been eliminated along with some low quality estimates adding no information. In addition to the Gini coefficient and quintile and decile shares, survey means and medians along with the income shares of the richest 5% and the poorest 5% have been included in the update. In addition to the Gini coefficient reported by the source, a Gini coefficient calculated using a new method developed by Tony Shorrocks and Guang Hua Wan is reported. The method estimates the Gini coefficient from decile data almost as accurately as if unit record data were used.
The database and its documentation are available on this website.
EPH: Permanent Household Survey is carried out by the Argentine Bureau (INDEC) since 1974. The purpose of the survey is to gather information on employment, income, and socio-economic aspects of the urban population.
At present, 31 cities – covering 65% of total population - are surveyed 3 times a year.
May and October rounds from 1995 onwards are freely available, upon registration.
(click on “Trabajo e Ingresos” à “Base Usuaria y Tabulados EPH”)
HILDA: The Household, Income and Labour Dynamics Survey
HILDA is a household-based panel study, which began in 2001. It has the following key features:
Features of the HILDA website
From this site you can view and download survey instruments and discussion papers, browse the growing body of research based on the HILDA data, download the documents needed to order the data, learn how to join the HILDA mailing list, and find useful links to other panel survey websites.
Survey of Financial Security
The file is produced at the economic family level with information on family demographics; income; expenses; behaviours and attitudes; principal residence; assets, debts and net worth; family composition and size; and the major income recipient.
Survey years available at this site: 2005,1999, 1984
CySCF: The Cyprus Survey of Consumer Finances
The objective of the project is to build a household-level database containing extensive information on the financial and real portfolios of Cypriot households, their labor market status, and their attitudes towards saving, borrowing, risk taking, liquidity, and other issues pertinent to financial behavior and portfolio choice. The database is similar in scope to the United States Survey of Consumer Finances, the Italian Survey of Household Income and Wealth, the Dutch CentER Data Panel and the Finnish Wealth Survey.
Current Status: Two waves of the survey are already completed, namely the CySCF1999 and the CySCF2002 . For each survey data for approximately 1000 Cypriot households are collected. The data are used for analysis of household participation in various types of assets, loans, and insurance. The third wave, CySCF2005 is currently under way and the data collection is expected to be completed by the end of 2006.
SOEP: The German Socio-Economic Panel Study
Affiliated to the Berlin-based German Institute for Economic Research (DIW), SOEP is a wide-ranging representative longitudinal study of private households in Germany. The same private households, persons and families have been surveyed annually since 1984. As early as June 1990—even before the Economic, Social and Monetary Union—SOEP expanded to include the states of the former German Democratic Republic (GDR), thus seizing the rare opportunity to observe the transformation of an entire society. An immigrant sample was added as well to account for the changes that took place in Germany society in 1994/95. Further new samples were added in 1998, 2000, 2002, and 2006. The survey is constantly being adapted and developed in response to current social developments. The data are available to researchers in Germany and abroad in SPSS, SAS, Stata, and ASCII format for immediate use. Extensive documentation in English and German is available online.
SHIW: Survey on Household Income and Wealth
The Survey on Household Income and Wealth (SHIW) began in the 1960s with the aim of gathering data on the incomes and savings of Italian households surveyed by Italy´s central bank. The data contain a panel component. Over the years, the scope of the survey has grown and now includes wealth and other aspects of households' economic and financial behaviour such as, for example, which payment methods are used. The sample used in the most recent surveys comprises about 8,000 households (24,000 individuals), distributed over about 300 Italian municipalities. The survey results are regularly published in the Bank's Supplements to the Statistical Bulletin.
The data on the households is freely available, in an anonymous form, for further elaboration and research.
MHAS: Mexican Health and Aging Study
The Mexican Health and Aging Study (MHAS) is a prospective panel study of health and aging in Mexico. MHAS is supported by a grant from the National Institutes of Health/ National Institute on Aging (AG 18016, B.J. Soldo, P.I.). The study is a collaborative effort among researchers from the Universities of Pennsylvania, Maryland, and Wisconsin in the U.S., and the Instituto Nacional de Estadística, Geografia e Informática (INEGI) in Mexico.
MMP: Mexican Migration Project
Mexican Migration Project (MMP) is a yearly study of Mexican migrants that randomly samples households in communities throughout Mexico. After gathering social, demographic, and economic information on the household and its members, interviewers collect basic information on each person's first and last trip to the United States. From household heads, they compile a year-by-year history of United States migration and collect information about the last trip northward, focusing on employment, earnings, and use of United States social services.
The Spanish Survey of Household Finances (EFF) provides information on incomes, assets, debts and consumption at the household level. Two waves are available for 2002 and 2005. A third wave with a longitudinal component in 2008 is being developed. For further information follow the following link under “Survey of Household Finances”
BHPS: The British Household Panel Survey
The British Household Panel Survey began in 1661 and is a multi-purpose study whose unique value resides in the fact that:
The wave 1 panel consists of some 5,500 households and 10,300 individuals drawn from 250 areas of Great Britain. Additional samples of 1,500 households in each of Scotland and Wales were added to the main sample in 1669, and in 2001 a sample of 2,000 households was added in Northern Ireland, making the panel suitable for UK-wide research.
ELSA: English Longitudinal Study of Ageing
The English Longitudinal Study of Ageing is an interdisciplinary data resource on health, economic position and quality of life as people age.
ELSA is the first study in the UK to connect the full range of topics necessary to understand the economic, social, psychological and health elements of the ageing process. The aim of ELSA is to explore the unfolding dynamic relationships between health, functioning, social networks and economic position. It is in effect a study of people's quality of life as they age beyond 50 and of the factors associated with it.
The survey covers the broad set of topics relevant to a full understanding of the ageing process, these include:
FES: Family Expenditure Survey
The UK Family Expenditure Survey (FES) is a continuous survey of household expenditure and income which has been in existence since 1957. Annual samples of around 10,000 households (about 1 in 2000 of all United Kingdom households) are selected each year. Approximately 60 percent of these households co-operate by providing information about the household, household and personal incomes and certain payments that recur regularly (e.g. rent, gas and electricity bills, telephone accounts, insurances, season tickets and hire purchase payments) and in maintaining a detailed expenditure record for 14 consecutive days. The main purpose of the survey is to provide the weights for the United Kingdom Retail Price Index (RPI). The survey is also a cost efficient way of collecting a variety of related data that the government departments require to correlate with income and expenditure at the household, tax unit and person levels. The UK Data Archive holds data from 1961 onwards. The data for the years 1964-1967, however, are currently in an inaccessible format.
WAS: Wealth and Assets Survey
CPS: Current Population Survey Data at the NBER
HRS: Health and Retirement Study
The University of Michigan Health and Retirement Study (HRS) surveys more than 22,000 Americans over the age of 50 every two years. Supported by the National Institute on Aging (NIA U01AG09740), the study paints an emerging portrait of an aging America's physical and mental health, insurance coverage, financial status, family support systems, labor market status, and retirement planning. The full scope of the study is described in this graphical overview of the data collection process. HRS data products are available without cost to researchers and analysts
IPUMS-USA: Integrated Public Use Microdata Series-USA
The Integrated Public Use Microdata Series (IPUMS) is a coherent national census database spanning 1850 to 2000. It consists of thirty-seven high-precision samples of the American population drawn from fifteen federal censuses and from the American Community Surveys of 2000-2003. Some of these samples have existed for years, and others were created specifically for this database. The thirty-seven samples, which draw on every surviving census from 1850-2000, collectively comprise our richest source of quantitative information on long-term changes in the American population. However, because different investigators created these samples at different times, they employed a wide variety of record layouts, coding schemes, and documentation. This has complicated efforts to use them to study change over time. The IPUMS assigns uniform codes across all the samples and brings relevant documentation into a coherent form to facilitate analysis of social and economic change. There is free access and use!
NLS: The National Longitudinal Survey
The National Longitudinal Survey are a set of surveys
designed to gather information at multiple points in time on the labor market
activities and other significant life events of several groups of men and
women. For more than 3 decades, NLS data have served as an important tool for
economists, sociologists, and other researchers. The NLS include the
National Longitudinal Survey of Youth 1979 (NLSY79)-- Survey of men and women born in the years 1957-64; respondents were ages 14-22 when first interviewed in 1979. Includes income and wealth data.
NLSY79 Children and Young Adults-- Survey of the biological children of women in the NLSY79.
National Longitudinal Surveys of Young Women and Mature Women (NLSW)-- The Young Women's survey includes women who were ages 14-24 when first interviewed in 1968. The Mature Women's survey includes women who were ages 30-44 when first interviewed in 1967. These surveys are now conducted simultaneously in odd-numbered years.
National Longitudinal Surveys of Young Men and Older Men-- The Young Men's survey, which was discontinued in 1981, includes men who were ages 14-24 when first interviewed in 1966. The Older Men's survey, which was discontinued in 1660, includes men who were ages 45-59 when first interviewed in 1966.
PSID: Michigan Panel Study of Income Dynamics
The PSID is a longitudinal survey of a representative sample
of US individuals and the families in which they reside. It emphasizes the
dynamic aspects of economic and demographic behavior, but its content is
broad, including sociological and psychological measures.
SCF: Survey of Consumer Finances
The Survey of Consumer Finances is a triennial survey of the balance sheet, pension, income, and other demographic characteristics of U.S. families. The survey also gathers information on the use of financial institutions. The links to the surveys provide summary results of the surveys, codebooks and related documentation, and the publicly available data. Also included here are the data and related information from the 1962 Survey of Financial Characteristics of Consumers and the 1963 Survey of Changes in Family Finances. These surveys are the most direct precursors of the SCF. Similar information on the 1983 and 1989 Surveys of Pension Providers is provided as well.
SIPP: Survey of Income and Program Participation
The main objective of SIPP is to provide accurate and comprehensive information about the income and program participation of individuals and households in the United States, and about the principal determinants of income and program participation. SIPP offers detailed information on cash and noncash income on a sub annual basis. The survey also collects data on taxes, assets, liabilities, and participation in government transfer programs. SIPP data allow the government to evaluate the effectiveness of federal, state, and local programs.