More than two decades of data on childhood, poverty and inequality — open to researchers worldwide.
Household and child surveys data
Young Lives has completed seven rounds of quantitative surveys. In rounds 1–5, data was collected through in-person interviews with primary caregivers and index children. In 2020–21, due to the COVID-19 pandemic, Round 6 was conducted through five phone surveys. Round 7, in 2023–24, marked the return to face-to-face interviews after the COVID-19 pandemic. On this occasion, data were not collected in Vietnam.
Data was collected using paper questionnaires for Rounds 1–3, from Round 4 onwards, data has been collected using computer assisted programming interview (CAPI), in which interviewers record participants answers using a tablet.
For each round, we release data per country for the Older Cohort, the Younger Cohort and data at the community level. For each cohort, there is a main dataset with information collected from the Young Lives participant (from their caregivers in the earlier rounds) and additional long format datasets (e.g. Household roster).
Data from Rounds 1–7 are all available via the UK Data Service in SPSS and STATA format with the following study numbers:
- Young Lives: an International Study of Childhood Poverty: Round 1, 2002 (SN 5307)
- Young Lives: an International Study of Childhood Poverty: Round 2, 2006 (SN 6852)
- Young Lives: an International Study of Childhood Poverty: Round 3, 2009 (SN 6853)
- Young Lives: an International Study of Childhood Poverty: Round 4, 2013–14 (SN 7931)
- Young Lives: an International Study of Childhood Poverty: Round 5, 2016 (SN 8357)
- Round 6 (2020–21)
- Young Lives: An International Study of Childhood Poverty: Round 7, 2023–24 (SN 9538)
To make the data accessible and to support data users, the Young Lives study also prepares a constructed dataset for each of the study countries, exploiting the longitudinal nature of the data. The latest version includes data from Rounds 1–7. One main constructed data file is available for each of the four countries. These are presented in a panel format and contain approximately 200 original and constructed variables, with the majority comparable across all seven rounds. A companion technical note is also included for information.
- Young Lives: An International Study of Childhood Poverty: Rounds 1–7 Constructed Files, 2002–26 (SN 9543)
- Technical Note 63: A Guide to Young Lives Constructed Datasets: Rounds 1 to 7
School Surveys Data
Data from the Ethiopia, India, Peru and Vietnam school surveys are all available via the UK Data Service with the following study numbers:
- Young Lives: School Survey, Ethiopia, 2012–13 (SN 7823)
- Young Lives: School Survey, India, 2010–11 (SN 7478)
- Young Lives: School Survey, Peru, 2011 (SN 7479)
- Young Lives: School Survey, Vietnam, 2011–12 (SN 7663)
The data are hierarchically structured at the pupil, teacher, class, school site and head teacher level.
Additionally, in 2020, Young Lives conducted the Head Teacher Telephone Survey in Ethiopia and India to understand how the schools were being affected by the COVID-19 pandemic. The survey investigated how schools provided support to children and families while schools remained closed, the effects of this on children's learning, and their plans for reopening.
- Young Lives: Head Teacher Telephone Survey, Ethiopia and India, 2020 (SN9007)
Qualitative Research Data
Data from our longitudinal qualitative research are not archived in the same way as the survey data because of concerns about confidentiality. The data is only available for Young Lives researchers and has been used to write extensive reports. Please visit our Publications page to explore our qualitative findings.
Matched data
Young Lives research has expanded to explore linking geographical data collected during the rounds to external datasets. Matching Young Lives data with administrative and geographic datasets significantly increases the scope for research in several areas, it may also allow researchers to identify sources of exogenous variation for more convincing causal analysis on policy and/or early life circumstances. We have released linked datasets on the UK Data Service:
- Young Lives: Data Matching Series, 1900–21 (SN 9251). This includes the following linked datasets:
1. Climate Matched Datasets (four Young Lives study countries): Community-level GPS data has been matched with temperature and precipitation data from the University of Delaware. Climate variables are offered at the community level, with a panel data structure spanning across years and months. Hence, each community has a unique value of precipitation (variable PRCP) and temperature (variable TEMP), for each year and month pairing for the period 1900-2017.
2. COVID-19 Matched Dataset (Peru only): The Young Lives Phone Survey Calls data has been matched with external data sources (The Peruvian Ministry of Health and the National Information System of Deaths in Peru). The matched dataset includes the total number of COVID cases per 1,000 inhabitants, the total number of COVID deaths by district and per 1,000 inhabitants; the total number of excess deaths per 1,000 inhabitants and the number of lockdown days in each Young Lives district in Peru during August 2020 to December 2021.
Further information is available in the following technical notes:
More than two decades of data on childhood, poverty and inequality — open to researchers worldwide.
Household and child surveys data
Young Lives has completed seven rounds of quantitative surveys. In rounds 1–5, data was collected through in-person interviews with primary caregivers and index children. In 2020–21, due to the COVID-19 pandemic, Round 6 was conducted through five phone surveys. Round 7, in 2023–24, marked the return to face-to-face interviews after the COVID-19 pandemic. On this occasion, data were not collected in Vietnam.
Data was collected using paper questionnaires for Rounds 1–3, from Round 4 onwards, data has been collected using computer assisted programming interview (CAPI), in which interviewers record participants answers using a tablet.
For each round, we release data per country for the Older Cohort, the Younger Cohort and data at the community level. For each cohort, there is a main dataset with information collected from the Young Lives participant (from their caregivers in the earlier rounds) and additional long format datasets (e.g. Household roster).
Data from Rounds 1–7 are all available via the UK Data Service in SPSS and STATA format with the following study numbers:
- Young Lives: an International Study of Childhood Poverty: Round 1, 2002 (SN 5307)
- Young Lives: an International Study of Childhood Poverty: Round 2, 2006 (SN 6852)
- Young Lives: an International Study of Childhood Poverty: Round 3, 2009 (SN 6853)
- Young Lives: an International Study of Childhood Poverty: Round 4, 2013–14 (SN 7931)
- Young Lives: an International Study of Childhood Poverty: Round 5, 2016 (SN 8357)
- Round 6 (2020–21)
- Young Lives: An International Study of Childhood Poverty: Round 7, 2023–24 (SN 9538)
To make the data accessible and to support data users, the Young Lives study also prepares a constructed dataset for each of the study countries, exploiting the longitudinal nature of the data. The latest version includes data from Rounds 1–7. One main constructed data file is available for each of the four countries. These are presented in a panel format and contain approximately 200 original and constructed variables, with the majority comparable across all seven rounds. A companion technical note is also included for information.
- Young Lives: An International Study of Childhood Poverty: Rounds 1–7 Constructed Files, 2002–26 (SN 9543)
- Technical Note 63: A Guide to Young Lives Constructed Datasets: Rounds 1 to 7
School Surveys Data
Data from the Ethiopia, India, Peru and Vietnam school surveys are all available via the UK Data Service with the following study numbers:
- Young Lives: School Survey, Ethiopia, 2012–13 (SN 7823)
- Young Lives: School Survey, India, 2010–11 (SN 7478)
- Young Lives: School Survey, Peru, 2011 (SN 7479)
- Young Lives: School Survey, Vietnam, 2011–12 (SN 7663)
The data are hierarchically structured at the pupil, teacher, class, school site and head teacher level.
Additionally, in 2020, Young Lives conducted the Head Teacher Telephone Survey in Ethiopia and India to understand how the schools were being affected by the COVID-19 pandemic. The survey investigated how schools provided support to children and families while schools remained closed, the effects of this on children's learning, and their plans for reopening.
- Young Lives: Head Teacher Telephone Survey, Ethiopia and India, 2020 (SN9007)
Qualitative Research Data
Data from our longitudinal qualitative research are not archived in the same way as the survey data because of concerns about confidentiality. The data is only available for Young Lives researchers and has been used to write extensive reports. Please visit our Publications page to explore our qualitative findings.
Matched data
Young Lives research has expanded to explore linking geographical data collected during the rounds to external datasets. Matching Young Lives data with administrative and geographic datasets significantly increases the scope for research in several areas, it may also allow researchers to identify sources of exogenous variation for more convincing causal analysis on policy and/or early life circumstances. We have released linked datasets on the UK Data Service:
- Young Lives: Data Matching Series, 1900–21 (SN 9251). This includes the following linked datasets:
1. Climate Matched Datasets (four Young Lives study countries): Community-level GPS data has been matched with temperature and precipitation data from the University of Delaware. Climate variables are offered at the community level, with a panel data structure spanning across years and months. Hence, each community has a unique value of precipitation (variable PRCP) and temperature (variable TEMP), for each year and month pairing for the period 1900-2017.
2. COVID-19 Matched Dataset (Peru only): The Young Lives Phone Survey Calls data has been matched with external data sources (The Peruvian Ministry of Health and the National Information System of Deaths in Peru). The matched dataset includes the total number of COVID cases per 1,000 inhabitants, the total number of COVID deaths by district and per 1,000 inhabitants; the total number of excess deaths per 1,000 inhabitants and the number of lockdown days in each Young Lives district in Peru during August 2020 to December 2021.
Further information is available in the following technical notes:

