af-ahead-v1
African Harmonised Early-Grade Assessment Dataset
AHEAD
| Name | Country code |
|---|---|
| Africa | AFR |
| Type | Identifier |
|---|---|
| DOI | https://doi.org/10.25828/2287-mw84 |
The AHEAD dataset pools and harmonizes microdata from Early Grade Reading Assessment (EGRA) and Early Grade Mathematics Assessment (EGMA) surveys conducted in African countries. Each constituent survey was originally captured with its own variable names, numeric codes, and value labels. The harmonization assigns a single canonical variable name, code, label scheme to comparable items across surveys to produce one analysis-ready pooled data file suitable for cross-country and cross-time comparative research on early-grade literacy and numeracy across diverse educational contexts.
Survey-level documentation, including assessment sub-tasks, sampling design commands, and source-linking instructions, is available in the AHEAD Study Catalogue: [https://datafirst-courses.github.io/egra-egma/]
Harmonized survey microdata
Individual pupil (learner)
v1.1: Edited, anonymised data for public distribution
2026-06-29
DataFirst
National coverage
Early-grade primary school pupils assessed in the respective EGRA/EGMA samples. Exact target grades and eligibility rules differ by survey
| Name | Affiliation |
|---|---|
| African Foundational Learning Data Hub | DataFirst |
| Name | Role |
|---|---|
| Gates Foundation | Financial support |
Constituent surveys used different sample designs. Harmonized survey design variables are present in the pooled file where supplied by the original survey. There is no single pooled sampling frame. Weighted and sampling error estimation analysis requires a separate survey design specification for each source.
Early Grade Reading Assessment (EGRA) and Early Grade Mathematics Assessment (EGMA) instruments. Original instruments, languages of assessment, and field protocols differ by constituent survey.
Cross-section [cross section]
| Name | Affiliation | Abbreviation |
|---|---|---|
| United States Agency for International Development | US Government | USAID |
Full description of the cleaning operations can be found in the Reference Guide in the Downloads tab.
Known limitations, residual unresolved codes, and cross-survey variable availability are documented in the User Guide and Appendix C (Variable Availability Matrix).
DataFirst data repository
Creative Commons CC-BY 4.0 attribution license
African Foundational Learning Data Hub. African Harmonised Early-Grade Assessment Data [dataset]. Version 1. Cape Town: AFLEARN [producers], 2026. Cape Town: DataFirst [distributor], 2026. DOI: https://doi.org/10.25828/2287-mw84
Researchers agree to cite the data in their publications using the recommended citation in this metadata record, including the DOI (unique dataset identifier)
Researchers agree to send DataFirst a link to any research publication based on the data
| Name | Abbreviation | Affiliation |
|---|---|---|
| DataFirst | DF | University of Cape Town |
2026-06-29
Version 1