Abstract
Estimates of mean IQ scores for different nations have engendered controversy since their first publication in 2002. While some researchers have used these mean scores to identify relationships between the scores and other national-level variables (e.g., economic and health variables) or test theories, others have argued that the scores are without merit and that any study using them is inherently and irredeemably flawed. The purpose of this article is to evaluate the quality of estimates of mean national IQs, discuss the validity of different interpretations and uses of the scores, point out shortcomings of the dataset, and suggest solutions that can compensate for the deficiencies in the data underpinning the estimated mean national IQ scores. My hope is that the scientific community can chart a middle course and reject the false dichotomy of either accepting the scores without reservation or rejecting the entire dataset out of hand.
Similar content being viewed by others
Data Availability
Data for new analyses in this article are available in the accompanying Supplemental file.
Notes
This Flynn effect adjustment is often misunderstood. It does not increase or decrease the score of the country to reflect the age of the test. Rather, it adjusts the international IQ standard (where 100 = the mean in the UK) to the year of the test administration in a country so that the country’s measured IQ is compared to the estimated standard for the same year.
Only one sample had an overall quality rating of .18; it was collected in the United States. Four samples achieved an overall quality rating of .90. The data for these samples were collected in Tajikistan, the UK, the USA, and Yemen.
The width of a confidence interval is equal to \(\pm \ 1.96(\frac{\sigma }{\sqrt{n}})\), where \(\frac{\sigma }{\sqrt{n}}\) is equal to the standard error of the mean, σ = 15 (the default SD of a population on the IQ metric), and n is the combined sample size of all samples that contribute to a country’s mean IQ estimate (Warne, 2021, pp. 199–201).
These statistics are calculated using the absolute value of the differences between the QNW + SAS + GEO IQ in the Lynn and Becker (2019b) dataset and the IQ + GEO IQ for the previous version.
Listed in descending order of the magnitude of IQ change: Nicaragua (− 23.78 IQ points); Haiti (21.60 IQ points); Honduras (− 18.84 IQ points); Nepal (− 18.00 IQ points); Guatemala (− 17.71 IQ points); Saint Helena, Ascension, and Tristan da Cunha (− 17.01 IQ points); Belize (− 16.25 IQ points); Cabo Verde (− 16.00 IQ points); Morocco (− 15.39 IQ points); Yemen (− 14.39 IQ points); Mauritania (− 14.00 IQ points); Chad (11.83 IQ points); Saint Lucia (11.71 IQ points); Barbados (11.69 IQ points); Senegal (− 10.50 IQ points); Republic of the Congo (− 10.03 IQ points); Côte d’Ivoire (− 10.02 IQ points); and Vanuatu (10.02 IQ points). Positive values in this list indicate that the new IQ estimates from Lynn and Becker (2019b) are higher than the earlier estimate. Negative values indicate the new value is lower.
The PERCE 1997 and SERCE 2006 data are taken from official publications reporting country means for each grade level and subject (Oficina Regional de Educación para América Latina y el Caribe/UNESCO, 2001, p. 176; 2008, Tables A.3.1, A.3.5, A.4.1, A.4.5, and A.5.1). TERCE 2013 and ERCE 2019 data can be downloaded at https://raw.githubusercontent.com/llece/comparativo/main/datos_grafico_1-1.csv
Cuba did not participate in TERCE 2013. Its ERCE 2019 data are much more similar to data from other countries in Latin America.
The 1995 SACMEQ test only produced reading scores. The 2000 and 2007 SACMEQ tests produced a reading and mathematics score. The 2000 and 2007 scores were combined as an unweighted mean for each country when calculating correlations with the estimated national-level IQs.
The correlations between PASEC scores and the GEO IQ scores from the Lynn and Becker (2019b) dataset—i.e., with Benin and Burkina Faso removed—are r = .142 (for PASEC grade 2 language), r = .153 (for PASEC grade 2 mathematics), r = − .662 (for PASEC grade 6 language), and r = −.262 (for PASEC grade 6 mathematics). This does not change the conclusion that geographically imputed scores have a poor correspondence with data drawn from a country.
The national PIRLS/TIMSS scores and the chart to convert LLECE and PASEC scores to PIRLS/TIMSS scores to one another are available at https://www.cgdev.org/sites/default/files/patel-sandefur-human-capital-final-results.xlsx
Two regions, England and Northern Ireland, were part of the same country. When calculating correlations with QNW + SAS IQs, the Northern Ireland data were dropped, and the data for England was compared to QNW + SAS IQs for the entire UK.
Sear’s (2022) criticism of using IQ data from children to estimate IQs for an entire population shows that she does not understand that IQ scores are calculated by comparing examinees to their age peers. This functionally controls for age and allows scores from different age groups to have the same meaning. For an accessible explanation of how IQ scores are calculated, see Warne (2020), pp. 5–9.
Readers may be aware of Lim et al.’s (2018) study that measures human capital in 195 countries. These scores are not included in the discussion in this article because the underlying data are not solely cognitive/educational scores. Lim et al. (2018) also used health data and longevity/life expectancy data in the calculation of their human capital scores. Therefore, the Lim et al. (2018) data cannot be interpreted as a cognitive measure, which makes it inadequate to use for convergent validity purposes when studying the Lynn and Becker (2019b) dataset.
Available at https://datacatalog.worldbank.org/search/dataset/0038001.
This statistical truism is why the Flynn effect (a purely environmental effect) can coexist with high heritability (a variance statistic measuring the strength of generic influence on a phenotype in a population) of IQ. The same secular mean increase occurred in height (a phenotype with high heritability) in many countries during the twentieth century. Changes in the mean do not automatically result in changes in the variance—and vice versa.
It is important to recognize that mean QNW + SAS IQs below 70 are also found in some Central American nations (Belize, El Salvador, Guatemala, Honduras, Nicaragua), the Caribbean (Dominica and Saint Vincent and the Grenadines), and Morocco, Nepal, and Yemen.
For the 2018 PISA, the SD for the UK data was 93 for math scores and 99 for science scores (Schleicher, 2019, pp. 7–8). In these calculations, I used the standard deviation of 99 to be more conservative. My choice of standard deviation will not affect any correlations, but it will change differences between these IQs and others and make outlier national mean IQs slightly less extreme.
This is not an artifact of the extrapolation based on nearby countries’ data that Gust et al. (2022) used. The correlation between scores for the 12 countries that had imputed data in both datasets was r = .608; for the 13 countries that had geographically imputed scores in the Lynn and Becker (2019b) dataset and scores based on educational achievement testing data in the Gust et al. (2022) dataset, the correlation was r = .511. The average difference between the two sets of scores is also similar.
Gust et al. (2022, p. A1) noted that Angrist et al.’s (2021) method overestimates academic achievement HLOs, compared to the Gust et al. (2022) method. The average scores in Table 2 are much more similar than would be expected because of the different means for the UK that were used to calculate z-scores and IQs. The HLO mean for the UK is 527.8 in the Angrist et al. (2021) data, compared to the Gust et al. (2022) mean of 503.2. The higher HLO mean for the UK provides a correction to the HLO scores, when converted to IQs, and makes the weighted mean IQs for both datasets in Table 2 much more similar.
The QNW + SAS IQs for these countries are 69.45 (Botswana), 60.98 (Ghana), and 69.80 (South Africa). However, note that these are not independent of the PIRLS and TIMSS data because Lynn and Becker (2019b) used the educational achievement data to calculate SAS IQs, which contributed data to the QNW + SAS IQs.
The largest discrepancies were for the Dominican Republic (+ 20.11 IQ points), Yemen (+ 19.34 IQ points), Tunisia (+ 12.39 IQ points), Argentina (+ 11.45 IQ points), Kuwait (+ 10.59 IQ points), and Honduras (− 10.47 IQ points). In this list, positive numbers indicate a higher QNW + SAS score in the Lynn and Becker (2019b) dataset, and negative numbers indicate a higher IQ derived from the Patel and Sandefur (2020) study.
The four countries with geographically imputed IQs in Lynn and Becker’s (2019b) dataset that have discrepancies of at least 10 IQ points are Paraguay (+ 17.26 IQ points), Senegal (− 15.76 IQ points), Chad (+ 13.92 IQ points), and Niger (+ 10.10 IQ points). In this list, positive numbers indicate a higher QNW + SAS + GEO score in the Lynn and Becker (2019b) dataset, and negative numbers indicate a higher IQ derived from the Patel and Sandefur (2020) study.
The largest discrepancies were for Cambodia (+ 26.4 IQ points), Venezuela (− 23.1 IQ points), Cuba (− 20.6 IQ points), Pakistan (+ 18.4 IQ points), Nicaragua (− 15.9 IQ points), Sri Lanka (+ 15.9 IQ points), Guatemala (− 15.4 IQ points), the Dominican Republic (+ 15.3 IQ points), the Philippines (+ 14.8 IQ points), Kyrgyzstan (+ 13.1 IQ points), Argentina (+ 12.4 IQ points), Haiti (+ 12.2 IQ points), Morocco (− 11.4 IQ points), Mongolia (+ 10.8 IQ points), and the United Arab Emirates (− 10.1 IQ points). In this list, positive numbers indicate a higher QNW + SAS score in the Lynn and Becker (2019b) dataset, and negative numbers indicate a higher IQ derived from the Gust et al. (2022) study. The inclusion of Cuba on this list is due to the use of SERCE 2006 data in the Gust et al. (2022) paper. As I stated earlier in this article, the Cuban data for this test are an outlier and likely fraudulent. This shows that when national IQ discrepancies arise in different datasets, it does not always indicate that Lynn and Becker’s (2019b) data are wrong.
In descending order of the magnitude of the discrepancy, these countries were Honduras (22.62 IQ points lower), Botswana (18.52 IQ points lower), South Africa (13.80 IQ points lower), and Egypt (11.65 IQ points lower).
Testing students one grade higher typical is standard practice for South Africa when administering PIRLS and TIMSS tests.
The Burundi data are clearly an outlier. Patel and Sandefur (2020) reported that 43% of examinees in Burundi met or exceeded the TIMSS low international benchmark in reading, which is typical of PASEC countries (PASEC, 2015, p. 50). The discrepancy between Burundi’s math and reading performance originates in the PASEC data and is not an error in Patel and Sandefur’s conversion of PASEC scores to TIMSS scores.
Pupil age is another factor to consider in making these comparisons. Repeating a grade is much more common in sub-Saharan Africa than it is in Western countries. However, these older pupils score worse on the PASEC than their classmates who have never repeated a grade (PASEC, 2015, pp. 78–81). Unlike testing students in a higher grade, the inclusion of these older students does not increase the countries’ percentages of students who meet the TIMSS low international benchmark.
I only compared mathematics scores here because language differences (e.g., one language being easier to learn to read than another) make comparing reading scores and competency less straightforward than comparing proficiency in mathematics (Gust et al., 2022). Additionally, many children in African learn to read in a non-native language (i.e., Swahili, or a colonial language instead of their local African language), which would be a penalty when comparing reading scores to children in economically developed nations where most children are tested in their native language.
There are three versions of the Raven’s: the Colored Progressive Matrices, Progressive Matrices, and Advanced Matrices (listed in ascending order of difficulty).
Countries with a low NWQ + SAS IQ (≤ 75) based solely on matrix test data are Benin, the Republic of the Congo, Djibouti, Dominica, Eritrea, Ethiopia, The Gambia, Guatemala, Malawi, Mali, Morocco, Namibia, Nepal, Saint Vincent and the Grenadines, Sierra Leone, Somalia, South Sudan, Syria, Tanzania, Yemen, and Zimbabwe.
This is why I have preferred to use the QNW + SAS IQs whenever possible in this article. QNW + SAS IQs are based on the most data and do not include countries with geographically imputed mean IQs.
That is, unless one does not believe that educational performance, life outcomes, health and disease, economic prosperity, and strong civic institutions are important.
References
American Educational Research Association, American Psychological Association, and National Council on Measurement in Education. (2014). Standards for educational and psychological testing. American Educational Research Association
American Psychiatric Association. (2013). Diagnostic and Statistical Manual for Mental Disorders (5th ed.). American Psychiatric Publishing.
American Psychological Association Publications and Communications Board Working Group on Journal Article Reporting Standards. (2008). Reporting standards for research in psychology: Why do we need them? What might they be? American Psychologist, 63(9), 839–851. https://doi.org/10.1037/0003-066x.63.9.839
Angrist, N., Djankov, S., Goldberg, P. K., & Patrinos, H. A. (2021). Measuring human capital using global learning data. Nature, 592(7854), 403–408. https://doi.org/10.1038/s41586-021-03323-7
Attallah, S. E. F., Ahmed, K. Y., & Meisenberg, G. (2014). Factor structure of Wechsler Intelligence Scale for Children-Third Edition (WISC-III) among gifted students in Sudan. Mankind Quarterly, 55, 147–170. https://doi.org/10.46469/mq.2014.55.1.11
Bakhiet, S. F., Albursan, I. S., Al Qudah, M. F., Abduljabbar, A. S., Aljomaa, S. S., Toto, H. S. A., & Lynn, R. (2017). Sex differences on the WISC-III among children in Sudan and the United States. Journal of Biosocial Science, 49(6), 792–797. https://doi.org/10.1017/s0021932016000432
Barnes, J. C., Beaver, K. M., & Boutwell, B. B. (2013). Average county-level IQ predicts county-level disadvantage and several county-level mortality risk rates. Intelligence, 41(1), 59–66. https://doi.org/10.1016/j.intell.2012.10.007
Bauer, P. J. (2020). A call for greater sensitivity in the wake of a publication controversy. Psychological Science, 31(7), 767–769. https://doi.org/10.1177/0956797620941482
Beaver, K. M., Schwartz, J. A., Connolly, E. J., Said Al-Ghamdi, M., Kobeisy, A. N., Barnes, J. C., & Boutwell, B. B. (2016). Intelligence and early life mortality: Findings from a longitudinal sample of youth. Death Studies, 40(5), 298–304. https://doi.org/10.1080/07481187.2015.1137994
Becker, D., Meisenberg, G., Dutton, E., Bakhiet, S. F., Humad, O. A. M., Abdoulaye, H. A., & Ahmed, S. A. E. S. (2022). Factor structure in Raven’s Progressive Matrices Plus in sub-Saharan Africa – Benin and Djibouti. Journal of Psychology in Africa, 32(2), 103–114. https://doi.org/10.1080/14330237.2022.2028080
Becker, D., & Rindermann, H. (2016). The relationship between cross-national genetic distances and IQ-differences. Personality and Individual Differences, 98, 300–310. https://doi.org/10.1016/j.paid.2016.03.050
Belasen, A., & Hafer, R. W. (2013). IQ and alcohol consumption: International data. Intelligence, 41, 615–621. https://doi.org/10.1016/j.intell.2013.07.019
Bernstein, B. O., Lubinski, D., & Benbow, C. P. (2019). Psychological constellations assessed at age 13 predict distinct forms of eminence 35 years later. Psychological Science, 30, 444–454. https://doi.org/10.1177/0956797618822524
Bhatia, C. M. (1955). Performance tests of intelligence under Indian conditions. Oxford University Press.
Bold, T., Filmer, D., Martin, G., Molina, E., Stacy, B., Rockmore, C., Svensson, J., & Wane, W. (2017). Enrollment without learning: Teacher effort, knowledge, and skill in primary schools in Africa. Journal of Economic Perspectives, 31(4), 185–204. https://doi.org/10.1257/jep.31.4.185
Chinapah, V., H’ddigui, E. M., Kanjee, A., Falayajo, W., Fomba, C. O., Hamissou, O. Rafalimanana, A., & Byomugisha, A. (2000). With Africa for Africa: Towards quality education for all. 1999 MLA Project. Human Sciences Research Council. http://files.eric.ed.gov/fulltext/ED444931.pdf
Clark, G. (2007). A farewell to alms: A brief economic history of the world. Princeton University Press.
Deary, I. J., Pattie, A., & Starr, J. M. (2013). The stability of intelligence from age 11 to age 90 years: The Lothian Birth Cohort of 1921. Psychological Science, 24(12), 2361–2368. https://doi.org/10.1177/0956797613486487
Deary, I. J., Strand, S., Smith, P., & Fernandes, C. (2007). Intelligence and educational achievement. Intelligence, 35, 13–21. https://doi.org/10.1016/j.intell.2006.02.001
Deary, I. J., Whiteman, M. C., Starr, J. M., Whalley, L. J., & Fox, H. C. (2004). The impact of childhood intelligence on later life: Following up the Scottish Mental Surveys of 1932 and 1947. Journal of Personality and Social Psychology, 86, 130–147. https://doi.org/10.1037/0022-3514.86.1.130
Dickins, T. E., Sear, R., & Wells, A. J. (2007). Mind the gap(s) in theory, method and data: Re-examining Kanazawa (2006). British Journal of Health Psychology, 12(2), 167–178. https://doi.org/10.1348/135910707X174339
Dutton, E., Becker, D., Osman, H. A., Bakhiet, S. F., Essa, Y. A. S., Ali, H. A. A., Alqafari, S. M., Hamdi, A. H. M. N., & Alfaleh, A. S. H. (2018). The Raven’s test performance of South Sudanese samples: A validation of criticisms of the utility of Raven’s among Sub-Saharan Africans. Personality and Individual Differences, 128, 122–126. https://doi.org/10.1016/j.paid.2018.02.018
Ebbesen, C. L. (2020). Flawed estimates of cognitive ability in Clark et al. Psychological Science, 2020. PsyArXiv. https://psyarxiv.com/tzr8c
Flores-Mendoza, C., Ardila, R., Rosas, R., Lucio, M. E., Gallegos, M., & Retegui Colareta, N. (2018). Intelligence measurement and school performance in Latin America: A report of the Study of Latin American Intelligence Project. Springer.
Floyd, R. G., Reynolds, M. R., Farmer, R. L., & Kranzler, J. H. (2013). Are the general factors from different child and adolescent intelligence tests the same? Results from a five-sample, six-test analysis. School Psychology Review, 42, 383–401. https://doi.org/10.1080/02796015.2013.12087461
Gale, C. R., Batty, G. D., Tynelius, P., Deary, I. J., & Rasmussen, F. (2010). Intelligence in early adulthood and subsequent hospitalization for mental disorders. Epidemiology, 21(1), 70–77. http://www.jstor.org/stable/25662808
Gichuhi, J. (1999). An examination of the Wechsler Intelligence Scale for Children-III for predicting performance on a national examination of grade eight children in public primary schools (Unpublished doctoral dissertation). Biola University, La Mirada, CA.
Glass, G. V. (1976). Primary, secondary, and meta-analysis of research. Educational Researcher, 5(10), 3–8. https://doi.org/10.3102/0013189X005010003
Gust, S., Hanushek, E. A., & Woessmann, L. (2022). Global universal basic skills: Current deficits and implications for world development (Working paper No. 30566). National Bureau of Economic Research. https://www.nber.org/papers/w30566
Haier, R. J. (2017). The neuroscience of intelligence. Cambridge University Press. https://doi.org/10.1017/9781316105771
Haile, D., Gashaw, K., Nigatu, D., & Demelash, H. (2016). Cognitive function and associated factors among school age children in Goba Town, South-East Ethiopia. Cognitive Development, 40, 144–151. https://doi.org/10.1016/j.cogdev.2016.09.002
Hanushek, E. A. (2016). Will more higher education improve economic growth? Oxford Review of Economic Policy, 32, 538–552. https://doi.org/10.1093/oxrep/grw025
Hanushek, E. A., & Kimko, D. D. (2000). Schooling, labor-force quality, and the growth of nations. American Economic Review, 90(5), 1184–1208. https://doi.org/10.1257/aer.90.5.1184
Hart, C. L., Taylor, M. D., Davey Smith, G., Whalley, L. J., Starr, J. M., Hole, D. J., Wilson, V., & Deary, I. J. (2003). Childhood IQ, social class, deprivation, and their relationships with mortality and morbidity risk in later life: Prospective observational study linking the Scottish Mental Survey 1932 and the Midspan Studies. Psychosomatic Medicine, 65(5), 877–883. https://doi.org/10.1097/01.psy.0000088584.82822.86
Hart, C. L., Taylor, M. D., Smith, G. D., Whalley, L. J., Starr, J. M., Hole, D. J., Wilson, V., & Deary, I. J. (2004). Childhood IQ and cardiovascular disease in adulthood: Prospective observational study linking the Scottish Mental Survey 1932 and the Midspan studies. Social Science & Medicine, 59(10), 2131–2138. https://doi.org/10.1016/j.socscimed.2004.03.016
Helms, J. E. (1992). Why is there no study of cultural equivalence in standardized cognitive ability testing? American Psychologist, 47, 1083–1101. https://doi.org/10.1037/0003-066X.47.9.1083
Holahan, C. K., & Sears, R. R. (1995). The gifted group in later maturity. University of Stanford Press.
Holding, P., Anum, A., van de Vijver, F. J., Vokhiwa, M., Bugase, N., Hossen, T., Makasi, C., Baiden, F., Kimbute, O., & Bangre, O. (2018). Can we measure cognitive constructs consistently within and across cultures? Evidence from a test battery in Bangladesh, Ghana, and Tanzania. Applied Neuropsychology: Child, 7, 1–13. https://doi.org/10.1080/21622965.2016.1206823
Hungi, N., Makuwa, D., Ross, K., Saito, M., Dolata, S., van Cappelle, F., Paviot, L., & Vellien, J. (2010). SACMEQ III project results: Pupil achievement levels in reading and mathematics. SACMEQ. http://www.sacmeq.org/sites/default/files/sacmeq/reports/sacmeq-iii/working-documents/wd01_sacmeq_iii_results_pupil_achievement.pdf
Hunt, E., & Carlson, J. (2007). Considerations relating to the study of group differences in intelligence. Perspectives on Psychological Science, 2(2), 194–213. https://doi.org/10.1111/j.1745-6916.2007.00037.x
Hunt, E., & Sternberg, R. J. (2006). Sorry, wrong numbers: An analysis of a study of a correlation between skin color and IQ. Intelligence, 34, 131–137. https://doi.org/10.1016/j.intell.2005.04.004
Irvine, S. H. (1964). A psychological study of selection problems at the end of primary schooling in Southern Rhodesia (Unpublished doctoral dissertation). University of London.
Jensen, A. R. (1969). Reducing the heredity-environment uncertainty: A reply. Harvard Educational Review, 39, 449–483. https://doi.org/10.17763/haer.39.3.4l58240700761019
Jensen, A. R. (1980). Bias in mental testing. The Free Press.
Jensen, A. R. (1998). The g factor: The science of mental ability. Praeger.
Johnson, W., Bouchard Jr, T. J., Krueger, R. F., McGue, M., & Gottesman, I. I. (2004). Just one g: Consistent results from three test batteries. Intelligence, 32, 95–107. https://doi.org/10.1016/S0160-2896(03)00062-X
Johnson, W., te Nijenhuis, J., & Bouchard, T. J. Jr. (2008). Still just 1 g: Consistent results from five test batteries. Intelligence, 36, 81–95. https://doi.org/10.1016/j.intell.2007.06.001
Jones, G. (2016). Hive mind: How your nation’s IQ matters so much more than your own. Stanford University Press.
Kamin, L. J. (2006). African IQ and mental retardation. South African Journal of Psychology, 36(1), 1–9. https://doi.org/10.1177/008124630603600101
Kanazawa, S. (2006). Mind the gap in intelligence: Re-examining the relationship between inequality and health. British Journal of Health Psychology, 11(4), 623–642. https://doi.org/10.1348/135910705X69842
Keith, T. Z., Kranzler, J. H., & Flanagan, D. P. (2001). What does the Cognitive Assessment System (CAS) measure? Joint confirmatory factor analysis of the CAS and the Woodcock-Johnson Tests of Cognitive Ability (3rd edn). School Psychology Review, 30, 89–119. https://doi.org/10.1080/02796015.2001.12086102
Kell, H. J., Lubinski, D., & Benbow, C. P. (2013). Who rises to the top? Early Indicators. Psychological Science, 24(5), 648–659. https://doi.org/10.1177/0956797612457784
Kim, J. Y. (2018). The human capital gap: Getting governments to invest in people. Foreign Affairs, 97(4), 92–101. http://www.jstor.org/stable/44822216
Lean, G., & Clements, M. A. (1981). Spatial ability, visual imagery, and mathematical performance. Educational Studies in Mathematics, 12(3), 267–299. https://doi.org/10.1007/bf00311060
Lim, S. S., Updike, R. L., Kaldjian, A. S., Barber, R. M., Cowling, K., York, H., Friedman, J., Xu, R., Whisnant, J. L., Taylor, H. J., Leever, A. T., Roman, Y., Bryant, M. F., Dieleman, J., Gakidou, E., & Murray, C. J. L. (2018). Measuring human capital: a systematic analysis of 195 countries and territories, 1990–2016. The Lancet, 392(10154), 1217–1234. https://doi.org/10.1016/S0140-6736(18)31941-X
Lubinski, D. (2009). Exceptional cognitive ability: The phenotype. Behavior Genetics, 39(4), 350–358. https://doi.org/10.1007/s10519-009-9273-0
Lubinski, D., Benbow, C. P., & Kell, H. J. (2014). Life paths and accomplishments of mathematically precocious males and females four decades later. Psychological Science, 25(12), 2217–2232. https://doi.org/10.1177/0956797614551371
Lynn, R. (2015). Race differences in intelligence: An evolutionary analysis (2nd ed.). Washington Summit Publishers.
Lynn, R., & Becker, D. (2019a). The intelligence of nations. Ulster Institute for Social Research.
Lynn, R., & Becker, D. (2019b). The NIQ-Dataset version 1.3.3. https://viewoniq.org/
Lynn, R., & Meisenberg, G. (2010). National IQs calculated and validated for 108 nations. Intelligence, 38, 353–360. https://doi.org/10.1016/j.intell.2010.04.007
Lynn, R., & Vanhanen, T. (2002). IQ and the wealth of nations. Praeger.
Lynn, R., & Vanhanen, T. (2006). IQ and global inequality. Washington Summit Publishers.
Lynn, R., & Vanhanen, T. (2012). Intelligence: A unifying construct for the social sciences. Ulster Institute for Social Research
MacArthur, R. S., Irvine, S. H., & Brimble, A. R. (1964). The Northern Rhodesia mental ability survey 1963. Rhodes-Livingstone Institute.
Makel, M. C., Kell, H. J., Lubinski, D., Putallaz, M., & Benbow, C. P. (2016). When lightning strikes twice: Profoundly gifted, profoundly accomplished. Psychological Science, 27(7), 1004–1018. https://doi.org/10.1177/0956797616644735
McFie, J. (1954). African performance on an intelligence test. Uganda Journal, 18, 34–43.
Miezah, D. (2015). Validation of the Wechsler Adult Intelligence Scale-Fourth Edition (WAIS-IV) in the Ghanian population. University of Ghana.
Murray, C. (2002). IQ and income inequality in a sample of sibling pairs from advantaged family backgrounds. American Economic Review, 92(2), 339–343. https://doi.org/10.1257/000282802320191570
Nyborg, H. (2012). The decay of Western civilization: Double relaxed Darwinian selection. Personality and Individual Differences, 53(2), 118–125. https://doi.org/10.1016/j.paid.2011.02.031
Oesterdiekhoff, G. W. (2012). Was pre-modern man a child? The quintessence of the psychometric and developmental approaches. Intelligence, 40(5), 470–478. https://doi.org/10.1016/j.intell.2012.05.005
Oficina Regional de Educación para América Latina y el Caribe/UNESCO. (2001). Primer studio internactional comparative sobre lenguaje, matématica y factores asociados, para alumnus del tercer y cuarto grado de la educación básica: Informe técnico. UNESCO. https://unesdoc.unesco.org/ark:/48223/pf0000149268
Organization for Economic Cooperation and Development. (2016). Skills matter: Further results from the Survey of Adult Skills. OECD Publishing. https://doi.org/10.1787/9789264258051-en
Organization for Economic Cooperation and Development. (2019). PISA 2018 results (Volume I): What students know and can do. OECD Publishing. https://doi.org/10.1787/5f07c754-en
Panza Lombardo, M. A. (2016). Transferencia y permanencia del entrenamiento dual n-back en la inteligencia fluida y en la memoria de trabajo verbal [Transfer and permanence of dual n-back training on fluid intelligence and verbal working memory]. Psicodebate, 16, 49–82.
Patel, D., & Sandefur, J. (2020). A Rosetta Stone for human capital. Center for Global Development. https://www.cgdev.org/publication/rosetta-stone-human-capital
Piffer, D. (2013). Correlation of the COMT Val158Met polymorphism with latitude and a hunter-gather lifestyle suggests culture–gene coevolution and selective pressure on cognition genes due to climate. Anthropological Science, 121, 161–171. https://doi.org/10.1537/ase.130731
Pokropek, A., Marks, G. N., & Borgonovi, F. (2022). How much do students’ scores in PISA reflect general intelligence and how much do they reflect specific abilities? Journal of Educational Psychology, 114(5), 1121–1135. https://doi.org/10.1037/edu0000687
Pritchett, L., & Viarengo, M. (2021). Learning outcomes in developing countries: Four hard lessons from PISA-D. RISE Working Paper No. 21/069. https://doi.org/10.35489/BSG-RISE-WP_2021/069
Programme d’Analyse des Systèmes Éducatifs de la Confemen. (2015). PASEC2014 education system performance in francophone sub-Saharan Africa: Competencies and learning factors in primary education. Programme d’Analyse des Systèmes Éducatifs de la Confemen. https://www-pasec-confemen-org.translate.goog/wp-content/uploads/2015/12/Rapport_Pasec2014_GB_webv2.pdf
Raven, J., Raven, J. C., & Court, J. H. (1998). Manual for Raven’s Progressive Matrices and Vocabulary Scales, Section 1: General overview. Pearson.
Richardson, K. (2002). What IQ tests test. Theory & Psychology, 12, 283–314. https://doi.org/10.1177/0959354302012003012
Rindermann, H. (2018a). Cognitive capitalism: Human capital and the wellbeing of nations. Cambridge University Press. https://doi.org/10.1017/9781107279339
Rindermann, H. (2018b). Appendix. https://www.tu-chemnitz.de/hsw/psychologie/professuren/entwpsy/team/rindermann/pdfs/RindermannCogCapAppendix.pdf
Rindermann, H., Becker, D., & Coyle, T. R. (2017). Survey of expert opinion on intelligence: The FLynn effect and the future of intelligence. Personality and Individual Differences, 106, 242–247. https://doi.org/10.1016/j.paid.2016.10.061
Rindermann, H., & Carl, N. (2020). The Good Country Index, cognitive ability and culture. Comparative Sociology, 19(1), 39–68. https://doi.org/10.1163/15691330-12341521
Rindermann, H., Falkenhayn, L., & Baumeister, A. E. E. (2014). Cognitive ability and epistemic rationality: A study in Nigeria and Germany. Intelligence, 47, 23–33. https://doi.org/10.1016/j.intell.2014.08.006
Ritchie, S. J., & Tucker-Drob, E. M. (2018). How much does education improve intelligence? A Meta-Analysis. Psychological Science, 29(8), 1358–1369. https://doi.org/10.1177/0956797618774253
Roth, B., Becker, N., Romeyke, S., Schäfer, S., Domnick, F., & Spinath, F. M. (2015). Intelligence and school grades: A meta-analysis. Intelligence, 53, 118–137. https://doi.org/10.1016/j.intell.2015.09.002
Ruffieux, N., Njamnshi, A. K., Mayer, E., Sztajzel, R., Eta, S. C., Doh, R. F., Kengne, A.-M., Ngamaleu, R. N., Chanal, J., & Verdon, V. (2009). Neuropsychology in Cameroon: First normative data for cognitive tests among school-aged children. Child Neuropsychology, 16(1), 1–19. https://doi.org/10.1080/09297040902802932
Sandefur, J. (2018). Internationally comparable mathematics scores for fourteen African countries. Economics of Education Review, 62, 267–286. https://doi.org/10.1016/j.eonedurev.2017.12.003
Schleicher, A. (2019). PISA 2018: Insights and interpretations. Organisation for Economic Co-operation and Development. https://www.oecd.org/pisa/PISA%202018%20Insights%20and%20Interpretations%20FINAL%20PDF.pdf
Sear, R. (2022). ‘National IQ’ datasets do not provide accurate, unbiased or comparable measures of cognitive ability worldwide. PsyArXiv. https://doi.org/10.31234/osf.io/26vfb
Sen, A., Jensen, A. R., Sen, A. K., & Arora, I. (1983). Correlation between reaction time and intelligence in psychometrically similar groups in America and India. Applied Research in Mental Retardation, 4(2), 139–152. https://doi.org/10.1016/0270-3092(83)90006-1
Serpell, R., & Jere-Folotiya, J. (2008). Developmental assessment, cultural context, gender, and schooling in Zambia. International Journal of Psychology, 43(2), 88–96. https://doi.org/10.1080/00207590701859184
Sörberg, A., Allebeck, P., & Hemmingsson, T. (2014). IQ and somatic health in late adolescence. Intelligence, 44, 155–162. https://doi.org/10.1016/j.intell.2014.04.002
Songy, D. G. (2007). Predicting success in academic achievement of major seminarians in Papua New Guinea: A comparison of cognitive test results and grade point averages. Contemporary PNG Studies, 7, 59–71.
Spearman, C. (1927). The abilities of man: Their nature and measurement. The Macmillan Company.
Starr, J. M., Deary, I. J., Lemmon, H., & Whalley, L. J. (2000). Mental ability age 11 years and health status age 77 years. Age and Ageing, 29, 523–528. https://doi.org/10.1093/ageing/29.6.523
Stauffer, J. M., Ree, M. J., & Carretta, T. R. (1996). Cognitive-components tests are not much more than g: An extension of Kyllonen’s analyses. The Journal of General Psychology, 123, 193–205. https://doi.org/10.1080/00221309.1996.9921272
Stevens, S. S. (1946). On the theory of scales of measurement. Science, 103(2684), 677–680. https://doi.org/10.1126/science.103.2684.677
Templer, D. I., & Arikawa, H. (2006). Temperature, skin color, per capita income, and IQ: An international perspective. Intelligence, 34, 121–139. https://doi.org/10.1016/j.intell.2005.04.002
Terman, L. M., & Oden, M. H. (1947). Genetic studies of genius: (Vol. IV). Twenty-five years’ follow-up of a superior group. Stanford University Press.
Terman, L. M., & Oden, M. H. (1959). Genetic studies of genius: (Vol. V). Thirty-five years’ follow-up of the superior child. Stanford University Press.
Terracciano, A., Abdel-Khalek, A. M., Ádám, N., Adamovová, L., Ahn, C. K., Ahn, H. N., Alansari, B. M., Alcalay, L., Allik, J., Angleitner, A., Avia, M. D., Ayearst, L. E., Barbaranelli, C., Beer, A., Borg-Cunen, M. A., Bratko, D., Brunner-Sciarra, M., Budzinski, L., Camart, N., & McCrae, R. R. (2005). National character does not reflect mean personality trait levels in 49 cultures. Science, 310(5745), 96–100. https://doi.org/10.1126/science.1117199
Thompson, B., & Vacha-Haase, T. (2000). Psychometrics is datametrics: The test is not reliable. Educational and Psychological Measurement, 60(2), 174–195. https://doi.org/10.1177/0013164400602002
United Nations Office on Drugs and Crime. (2022). Intentional homicide. https://dataunodc.un.org/sites/dataunodc.un.org/files/data_cts_intentional_homicide.xlsx
Vacha-Haase, T., Kogan, L. R., & Thompson, B. (2000). Sample compositions and variabilities in published studies versus those in test manuals: Validity of score reliability inductions. Educational and Psychological Measurement, 60(4), 509–522. https://doi.org/10.1177/00131640021970682
van den Briel, T., West, C. E., Bleichrodt, N., van de Vijver, F. J. R., Ategbo, E. A., & Hautvast, J. G. A. J. (2000). Improved iodine status is associated with improved mental performance of schoolchildren in Benin. The American Journal of Clinical Nutrition, 72(5), 1179–1185. https://doi.org/10.1093/ajcn/72.5.1179
Wai, J. (2014). Experts are born, then made: Combining prospective and retrospective longitudinal data shows that cognitive ability matters. Intelligence, 45, 74–80. https://doi.org/10.1016/j.intell.2013.08.009
Warne, R. T. (2020). In the know: Debunking 35 myths about human intelligence. Cambridge University Press. https://doi.org/10.1017/9781108593298
Warne, R. T. (2021). Statistics for the social sciences: A general linear model approach (2nd ed.). Cambridge University Press.
Warne, R. T. (2022). Tests of measurement invariance of three Wechsler intelligence tests in economically developing nations in south Asia and sub-Saharan Africa. Manuscript submitted for publication.
Warne, R. T., & Burningham, C. (2019). Spearman’s g found in 31 non-Western nations: Strong evidence that g is a universal phenomenon. Psychological Bulletin, 145(3), 237–272. https://doi.org/10.1037/bul0000184
Warne, R. T., & Burton, J. Z. (2020). Beliefs about human intelligence in a sample of teachers and nonteachers. Journal for the Education of the Gifted, 43(2), 143–166. https://doi.org/10.1177/0162353220912010
Warne, R. T., Yoon, M., & Price, C. J. (2014). Exploring the various interpretations of “test bias.” Cultural Diversity and Ethnic Minority Psychology, 20, 570–582. https://doi.org/10.1037/a0036503
Wolf, T. H. (1973). Alfred Binet. The University of Chicago Press.
Wicherts, J. M., Dolan, C. V., Carlson, J. S., & van der Maas, H. L. J. (2010a). Raven’s test performance of sub-Saharan Africans: Average performance, psychometric properties, and the Flynn Effect. Learning and Individual Differences, 20(3), 135–151. https://doi.org/10.1016/j.lindif.2009.12.001
Wicherts, J. M., Dolan, C. V., Carlson, J. S., & van der Maas, H. L. J. (2010b). Another failure to replicate Lynn’s estimate of the average IQ of sub-Saharan Africans. Learning and Individual Differences, 20, 155–157. https://doi.org/10.1016/j.lindif.2010.03.010
Wicherts, J. M., Dolan, C. V., & van der Maas, H. L. J. (2010c). A systematic literature review of the average IQ of sub-Saharan Africans. Intelligence, 38(1), 1–20. https://doi.org/10.1016/j.intell.2009.05.002
Wicherts, J. M., Dolan, C. V., & van der Maas, H. L. J. (2010d). The dangers of unsystematic selection methods and the representativeness of 46 samples of African test-takers. Intelligence, 38(1), 30–37. https://doi.org/10.1016/j.intell.2009.11.003
Williams, R. T., Polanin, J. R., & Pigott, T. D. (2017). Meta-analysis and reproducibility. In Toward a more perfect psychology: Improving trust, accuracy, and transparency in research. (pp. 255–270). American Psychological Association. https://doi.org/10.1037/0000033-016
Woodberry, K. A., Giuliano, A. J., & Seidman, L. J. (2008). Premorbid IQ in schizophrenia: A meta-analytic review. The American Journal of Psychiatry, 165(5), 579–587. https://doi.org/10.1176/appi.ajp.2008.07081242
Zaboski, B. A., II., Kranzler, J. H., & Gage, N. A. (2018). Meta-analysis of the relationship between academic achievement and broad abilities of the Cattell-Horn-Carroll theory. Journal of School Psychology, 71, 42–56. https://doi.org/10.1016/j.jsp.2018.10.001
Zagorsky, J. L. (2007). Do you have to be smart to be rich? The impact of IQ on wealth, income and financial distress. Intelligence, 35, 489–501. https://doi.org/10.1016/j.intell.2007.02.003
Zhao, Q., Wang, X., & Rozelle, S. (2019). Better cognition, better school performance? Evidence from primary schools in China. China Economic Review, 55, 199–217. https://doi.org/10.1016/j.chieco.2019.04.005
Zisman, C., & Ganzach, Y. (2022). The claim that personality is more important than intelligence in predicting important life outcomes has been greatly exaggerated. Intelligence, 92, Article 101631. https://doi.org/10.1016/j.intell.2022.101631
Author information
Authors and Affiliations
Contributions
The author is solely responsible for the theoretical analysis, identification of secondary data, analysis of data, and writing of the manuscript.
Corresponding author
Ethics declarations
Ethics Approval
No ethics approval was necessary because the article does not describe human subjects' research. All data analyzed were publicly available archival data.
Consent to Participate
Informed consent to participate was not necessary because the article does not describe human subjects' research. All data analyzed were publicly available archival data.
Consent to Publish
All data in this article were aggregate data publicly available from online datasets. Consent to publish is not necessary.
Conflict of Interest
The author declares no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Warne, R.T. National Mean IQ Estimates: Validity, Data Quality, and Recommendations. Evolutionary Psychological Science 9, 197–223 (2023). https://doi.org/10.1007/s40806-022-00351-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40806-022-00351-y