View on IQ

A Corrigendum to V1.3.2 and a Comment to V1.3.3

Preface

The publication of the book written by Richard Lynn and me (Lynn & Becker, 2019) and of V1.3.2 of the NIQ-dataset got a lot of attention and feedback. I’m very grateful for any positive or negative comments as they help me to further improve my work. Although I wanted to release the next version later and with more progress, some of the feedback forced me to rush forward to V1.3.3. I would like to use the release of this latest version to respond to some of the most common critiques and questions. The most important of these should be discussed here. My special thanks go to Ronald Henss and Jared Huggins, who have pointed out a series of mistakes and errors, which could then be corrected.

The very low NIQ scores for some countries were the most criticized aspect. Overall, there were nine countries in V1.3.2 in which measured NIQ scores fell below 60. According to the International Statistical Classification of Diseases and Related Health Problems, such scores would be within a range (70-35) classified as indicators for a mild or moderate mental retardation (ICD-10 F70-F79) (DIMDI, 2012). However, Jensen (1989, p.367-369) showed that Black children in the USA with an IQ<70 showed less abnormal behaviour than white children with comparable scores, which raise doubts about the universality of this rule for populations with different ethnic backgrounds. Critics also pointed out that much higher scores were reported for the same countries by Lynn and Vanhanen (2012) and even these have previously been criticized as “too low” by Wicherts, Dolan and van der Maas (2010).

Four questions arise at this point: (1) Why are there such significant differences between IQs estimated by Lynn and Vanhanen and in the NIQ-dataset? (2) Would it be possible that a country has a population with a mean IQ in a range <70 or even <60? (3) How meaningful are the results, provided by the available tests, in such a low range? (4) Would have IQ differences in that range implications in the real-world? In the further course I would like to try to give some answers.

Causes of IQ-differences between datasets

Table 1: List of countries with NIQ<60 in V1.3.2

Notes: “PMM” = from psychometric measurements; “SAS” = from school assessment studies; “GEO” = means of neighboring countries

The first question can actually be answered very quickly: One of the major differences between the methods of Lynn and Vanhanen and those used in the NIQ dataset is the use of conversion formulas to convert raw to IQ scores in the case of the Raven’s Matrices. These formulas also take into account variations in the IQ scores <60, whereas Lynn and Vanhanen used the norm tables in the Raven’s Manuals for conversion, which do not go in areas below the 0.10 or 1.00 BP (British Percentile) respectively below IQs of 55 or 60. To clarify this issue, Fig.1 shows an excerpt of the SPM norms from the British standardization in 1979, taken from Raven (2000, Table B1). If two samples, both with a mean age of 7 years, would obtain mean raw scores on the SPM of 9 (blue) and 6 (red), both samples would be at the 1.00 BP and equivalent to an IQ of around 65, despite their raw score difference of 3 points.

Fig.1 British norms (1979) for SPM (excerpt)

Source: from Raven (2000, Table B1)

In contrast, the formulas used in the NIQ-dataset describe the relationship between raw and IQ scores as shown by the trendline and formula in Fig.2. This made it possible to extrapolate above or below the ranges covered by the norm tables. By applying the formula displayed in Fig.2, a raw score of 9 would now be equivalent to an IQ score of 69.37 whereas a raw score of 6 would now be equivalent to an IQ score of 53.93.

Fig.2 Relationship between SPM raw scores and IQ scores for 7 years olds in the British norm sample from 1979

Notes: Blue dots represent numbers from the norm table (Raven, 2000, Table B1); dotted line represents trend according to formula displayed; blue and red arrows represent relationships at 6 and 9 raw scores

The use of the above-mentioned method, in particular the exceeding of the norm range of the tests, can be criticized. In the following, I would therefore first like to disclose its effects by detailed description of the calculations for some real low-IQ countries.

Ghana is an interesting case, since this country participated in TIMSS 2003, 2007 and 2011 (Martin, Mullis & Foy, 2008; Martin et al., 2012, 2014; Mullis, Martin & Foy, 2008; Mullis et al., 2004, 2012) and therefore one of the few Sub-Sahara African countries in international school assessment studies at all. It obtained mean scores of 250.45 in 2003, 284.99 in 2007 and 300.96 in 2011. By transforming these scores, by mean and standard deviation of the United Kingdome, IQ equivalents of 48.62, 55.33 and 60.11 are the results. The average score of 54.69 is much closer to the country’s psychometric NIQ of 61.63 in V1.3.2 than to the psychometric IQ of 70 from Lynn and Vanhanen. This shows that an NIQ<60 would be quite conceivable.

The three sources for Gambia, with two samples per source available, all used Raven’s Matrices, one times the SPM and two times the CPM (Alderman et al., 2014; Jukes & Grigorenko, 2010; Jukes et al. 2006). On the SPM, raw scores were 14.50 and 14.65, on the CPM, raw scores were between 8.34 and 9.29. However, all six samples aged 17 to 20 years and were therefor above the age range of 6 to 16 years and 4 to 12 years for which the tests used are intended (Raven, 1981, 2008a). After conversions to the scale of the APM according to Raven, Raven and Court (1998, Table APM34) and Raven, Raven & Court (2006, Tab.26), all these scores would be below an APM raw score of 1.00 and therefore below the 0.38s BP, were the norm stables stopped. As a result, Lynn gave the samples an IQs of 60 to 64 and the V1.3.2 between 45.96 and 55.57.

Similar applies to Guatemala. Here, we have seven samples from four sources (Calderon & Hoddinott, 2010; Stein et al., 2005; Choudhury & Gorman, 1999; Martorell et al., 2005), six measured wit SPM and one with the CPM. Once again, in case of the six samples measured with SPM, sample ages were above the age range of 6 to 16 years, with 17 to 38 years, and had to be transformed to the APM scale, which result in raw scores below 1 and IQs between 33.29 and 53.68. In contrast, the sample measured with the CPM had a mean age of around 8 years which makes a conversion into another Raven’s scale redundant. The CPM raw score of 18.43 is equivalent to an IQ score of 69.25 and very close to the estimate of 70 by Lynn and Vanhanen. The discrepancy between the IQ scores, which resulted from transformation into APM scale, and the one from the CPM administration could indicate that the very low scores result from the APM transformation. However, the CPM sample is also different in age and it can not be ruled out that such age differences in intelligence exist in Guatemala actually.

The NIQ dataset has only one sample for Mali (Dramé & Ferguson, 2017), which has not been used by Lynn and Vanhanen. However, the NIQ score is 59.76, close to 60 and therefore less problematic.

Within the three samples available from Nicaragua, two were tested by CPM and one by the WISC-IV. On the WISC-IV, a FS-IQ of 67 was reported directly and corrected for FLynn-Effect to 63.48 (Rodríguez, 2012). IQs from the CPM are 49.75 and 54.86 and therefore much lower, but had to go through a whole series of conversions. Sandiford et al. (1997) reported a CPM raw score of 8.24 for a sample of females which completed primary school and had an age of around 35 years. However, the source reported that only the Set AB was applied. According to Raven, Raven and Court (2006, Tab.8), a raw score of 8.24 on Set AB would be equivalent to a raw score of 23.00 to 24.00 on the full CPM. Since the age of the sample was above the CPM norms, this score had to be transformed to 23.00 to 25.00 on the SPM scale and this to 1 to 3 on the APM scale. By using the conversion formulas, the APM raw score is 1.79, equivalent to an IQ of 55.91. A second sample of females which not completed primary school get, after the same calculations, an APM raw score of 0.45, equivalent to an IQ of 50.80. Lynn and Vanhanen estimated an IQ of 84 for Nicaragua but only by the geographic mean. Although the Wechsler IQ is eleven or 16 points above the CPM and also clearly above the limit of 60, it is still 17 points below the geographic average. Problems with the large number of necessary conversions and transformations could also be a cause here for the very low scores.

For Nepal, four sources gave scores for overall nine samples, all measured with the CPM (Buckley et al., 2013; Christian et al., 2010; Jamison & Lockheed, 1985; Jamison & Moock, 1984). All samples obtained IQs<60 and with between 38.90 and 51.20 some of the lowest scores calculated within the whole dataset. Once again, as in the case of Nicaragua, the high ages made transformations to SPM and APM necessary again. Reported CPM raw scores are between 12.98 and 17.78. The transformation to the SPM scale, based on information of Raven, Raven and Court (2006, Tab.26), did not change this range and on the APM scale, all raw scores are <1. In contrast to Nicaragua, there is no evidence that only one or two sets of the CPM were administered. Reported standard deviations are between 4 and 6, which indicated the use of the whole CPM. Jamison and Lockheed (1985) named the test used “Raven’s Progressive Matrices”. At first it was not clear which Raven’s Test was used, even if an item range from 0 to 36 was given. This could have meant that the APM or an abridged version of the SPM had been used. However, Jamison and Moock (1984) cited the manual of the CPM and since there is a common author and the structure and content of both studies are very similar, this rather indicates the use of CPM in both.

The study of Berry (1966) is the only source for intelligence in Sierra Leone. It reported CPM scores of 13.10 and 13.90 for two samples with mean ages of 25 years. Applying the same procedure as described for Nicaragua and Nepal, APM scores would be <1 and IQ scores 44.64 and 46.38. The discrepancy between the NIQ score and the one from Lynn and Vanhanen is almost 20 scores. The source reported that the “coloured series A, Ab, and B was administered untimed; maximum score: 36.”

South Sudan was not listed as a separate country by Lynn and Vanhanen (2012) due to its, at that time, recent independence. This has the positive effect that the measurements available to this country are all quite up to date, with years of administration between 2009 and 2017, however they all consist of refugee children. Sources are Ahmed et al. (2017) and Osman et al. (2017a, 2018). Two samples were measured with the SPM but have to be split into four since the ages reached up to 17 years and thus beyond the intended range of the SPM. Sample one has a mean age of 10.50 years with a range from 6 to 15 years. Raw scores were given for ten age groups and on average 13.80. According to the SPM norm table this score would be below the 1st BP and an IQ of <65. The application of the conversion formula results in an IQ of 59.57. However, in the NIQ-dataset the IQs were calculated for the ten age groups separately and were between 61.79 at the age of 9 and 72.02 at the age of 8, averaged to 66.33. The same procedure applicated on sample two, with a mean age of 11 and a range from 7 to 15 years, gives a mean raw score of 13.17, which would be again below the 1st BP and an IQ of <65 according to the norm table and 63.43 by formula. Again, in the NIQ-dataset the IQs were calculated for age groups separately and were between 59.67 at the age of 14 and 77.21 at the age of 7, averaged to 65.66. The two hereof separated sub-samples have mean ages of 17 with ranges from 16 to 18 and SPM raw scores of 20.54 and 22.27 were therefore converted to 0.33 and 1.01 on the APM scale, equivalent to IQs of 53.17 and 56.16 or 53.52 and 56.19 if age groups were handled separately. An additional sample had a mean age of 6.5 years with a range from 6 to 7 and scored on average 13.79 on the CPM, which would be at the 9th BP and equivalent to an IQ of 80 according to the norm table, or 78.67 according to conversion formulas. The higher score of the fifth sample might be caused by its lower age, since it is known that in in developing countries, IQs are lower in higher ages due to a delayed rate of cognitive development (Bakhiet et al., 2018).

Many of the transformations and corrections applied in the NIQ dataset could be potential causes of errors. E.g. it could be argued that the two- to three-step transformations make the final results so low. What would happen if the matter were approached much more cautious? This should be checked on the sample with the ID “NPL1673” from Nepal. It has a mean age of 19 years and obtained a CPM raw score of 12.98, rounded to 13 for the sake of simplicity. 19 years is far above the highest age of the CPM norms but still not in an age range where cognitive backward development is significant. So, if we applied the CPM norms of the highest age group, the 11-12 years old, the raw score of 13 would be, according to the norm table (Raven, 2008a, Table A.1), equivalent to the 0.10 BP and an IQ<60. By transforming the CPM raw score of 13 to the SPM scale (Raven, Raven & Court, 2006, Table 26), it would be 12, which in turn would be below the 1.00 BP if norms of the highest age group of the SPM, the 15-16 years old, was used (Raven, 2000, Table B1). If the SPM+ was used instead the APM, a SPM raw score of 12 would be equivalent to an SPM+ raw score of 9 (Raven, Raven & Court, 1999, Table 10). Now, we can use the age group of the 18-19 years old that fits well to the sample. However, once again, the SPM+ raw score is in the range of the 0.10 BP and therefore below an IQ of 55 (Raven, 2008b, Table A.1). A CPM raw score could also be obtained on only two of the three sets of the CPM, even if there are no notifications about this in the sources. 13 CPM raw scores on sets A and AB would be equivalent to 17 on the full CPM. By using the same procedure as before, a CPM raw score of 17 would be on the 2.00 BP of the CPM norms for 11-12 years olds, equivalent to an IQ of around 69, which is much higher but still relatively low, equivalent to an SPM raw score of 16 and below the 0.10 BP for the SPM norms of the 15-16 years old, and 12 on the scale of the SPM+ and equivalent to an IQ<55 for the age group of the 18-19 years old on the SPM+. So, even if no or other transformations would be used or generous assumptions would be made, the IQ would always be similarly low.

Are mean country IQs in very low ranges conceivable?

All the examples above show that the NIQ scores <60 are most likely not based on errors in the calculation or interpretation of the numbers and methods mentioned in the sources.
So, the next question is, how credible are such scores? There are two ways to test this: (1) empirically by comparing them with scores from other and independent measurements, and (2) verifying the accordance of estimated NIQ scores and real-life phenomena within the countries and societies.

For the first approach, it can be referred to results from international school assessment studies (SAS), presented for all countries with an NIQ[SAS]<70 in Table 2 and converted into Greenwich IQ scores. All the national SAS-IQs are clearly below the psychometric NIQs which would rather be an indication of a psychometric overestimation. Meanwhile, the correlation between both IQ estimates is still positive (r=.68; N=7; p=.093), albeit in a slightly lower magnitude than in the groups of all countries (r=.78; N=82; p<.001). TIMSS and PIRLS do not measure intelligence directly, but skills and knowledge acquired by schooling in the fields of mathematics, sciences (TIMSS) and in reading (PIRLS). Thus, a significant positive effect of higher quality of national educational systems, in particular “international differences in student competence, including the amount of preschool education, student discipline, quantity of education, attendance at additional schools, early tracking, the use of centralized exams and high-stakes tests, and adult educational attainment”, was assumed by Rindermann and Ceci (2009) as well as distorting effects due to the factors literacy, extracurricular education (e.g. by parents or family in general), time in school and the general education level of the population by Rindermann (2006). But to explain the difference between the two measuring methods by differences in learning and living conditions would be premature, since effects of quantity and quality of schooling on Raven’s Test results are also documented (Falch & Massih, 2011; Ahmadpoura & Mujembari, 2015), as well as effects of parental education (Rindermann & Carl, 2017). Additionally, suboptimal living conditions, as they are normal in developing countries for large parts of the population, may influence performances on Raven’s tests too, e.g. psychological burdens due to living in poverty and bad nutrition (Mani et al., 2013; Whaley et al., 2003). Although at this point the Ecological Fallacy prevents us from the conclusion that psychometric and SAS-IQs at the cross-national level are similar affected by the same factors, the argument is strong enough to consider this possible.

Table 2 Compared NIQs from school assessment studies and psychometric measurements for countries with NIQ[SAS]<70

Notes: NIQ[SAS]: M=62.46; SD=5.52; NIQ[QNW]: M=76.07; SD=9.50; rNIQ[SAS]↔NIQ[QNW]=.68; N=7; p=.093; Sources for NIQ[SAS]: Beaton et al. (1997); Martin et al. (1997, 2000, 2004, 2012); Mullis et al. (1997, 2000, 2003, 2004, 2007, 2012a,b, 2016a,b,c,d, 2017); Mullis, Martin & Foy (2008)

For approach number two, descriptions of the degree of rational thinking, defined as the ability to find ways for successful problem solving in a certain situation (Rescher, 1993, 109-143) and which should prevail in societies with allegedly low IQ and form their cultural phenotypes, are appropriate. There is evidence from Piagetian cross-cultural psychology (PCCP) (Hallpike, 1979), that individuals in pre-modern societies are more similar to children in modern societies in terms of cognitive status and therefore rational thinking (Lurija, 1982; Oesterdiekhoff, 2013), which implies a certain correspondence between ontogenetic cognitive development of individuals and phylogenetic cultural development of populations (Oesterdiekhoff, 2012a,b, 2016). In conclusion, individuals in low-IQ countries should tend more often to decisions that do not lead to the intended goal or to modes of behavior and thinking that contradict reason, especially since tests used by Piaget or in PCCP and IQ tests as Raven’s both measured, though in different ways, abstract and logical thinking.

The human rights organization Under The Same Sun collected various reports about tens of thousands of cases of persecution, witchcraft judgments, tortures and ritual murders on albinos in Tanzania, Burundi, Malawi, Mozambique and the Democratic Republic of Congo and even the sale of their body parts to African leaders, due to deep-rooted cultural myths about the conferral of special powers through the use of these body parts in magical rituals (Redfern, 2010; Masakhwe, 2009; Baker, 2017). A recent news story published by the BBC on June 7th 2019 reported that in Mozambique, five bald men were beheaded because the murders thought they had gold hidden in their heads, which could then extract by the use of magical powers, and a police spokesman was cited: “Our preliminary conclusion indicates that the phenomenon is due to cultural beliefs.” Also heads of state, who are often well-educated and should be in the upper cognitive ranks, show such a behavior. The former Gambian President Yahya Jammeh claimed to be able to cure AIDS by rubbing in a “green herbal paste onto the ribcage of the patient” (The Associated Press, 2007). The former President of South Africa, Jacob Zuma, had sex with a person he knew she was HIV positive and thought it could prevent him from infection by “showered immediately after having sex” (Evers, 2007), and Zuma’s predecessor Thabo Mbeki, which for a while led the Ministry of Health, portray medicine used in antiretroviral therapies as ‘poison’ and supported alternative traditional medicines instead (Nattrass, 2008).

The collection of curious news can not provide a representative picture of the situation and quantifications are required. In Nigeria, traditional believes as animism and voodoo are far more common than e.g. in Germany and caused real-life consequences, as traffic accidents, persecution and murder for witchcraft, spread of diseases as HIV and AIDS, and many believe that “God is able to put money inside a person’s empty pocket” or gives a god-believing person an age above 140 (Rindermann, Falkenhayn & Baumeister, 2014). From Ghana it has been reported that, depending on the region of residence, 16 to 48% of the people believe that witchcraft as the mean cause for spreading AIDS, surprisingly with higher percentages in people with primary and secondary education than in those with no education (Tenkorang et al., 2011). A study about superstitious beliefs in Mali reported that 97% of the asked sample believe that the order twins leaving the womb of their mother define their ranks in the family, 95% that dogs crying repeatedly due to the sight of ghosts, 90% that members of the Bozo and Dogon ethnicities should not marry each other, 88% that a not circumcised male is unable to see the Komo-spirit and 78% that a woman who is unable to see this spirit is damned to die, 58% that looking into a mirror while raining causes thunder strikes and so one (Dissa et al., 2017). Also here, real-life consequences in health and wealth are reported and the level of education seems not to affect the prevalence of these beliefs significantly.

Of course, even in Western societies irrational believes are still present, e.g. in the field of homeopathy, were, for example, a survey from Germany in 2014 reported that 60% of the asked people have used homeopathic medicine at least once and 48% of those that the treatment had always the desired effect plus 39% which answered that it helped them sometimes (de Sombre, 2014). However, the search for newspaper reports from Germany, Sweden or Poland about beheadings due to the refusal of eating globules remained so far unsuccessful. The question of intensity and orthodoxy by which such ideas are held and practiced with real-life consequences by a large share of a population must be asked.

60 or 70?

The credibility of the NIQ scores <60 or <70 must be evaluated independently from the credibility of the variances that occur in this area. If these variances describe real differences in intelligence rather than measurement errors, correlations to non-IQ variables would have to be found within the group of NIQ<60 and <70 countries, as they occur on the global scale. This was tested in Table 3a and b.

In 3a, six non-IQ variables, which generally show very strong correlations to NIQ and whose data are also available in sufficient numbers for countries of the group NIQ<60, were selected and correlated with NIQ[QNW]. The use of the SAS-IQs was therefore omitted, as there were too few countries in the group of NIQ<60 available and we also wanted to put the focus on the psychometric measurements, because only here the above-mentioned problems appeared. The pattern found is remarkable. All correlations from the global scale changed their direction if countries with an NIQ>60 were excluded. Even more, the correlations in the group of NIQ>60 increased in strength in five of six cases. This is congruent with findings in Lynn and Becker (2019, Chapter 2), where many correlations became stronger when all IQs of all countries NIQ<60 were corrected to 60. This is a strong indicator that the variances in the range NIQ<60 are actually more of a measurement error than real differences in intelligence. However, this is not fully clarified, as it may be that in poor developed countries the relations between intelligence and non-IQ variables may indeed be different than in more developed countries. Aluko and Ajayi (2017) showed that some correlations with NIQ show the same direction in 25 countries from sub-Sahara Africa as on the global scale, e.g. with income (r=.59), financial openness (r=.46) and institutional quality (r=-.37), however some other correlations disappeared, e.g. with latitude (r=-.02) and democracy (r=-.04).

Table 3a Cross-national correlations between NIQ[QNW] and non-IQ variables on the global scale and two sub-samples, separated at IQ=60

The analysis was repeated in Table 3b with separation at the threshold of 70 instead of 60. The pattern of Table 2a largely vanishes, suggesting that there are real variances in intelligence in the range <70, which would be erroneously removed by correcting the results. Moreover, more than twice as many states are affected as before. In conclusion, a threshold of 60 would have to be considered more appropriate than a threshold of 70.

Table 3b Cross-national correlations between NIQ[QNW] and non-IQ variables on the global scale and two sub-samples, separated at IQ=70

In conclusion, there is some evidence that the NIQ variances in the range of <60 reflect no real differences in intelligence between countries and may be caused by the low validity of the tests in this range, whereas that is not the case in the range of <70. Nevertheless, a deeper analysis would be needed to establish a definitive threshold.

A flexible solution

As long as this ambiguity exists problem of very low IQs should therefore be addressed flexibly and the circumstance was used at the same time to realize another intention. V1.3.3 now includes a complex feature for filtering and correction that allows each user to set a set of individual rules for using/not using and correcting samples. It can be found within the primary file in the table [SET]. Figure 3 shows a section of it.

Fig.3 Screenshot from the new table [SET]in V1.3.3

The yellow colored cells can be used to define a set of criteria for correction and filtering. Both, corrections and filterings, are executed exclusively at the sample level and the numbers at the national level were calculated from numbers of the filtered set of samples. Here is a brief description of how the new feature can be used.

At first, in line 3 column D+E, a score can be defined as a lower threshold of IQ. This means that, if for example a score of “60.00” was entered, IQs of all samples (and the IQ from school assessment studies) <60.00 were set to 60.00, whereas a score of 65.00 will remain 65.00. No samples will be excluded here, no matter which setting is made. The corrections only affect the variable [IQ (cor.)] in table [REC] and the variable [IQ (SAS)] in [NAT]. “60.00” is also the default score which from now on is usually used in the entire dataset.

Lines 4 to 8 can be used to define which tests should be integrated. E.g. if “RPM” is typed in cell 6/DE, all samples which were not measured with a Raven’s Test will be excluded. If “WISC-IV” is typed in cell 7/DE, all samples which were not measured with the WISC-IV will be excluded. And if “CPM” is typed in cell 8/DE, all samples will be excluded on which the CPM was not used to calculate IQ scores.

From line 9 onwards, lower and upper boundaries can be defined for 32 metric or ordinal variables. E.g. if “10.00” is typed in cell 24/D and “20.00” is typed in cell 24/E, only samples will be used which mean ages are between 10 and 20 years. The maximum ranges of each variable are shown in columns F and G. In the case of ordinal variables, the column H gives information about the meaning of each condition. E.g. if “3” is typed in cell 17/D and “4” is typed in cell 17/E, only samples will be used which are defined as of urban or rural origin. However, a combination of regional (2) and foreign (5) sample origins is not possible at the time.

The lower threshold should be used here immediately. The question is to what extent certain correlations change when the threshold adopts different values. Table 4 correlates the same non-IQ variables with NIQ[QNW] as Table 2 but always on the global scale and with thresholds increasing from “0.00” along “55.00”, “60.00”, “70.00” until “80.00”. Fig. 4 shows in which countries the first sample was affected when thresholds increased. Please note that the USA and Canada are therefore affected because samples of some ethnic groups (e.g. Blacks; Native Americans) show IQs significantly lower than the country mean. From thresholds of “0.00” to “60.00”, the magnitudes of the correlations in Table 3 increased but start to stagnate or decline at “70.00”. If a threshold of “80.00” was set, all correlations stagnated or decline strongly. At the same time, the global IQ of 86.55 increase slightly to 86.69 at a threshold of “60.00” but after that, take significantly bigger steps to 87.22 at “70.00” and 88.84 at “80.00”. For this reason, “60.00” has been selected as the default value.

Table 4 Cross-national correlations between NIQ[QNW] and non-IQ variables for different settings of lower thresholds

Notes: Number of countries are stable

Fig.4 Countries affected by lower limits

Notes: A colour represents the first threshold which affects at least one of the samples used for a country

Changes from V1.3.2 to V1.3.3 in detail

The implementation of the feature for filtering and correction made some changes in the main file necessary. Some variables in tables [REC] and [SEL] were changed from texts to numbers. This recoding affected the variables [SES], [Sample char.], [Domain], [Procedure], [Country of std.], [Recalc.?], [Special calc.?] and [Test-conv.? ]. The meanings of the numbers which replaced the former text entries are explained in the manual and in the table [SET], column H. Variable [Greenwich cor.], which was a placeholder until then, was replaced by [Below threshold?] and now shows an “1” if a samples [IQ (cor.)] was changed by the threshold set, or “0” if not. The new variable [Test (type)] shows the test family (e.g. Raven’s Matrices, Wechsler Scales, Stanford-Binet) used for measurement. The table [SEL] is now working as the mechanics behind the filter.

I was informed of a bigger mistake concerning the calculation of population weighted IQ averages in tables [NAT] and [FAV], line 212. The formula used to calculate these scores was faulty, since it includes the country-populations only of nations with IQ-data in the numerator, but of all nations in the denominator, which reduced the overall world mean significantly if IQ-data were not available for each country in the list. I corrected the formula so it just takes into account population sizes from nations with IQ-data. As a result, die GLOBAL IQ on table [INF] increased from 81.98 to 86.55. Although, this mistake had no effect on individual country values and statistics, it has to be mentioned separately, since the GLOBAL IQ is the score that becomes visible firstly when using the data set.

Since I got several inquiries about which of the many different IQ variables was most appropriate, I also decided not to use NIQ[QNW] as before, but NIQ[QNW+SAS] as the GLOBAL IQ. The second has the much higher data quality because it is rechecked by the results from international school assessment studies. For this reason I would also like to recommend to anyone who prefers the quality rather than the quantity of data to fall back on NIQ[QNW+SAS]. Only those who want to achieve the highest possible number of cases should fall back on the variable NIQ[QNW+SAS+GEO], which includes the added IQs calculated via means of neighboring countries.

Further corrections were minor and listed in the file CHANGELOG (V1.3.2.1).txt. These concerned mostly occasional values for the non-IQ variables and some samples without leading to significant changes. Furthermore, all references were reviewed again and unified the citation style.

Another new table [CNC] (Cross-National Correlates) has been added. It contains a collection of calculated cross-national correlations from published studies that use the IQ scores from Lynn and Vanhanen or from the NIQ dataset. Most of these numbers were collected by Lynn and colleagues. However, I have tried to confirm as many as possible by the original sources and have added some missing. Additionally, I summarized similar variables to one. These are presented as bold entries from line 3 to 450, whereas all lines below contain each coefficient from each source. This table is still under construction and will be improved and expanded in the future.

Greenland now has its own NIQ, taken from a study by Weihe et al. (2002). Block Designs and Digit Span forward and backward from the WISC-R were administered to a sample of mercury exposed Inuit children in Thule. Since Digit Span is recommended by the WISC-R manual as a good test for finding effects of brain disease, which could be caused by mercury, and is normally not used to calculate FS-IQ (Wechsler, 1974, pp.7-8), only scores from Block Designs were used. Here, a mean raw score of 18.90 was reported. According to the WISC-R norms from 1972 in the USA, this score would be equivalent to a scaled score of 11 and an IQ of 105. 7.82 points have to be subtracted for FLynn-Effect, because measurement took place in 1995, and 2.50. because the norms are from the USA. Therefore, the final IQ would be 94.68, relatively high but in the range (78-96) of those reported for Arctic people by Lynn (2006, Table 11.1).

Closing remarks

I feel compelled to make two clarifications. The whole NIQ-project is based on quantitative methods, therefore it is not very suitable for the determination of individual cases. In particular, it is not appropriate to compare two single countries. What is more important is how close their NIQ scores are to each other. It is more suitable for showing global patterns and accordingly should not be misused for other purposes. There was also criticism regarding a missing peer review of the data and methods I used. I agree that this is a weak point, however the reason why I decided to make this project in an open-source style and with software that is widely used, which offers many more people the opportunity to gain insight into the work as it would ever be possible through a peer review in a journal. And as shown above, this contributes significantly to the improvement.

At the end, a short roadmap for the future of this project should be presented. There are still a lot of sources not integrated in the dataset, due to not yet integrated test types, unresolved problems with the reported findings, or simply for lack of time. This huge bunch of publications should be processed gradually. Recently, I got US norms for various Wechsler Scales. At the moment, German norms were mostly used to convert raw or scaled scores to IQ. These should be gradually replaced as the English-language versions of the tests are more widely used. I have received some inquiries about the standard deviations within the samples. So far, the NIQ dataset only shows the standard deviations of the means of samples per country. In order to change this, a further review of all sources is necessary as the required data has never been collected. However, it is not clear to what extent this will succeed. Many sources report standard deviations only for raw or IQ scores or neither. I am also looking forward to the results of the PISA 2018 volume, which will be published around December this year.

References

Ahmadpoura, N., & Mujembari, A. K. (2015). The impact of montessori teaching method on IQ levels of 5-Year old children. Procedia – Social and Behavioral Sciences, 205(9), 122-127.

Ahmed, A. S. F., Osman, H. A., Bakhiet, S. F. A., Cheng, H., & Lynn, R. (2017). Gender Differences on the Standard Progressive Matrices of South Sudanese Living in Khartoum. Mankind Quarterly, 58, 2.

Alderman, H., Hawkesworth, S., Lundberg, M., Tasneem, A., Mark, H., & Moore, S. E. (2014). Supplemental feeding during pregnancy compared with maternal supplementation during lactation does not affect schooling and cognitive development through late adolescence. American Journal of Clinical Nutrition, 99, 122-129.

Aluko, O. A., & Ajayi, M. A. (2017). Deteminants of banking sector development: Evidence from Sub-Saharan African countries. Borsa Istanbul Review, 20, 1-18.

Baker, Ch. (2017, September 22nd). The trade in body party of people with albinism is driven by myth and international inaction. Retrieved from: https://www.independent.co.uk/life-style/health-and-families/witchcraft-albino-trade-body-part-human-rights-un-sub-saharan-african-a7958946.html

Bakhiet, S., Dutton, E., Ashaer, K., Essa, Y., Blahmar, T., Hakami, S., & Madison, G. (2018). Understanding the Simber Effect: Why is the age-dependent increase in children’s cognitive ability smaller in Arab countries than in Britain? Personality and Individual Differences, 122, 38-42.

Beaton, A. E., Mullis, I. V. S., Martin, M. O., Gonzalez, E. J., Kelly, D. A., & Smith, T. A. (1997). Mathematics Achievement in the Middle School Years: IEA’s Third International Mathematics and Science Study. Chestnut Hill, MA: TIMSS & PIRLS International Study Center, Boston College.

Berry, J. W. (1966). Temne and Eskimo perceptual skills. International Journal of Psychology, 1, 207-229.

Calderon, M. C., & Hoddinott, J. (2010). The inter-generational transmission of cognitive abilities in Guatemala. International Food Policy Research Institute.

Choudhury, N., & Gorman, K. (1999). The relationship between reaction time and psychometric intelligence in a rural Guatemalan adolescent population. International Journal of Psychology, 34, 209-217.

de Sombre, S. (2014). Homeopathic medicines 2014. Notoriety, use and image. [Homöopathische Arzneimittel 2014. Bekanntheit, Verwendung und Image.] Bonn: Allensbach. Retrieved from: https://www.bah-bonn.de/bah/?type=565&file=redakteur_filesystem/public/Ergebnisse_Allensbach_deSombre.pdf

DIMDI (2012). Kapitel V – Psychische und Verhaltensstörungen (F00-F99). Retrieved from: https://www.dimdi.de/static/de/klassifikationen/icd/icd-10-gm/kode-suche/htmlgm2013/block-f70-f79.htm

Dissa, Y., Adjouro, T., Traore, A., & Yorote, A. (2017). A case study of the effects of superstitions and beliefs on Mali socioeconomic development. International Journal of African and Asian Studies, 30, 71-80.

Dramé, C., & Ferguson, Ch. J. (2017). Measurements of Intelligence in sub-Saharan Africa: Perspectives Gathered from Research in Mali. Current Psychology, 2017, 1-6.

Evers, M. (2007, June 8th). The Difficult Struggle Against AIDS in Africa. Retrieved from: https://www.spiegel.de/international/world/epidemic-of-ignorance-the-difficult-struggle-against-aids-in-africa-a-485715.html

Falch, T., & Massih, S. S. (2011). The effect of education on cognitive ability. Economic Inquiry, 49(3), 838-856.

Hallpike, C. R. (1979). Foundations of Primitive Thought. Oxford: Clarendon Press.

Jensen, A. (1998). The g factor. The Science of Mental Ability. Westport: Praeger.

Jukes, M. C. H., & Grigorenko, E. L. (2010). Assessment of cognitive abilities in multiethnic countries: The case of the Wolof and Mandinka in Gambia. British Journal of Educational Psychology, 80, 77-98.

Jukes, M. C. H., Pinder, M., Grigorenko, E. L., Smith, H. B., Walraven, G., & Bariau, E. M. (2006). Long-term impact of malaria chemoprophylaxis on cognitive abilities and educational attainment: Follow-up of a controlled trial. PLoS Clinical Trials, 19, 1-8.

Lurija, A. R. (1982). Cognitive development. Its cultural and social foundations. Harvard: University Press.

Lynn, R., & Becker, D. (2019). The Intelligence of Nations. London: Ulster Institute for Social Research.

Lynn, R., & Vanhanen, T. (2012). Intelligence: A Unifying Construct for the Social Sciences. London: Ulster Institute for Social Research.

Lynn, R. (2006). Race Differences in Intelligence. Augusta, GA: Washington Summit Publishers.

Mani, A., Mullainathan, S., Shafir, E., & Zhao, J. (2013). Poverty impedes cognitive function. Science, 341, 976-980.

Martin, M. O., Mullis, I. V. S., Beaton, A. E., Gonzalez, E. J., Smith, T. A., & Kelly, D. A. (1997). Science Achievement in the Primary School Years: IEA’s Third International Mathematics and Science Study. Chestnut Hill, MA: TIMSS & PIRLS International Study Center, Boston College.

Martin, M. O., Mullis, I. V. S., Gonzalez, E. J., Gregory, K. D., Smith, T. A., Chrostowski, S. J., Garden, R. A., & O’Connor, K. M. (2000). TIMSS 1999 International Science Report – Findings from IEA’s Repeat of the Third International Mathematics and Science Study at the Eighth Grade. Chestnut Hill, MA: TIMSS & PIRLS International Study Center, Boston College.

Martin, M. O., Mullis, I. V. S., Gonzalez, E. J., & Chrostowski, S. J. (2004). Findings From IEA’s Trends in International Mathematics and Science Study at the Fourth and Eighth Grades. Chestnut Hill, MA: TIMSS & PIRLS International Study Center, Boston College.

Martin, M. O., Mullis, I. V. S., & Foy, P. (2008). TIMSS 2007 International Science Report: Findings from IEA’s Trends in International Mathematics and Science Study at the Fourth and Eighth Grades. Chestnut Hill, MA: TIMSS & PIRLS International Study Center, Boston College.

Martin, M. O., Mullis, I. V. S., Foy, P., & Stanco, G. M. (2012). TIMSS 2011 International Results in Science. Chestnut Hill, MA: TIMSS & PIRLS International Study Center, Boston College.

Martorell, R., Behrman, J. R., Grajeda, R., & Hoddinott, J. (2005). The human capital 2002–04 study in Guatemala: A follow-up to the INCAP longitudinal study 1969–77. Food & Nutrition Bulletin, 26, 1-120.

Masakhwe, Ph. W. (2009, September 1st). Africa: Persons with Albinism are Facing Real Threat to Life. Retrieved from: https://allafrica.com/stories/200909010901.html

Mullis, I. V. S., Martin, M. O., Beaton, A. E., Gonzalez, E. J., Kelly, D. A., & Smith, T. A. (1997). Mathematics Achievement in the Primary School Years: IEA’s Third International Mathematics and Science Study. Chestnut Hill, MA: TIMSS & PIRLS International Study Center, Boston College.

Mullis, I. V. S., Martin, M. O., Gonzalez, E. J., Gregory, K. D., Garden, R. A., O’Connor, K. M., Chrostowski, S. J., & Smith, T. A. (2000). TIMSS 1999 International Mathematics Report – Findings from IEA’s Repeat of the Third International Mathematics and Science Study at the Eighth Grade. Chestnut Hill, MA: TIMSS & PIRLS International Study Center, Boston College.

Mullis, I. V. S., Martin, M. O., Gonzalez, E. J., & Chrostowski, S. J. (2004). Findings From IEA’s Trends in International Mathematics and Science Study at the Fourth and Eighth Grades. Chestnut Hill, MA: TIMSS & PIRLS International Study Center, Boston College.

Mullis, I. V. S., Martin, M. O., & Foy, P. (2008). TIMSS 2007 International Mathematics Report: Findings from IEA’s Trends in International Mathematics and Science Study at the Fourth and Eighth Grades. Chestnut Hill, MA: TIMSS & PIRLS International Study Center, Boston College.

Mullis, I. V. S., Martin, M. O., Foy, P., & Arora, A. (2012a). TIMSS 2011 International Results in Mathematics. Chestnut Hill, MA: TIMSS & PIRLS International Study Center, Boston College.

Mullis, I. V. S., Martin, M. O., Gonzalez, E. J., & Kennedy, A. M. (2003). PIRLS 2001 International Report: IEA’s Study of Reading Literacy Achievement in Primary Schools. Chestnut Hill, MA: TIMSS & PIRLS International Study Center, Boston College.

Mullis, I. V. S., Martin, M. O., Kennedy, A. M., & Foy, P. (2007). IEA’s Progress in International Reading Literacy Study in Primary School in 40 Countries. Chestnut Hill, MA: TIMSS & PIRLS International Study Center, Boston College.

Mullis, I. V. S., Martin, M. O., Foy, P., & Drucker, K. T. (2012b). PIRLS 2011 International Results in Reading. Chestnut Hill, MA: TIMSS & PIRLS International Study Center, Boston College.

Mullis, I. V. S., Martin, M. O., Foy, P., & Hooper, M (2016a). TIMSS 2015 – International Results in Mathematics. Fourth Grade Mathematics. Bosten: IEA.

Mullis, I. V. S., Martin, M. O., Foy, P., & Hooper, M (2016b). TIMSS 2015 – International Results in Mathematics. Fourth Grade Science. Bosten: IEA.

Mullis, I. V. S., Martin, M. O., Foy, P., & Hooper, M (2016c). TIMSS 2015 – International Results in Mathematics. Eight Grade Mathematics. Bosten: IEA.

Mullis, I. V. S., Martin, M. O., Foy, P., & Hooper, M (2016d). TIMSS 2015 – International Results in Mathematics. Eight Grade Science. Bosten: IEA.

Nattrass, N. (2008). AIDS and the scientific governance of medicine in post-apartheid South Africa. African Affairs, 107, 157-176.

Oesterdiekhoff, G. W. (2012a). The evolution of humanity from the childhood phase to adult maturity [Die Entwicklung der Menschheit von der Kindheitsphase zur Erwachsenenreife]. Berlin: Springer VS.

Oesterdiekhoff, G. W. (2012b). Was pre-modern man a child? The quintessence of the psychometric and developmental approaches. Intelligence, 40, 470-478.

Oesterdiekhoff, G. W. (2013). Ontogeny and history. The leading theories reconsidered. Cultural-Historical Psychology, 3, 60-69.

Oesterdiekhoff, G. W. (2016). Is a forgotten subject central to the future development of sciences? Jean Piaget on the interrelationship between ontogeny and history. Personality and Individual Differences, 98, 118-126.

Osman, H. A., Bakhiet, S. F. A., Alnaier, N. A. G., & Lynn, R. (2017a). A study of the intelligence of South Sudanese Refugee Children. Mankind Quarterly, 57, 585-589.

Osman, H.A., Bakhiet, S. F. A., Abduljabbar, A. S., Alnaier, N. A. G., & Lynn, R. (2018). A study of intelligence in the new nation of South Sudan. Mankind Quarterly, in press.

Raven, J. (1981). Irish and British Standardisations. Oxford: Oxford Psychologist Press.

Raven, J. (2000). The Raven’s Progressive Matrices: Change and Stability over Culture and Time. Cognitive Psychology, 41, 1-48.

Raven, J. (2008a). Coloured Progressive Matrices and Crichton Vocabulary Scale. London: Pearson.

Raven, J. (2008b). Standard Progressive Matrices and Mill Hill Vocabulary Scale. London: Pearson.

Raven, J., Raven, J. C., & Court, J. H. (1998). Manual for Raven’s Progressive Matrices and Vocabulary Scales. Oxford: Oxford Psychologist Press.

Raven, J., Raven, J. C., & Court, J. H. (2006). Manual zu Raven’s Progressive Matrices und Vocabulary Scales. Frankfurt: Harcourt Test Services.

Redfern, P. (2010, July 26th). Africa: Rising Number of Children Accused of ‘Crime of Witchcraft’. Retrieved from: https://allafrica.com/stories/201007261355.html

Rescher, N. (1993). Rationality: A Philosophical Inquiry into the Nature and the Rationale of Reason. [Rationalität. Eine philosophische Untersuchung über das Wesen und die Rechtfertigung von Vernunft]. Würzburg: Königshausen und Neumann.

Rindermann, H. (2006). Was messen internationale Schulleistungsstudien? [What do international student assessment studies measure? School performance, student abilities, cognitive abilities, knowledge or general intelligence?] Psychologische Rundschau, 57, 69-86.

Rindermann, H., & Carl, N. (2017). Indigenous Ecuadorian children: Parental education, parental wealth and children’s cognitive ability level. Learning and Individual Differences, 54, 202-209.

Rindermann, H., & Ceci, St. J. (2009). Educational policy and country outcomes in international cognitive competence studies. Perspectives on Psychological Science, 4(6), 551-568.

Rindermann, H., Falkenhayn, L., & Baumeister, A. E. E. (2014). Cognitive ability and epistemic rationality: A study in Nigeria and Germany. Intelligence, 47, 23-33.

Rodríguez, T. (2012). Environmental pesticide exposure and neurobehavioral effects among children of Nicaraguan agricultural workers. Ph.D thesis, University of Uppsala. Acta Universitatis Upsaliensis. Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Medicine 820.

Sandiford, P., Cassell, J., Sanchez, G., & Coldham, C. (1997). Does intelligence account for the link between maternal literacy and child survival? Social Science & Medicine, 45, 1231-1239.

Stein, A. D., Behrman, J. R., DiGirolamo, A., Grajeda, R., Martorell, R., Quisumbing, A., & Ramakrishnan, U. (2005). Schooling, educational achievement, and cognitive functioning among young Guatemalan adults. Food and Nutrition Bulletin, 26, 46-54.

Tempe, J. (2019, Juni 7th). Five killed in Mozambique ‘for being bald’. Retrieved from: https://www.bbc.com/news/live/world-africa-39261633?ns_mchannel=social&ns_source=twitter&ns_campaign=bbc_live&ns_linkname=5937cde1e4b025bd1992528a%26Witchdoctors%20%27switch%20tactics%27%26&ns_fee=0#post_5937cde1e4b025bd1992528a

Tenkorang, E. Y., Gyimah, St. O., Tyndale, E. M. & Adjei, J. (2011). Superstition, witchcraft and HIV prevention in sub-Saharan Africa: the case of Ghana. Culture, Health & Sexuality, 13(9), 1001-1014.

The Associated Press (2007). Gambian president’s claim of AIDS cure causes alarm. Retrieved from: http://usatoday30.usatoday.com/news/health/2007-02-20-gambia_x.htm

Under The Same Sun (2019). Retrieved from: https://www.underthesamesun.com/content/academic-articles

Weihe, P., Hansen, J. C., Murata, K., Debes, F., Jørgensen, P. J., Steuerwald, U., White, R. F., & Grandjean, Ph. (2002). Neurobehavioral performance of Inuit children with increased prenatal exposure to Methylmercury. International Journal of Circumpolar Health, 61, 41-49.

Whaley, S. E., Sigman, M., Neumann, C., Bwibo, N., Guthrie, D., Weiss, R. E., Alberz, S. & Murphy, S. P. (2003). The Impact of Dietary Intervention on the Cognitive Development of Kenyan School Children. The Journal of Nutrition, 133, 3965S-3971S.

Wicherts, J. M., Dolan, C. V., & van der Maas, H. L. J. (2010). A systematic literature review of the average IQ of sub-Saharan Africans. Intelligence, 38, 1-20.

Version 1.3.2 now uploaded

Version 1.3.2 now uploaded

A new version of the NIQ dataset was completed yesterday and uploaded today. It now includes 669 samples with a total of 617,581 individuals from 130 different countries. The global IQ is now 81.98 (N=130; SD=13.43) for measured IQs only or 81.90 (N=201; SD=13.47) if missing countries have been supplemented by geographical averages of their neighboring countries (Fig. 1).

Fig. 1 Global distribution of national IQs from psychometric measurements and international school assessment studies, supplemented by geographical averages (M=81.90; N=201; SD=13.47).

Due to several comments, which objected the version 1.3.1 as too large, a conversion was made, which reduced the file size from formerly 24,998 to now 3,103 kb, without losing any content. Therefore, changes compared to the previous version can be found in particular in the file structure.

The former sheet [ADDVAR], which contained the non-IQ variables for correlation analyzes, has now been integrated into the sheet [NAT], to the right of the summaries of IQ samples at country level. A new table named [SEL] has been integrated to work as a filter for sample level data. All numbers in this new table are identical to those listed in the known table [REC]. This made it possible to select variables for correlation analyzes without having complete correlation matrices permanently in the file. So, the tables [STATCOR(REC)] and [STATCOR(NAT)] could also be excluded.

Two new tables [STAT(REC)] and [STAT(NAT)] now working as templates for descriptive statistics and correlation analyzes on the cross-sample level (REC) or cross-national level (NAT). To use them, the respective names of columns as capital letters must be typed into the yellow fields. An index of which column contains which variable can also be found in the two mentioned tables.

Two new countries have been added: Djibouti and Benin. Both were sourced from two publications in press (Bakhiet, Becker & Lynn; Bakhiet et al.). Both were measured with the Raven’s Standard Progressive Matrices Plus and the mean IQ of the sample from Benin was estimated as 70.36 and the mean IQ of the sample from Djibouti as 52.14. A comparison of the IQ-distributions within both samples, together with a third new sample from Sudan (Husain et al., in preparation), in comparison with an idealized distribution of a British norm-sample (Raven, 2008) can be seen in Fig. 2.

Fig. 2 IQ distribution in three samples from Benin, Djibouti and Sudan in comparison with an idealized distribution in a British norm-sample.

Furthermore, 18 new non-IQ variables have been added, e.g. shares of religious groups in the total populations, demographic variables for more than the recent year and the proportion of consanguinities on all couples. The scatter plot of the relation between the latter and national IQs can be seen in Fig. 3. Data for consanguinity were taken from Bittles (2001), prepared by Woodley (2009, Appendix A), and Tadmouri et al. (2009).

Fig. 3 Scatter plot between national IQs [QNW+SAS+GEO] and proportions of consanguinity on all couples [CONS. (%)] (r=-.51; N=76; p<.001)

In addition, some bugs have been fixed, e.g. a double-used sample from Sudan, a wrong geographic mean for North Korea and several errors within a sample from Haiti (ID: HTI5439).

References

Bakhiet, S. F., Becker, D., & Lynn, R. (in press). Intelligence in the West African state of Benin. Mankind Quarterly.

Bakhiet, S. F., Becker, D., Ahmed, S. A. E. S., & Lynn, R. (in press). Intelligence in the East African state of Djibouti. Mankind Quarterly.

Bittles, A.H., (2001). Consanguinity/endogamy resource. Retrieved from http://www.consang.net

Husain, N. I. A. E., Becker, D., Bakhiet, S. F., & Lynn, R. (in preparation).

Raven, J. (2008b). Standard Progressive Matrices and Mill Hill Vocabulary Scale. London: Pearson.

Tadmouri, Gh. O., Nair, P., Obeid, T., Al Ali, M. T., Al Khaja, N., & Hamamy, H. A. (2009). Consanguinity and reproductive health among Arabs. Reproductive Health, 6, 17. doi:10.1186/1742-4755-6-17

Woodley, M. A. (2009). Inbreeding depression and IQ in a study of 72 countries. Intelligence, 37, 268-276. doi:10.1016/j.intell.2008.10.007

THE NIQ-DATASET V1.3 – A Summary (Part – II)

This is a continuation. Enumerations of tables and figures continuous from Part – I. Explanations for abbreviations etc. could already be in Part – I. For Part – I and – II, the same dataset was used.

Correlation analysis

Simply comparing means does not let us analyze important deviations. Therefore, this section will discuss results from correlation analyses, at first in Table 3 between the different estimations at the cross-national level. The national IQs [DB] show strong positive correlations with all those from Lynn and Vanhanen, with the highest value in case of L&V12 (r=.89; N=108; p<.001). A look in the scatterplot in Fig.3 shows that this relationship is linear. Using a second grade polynomial formula increases r by <.01. IQs from Lynn and Vanhanen are a bit higher for low-IQ nations and lower for high-IQ nations. This might be due to DB’s intensified use of IQ conversions between different Raven’s Matrices forms, which would also explain the higher standard deviation of IQ[DB] relative to IQ[L&V12] in Table 1 and 2 (Part – I).

Table 3: Correlation matrix between national IQs by different estimations. Underlined relations in scatterplots. Variable “DB & L&V12” is from unweighted means of both estimates or from the only available estimate when there was only one present. For all correlations, p<.001.

Fig.3: Scatterplot between two estimations of national IQs (DB × L&V12; r(linear)=.89; N=108; p<.001).

Table 4: Correlation matrix between national IQs[DB] from named tests only. Underlined relations in scatterplots. *p<.001 (N varies). RPM=Raven’s Progressive Matrices (all); SPM=Raven’s Standard Progressive Matrices; SPM+=Raven’s Standard Progressive Matrices Plus; CPM=Raven’s Coloured Progressive Matrices; APM=Raven’s Advanced Progressive Matrices; WIS= Wechsler Intelligence Scales (all); WISC-R=Wechsler Intelligence Scale for Children revisited ed.; WISC-III=Wechsler Intelligence Scale for Children 3^rd ed.; CFT=Culture Fair Test.

The correlations between the national IQs [DB] from different tests are strongly positive, especially between tests from the same type (Raven’s Matrices or Wechsler Scales). But between different types, they decrease to a low of .26 (APM × SPM+; N=7; p=.571) or .15 (CPM × CFT; N=10; p<.676). This might be due to error as a consequence of low sample sizes of nations in which estimates from these different tests or test-types are available. Between the two most frequently used test-types, RPM and WIS, the correlation stays strong with r=.76 (N=23; p<.001), also shown in Fig.2. But larger discrepancies between the RPM and WIS can be seen in some cases; the most marked is Egypt (EGY), the red datapoint on Fig. 4. The low WIS score of 43.54 came from a study by Wachs et al. (1995) from a sample of rural children with low SES. This source was rechecked during the writing of this blogpost but no errors in the calculation of IQs could be found. The same source gives a CPM raw score of 14.18 equivalent to a corrected IQ of 56.87 for 8.50y olds, a bit higher than the IQ from WISC-R but also very low for a North African or Middle Eastern nation. The WIS score was the only available such score for Egypt; however, there were other available RPM scores, and the RPM estimate for Egypt as a whole is 82.65. Such outliers will be discussed later, in a special blogpost.

Fig.4: Scatterplot between estimations of national IQs[DB] from two different test-types (RPM × WIS; r(linear)=.76; N=23; p<.001). Red is strongest outlier EGY (M=82.65 vs. 43.54).

A correlation analysis at the cross-sample level allows one to observe relations between IQs from samples and some other properties of the samples. Line one in Table 5 shows that IQs are not significantly related to mean age or size of the samples, and also not to the amount of test-time adjustment (r(linear)=.05; N=566; p=0.435). Such adjustments are necessary to correct estimations of IQs for the FLynn-Effect in the country the used test was standardized (more in later posts). It is one of the most error-prone parts of the methods and criticized by other scientists (Wicherts, Dolan & van der Maas, 2010; Wicherts et al., 2010). But the weak correlation suggests that the adjustments lead to very little error.

Table 5: Correlation matrix between sample IQs and sample characteristics. Underlined relations in scatterplots. *p<.001.

Fig.5: Scatterplot between estimations of sample IQs [DB] and test-time adjustments; r(linear)=.05; N=566; p=.435).

There is a moderate and significant correlation (r=-.21; N=566; p<.001) between sample IQs and years of measurement, together with a weak but also significant correlation (r=-.13; N=566; p<.001) between sample IQs and the year of test standardization. Both sample properties are in turn strongly positively related (r=.48; N=566; p<.001), possibly because more recent studies mostly use more recent standardized tests. A closer look at the correlation between sample IQs and the year of measurement (Fig.6) shows that also an U-shaped regression-line would not increase the correlation noticeably (r(square)=.22).

Fig.6: Scatterplot between estimations of sample IQs [DB] and years of measurement; r(linear)=-.21; N=578; p=.435).

Once again, IQs from different estimations [DB; L&V] are very strongly linearly related, as shown in Fig.7 (r(linear)=.87; N=566; p<.001). One extreme outlier was found in this analysis, which is a sample for India (IND) with an IQ[DB] of 120.22 but an IQ[L&V] of 77.00, and the second matches much better to other scores found for this nation (M=81.94 than the first. The sample is from a publication of Gandhi-Kingdon (1996), consisting of government aided children of around 13.50y from urban areas, and the IQs were once again checked as a result of this finding. The source gives two additional samples with IQs [DB] of 70.08 (N=252) and 74.78 (N=290). The third sample, named “PUA” (private unaided) within the source, differs from both others by having much higher scores for education (self and family) and SES, which makes higher scores in Raven’s Tests expectable. The mean age of the PUA sample is 13.60 and the CPM raw score 36.03, converted to a SPM raw score of 52.91 because the age range of the CPM norms (4.00 to 11.00y) does not extend to the PUA samples’ age. This is equivalent to a corrected IQ of 120.22. An error could not be detected and is eventually hidden in inaccuracies of the CPM to SPM conversionLynn and Vanhanen did not split the samples and reported the score of 77.00 for all 928 individuals. By using the same procedure, a corrected IQ [DB] of 88.36 was estimated, which is still 11.36 points above their estimate, but at least closer. As mentioned before, this outlier will be discussed together with some others in a special blogpost later.

Fig.7: Scatterplot between two estimations of sample IQs [DB × L&V]; r(linear)=.87; N=566; p<0.001).

The last scatterplot in Fig.8 shows the relationship between sample IQs [DB] and the absolute value differences between both estimates of sample IQs [DB; L&V] (r=-.26; N=566; p<.001). The differences were much greater for low-IQ samples and a U-shaped regression line would show a correlation (r=.42) twice stronger than a linear one. This might be due to data quality or to the use of extrapolated IQs from conversion formulas. So far, it is the only pattern for a systematic deviation between both estimations and definitely requires further investigation.

Fig.8: Scatterplot between sample IQs [DB] and absolute differences between two estimations of sample IQs [DB × L&V]; r(linear)=.87; N=566; p<.001).

What’s next?

The further course of this project will follow three major goals. At first, analyses should be done to uncover the causes of deviations between the different estimations of national IQ as well as to find eventual methodological variables that might be responsible for variations in IQs. The second aim is a detailed explanation of the methods used to estimate national and sample IQs (DB), to make all steps and decisions easier to understand and transparent for critics. And last, a sample quality index should be created to evaluate the goodness and validity of the national IQ dataset (both in its entirety and for specific samples), and to evaluate the impact of data quality on estimated national and sample IQs.

Sources and References (Part – I and II)

Central Intelligence Agency (CIA) (2017). The World Fact Book. From: https://www.cia.gov/library/publications/the-world-factbook/

Gandhi-Kingdon, G. (1996). The quality and efficiency of private and public education: A case-study of urban India. Oxford Bulettin of Economics and Statistics, 58, 57-82. doi:10.1111/j.1468-0084.1996.mp58001004.x

Lynn, R., & Vanhanen, T. (2002). IQ and the Wealth of Nations. Westport, CT: Praeger.

Lynn, R., & Vanhanen, T. (2012). Intelligence: A Unifying Construct for the Social Sciences. London: Ulster Institute for Social Research.

Wachs, T. D., Bishry, Z., Moussa, W., Yunis, F., McCabe, G., Harrison, G., … & Shaheen, F. (1995). Nutritional intake and conext as predictors of cognition and adaptive behavior of Egyptian school children. International Journal of Behavioral Development, 18, 425-450. doi:10.1177/016502549501800303

Wicherts, J. M., Dolan, C. V., & van der Maas, H. L. J. (2010). A systematic literature review of the average IQ of sub-Saharan Africans. Intelligence, 38, 1-20. doi:10.1016/j.intell.2009.05.002

Wicherts, J. M., Dolan, C. V., Carlson, J. S., & van der Maas, H. L. J. (2010). Raven’s test performance of sub-Saharan Africans: Average performance, psychometric properties, and the Flynn Effect. Learning and Individual Differences, 20, 135-151. doi:10.1016/j.lindif.2009.12.001

THE NIQ-DATASET V1.3 – A Summary (Part – I)

A new version of the THE NIQ-DATASET (V1.3) is nearing completion. It will be based largely on the same data that already underlie V1.2, but will contain a large amount of new statistical analysis. This blog post summarizes the most important findings. It presents comparisons between different estimates,––such as between Lynn and Vanhanen’s 2002 and 2012 estimates and between the combined results of different IQ tests –– and addresses the most striking outliers. However, a deeper investigation of the reasons for deviations or relationships between the different estimates is planned for later posts. Please keep also in mind that due to some new data and changes in methods, the numbers and findings presented here may differ from those in V1.2.

Some explanations in advance: Abbreviations are used to specify different estimations and tests. The abbreviation “L&V” results from the surnames of Richard Lynn and Tatu Vanhanen, and the numbers “02” and “12” represent the years in which Lynn and Vanhanen’s estimates were published (Thus, L&V02 indicates the national IQ estimates published by Lynn and Vanhanen in 2002). “DB” indicates estimates made by me, David Becker. A “+” always marks estimations which include national IQs calculated not only directly from psychometric measurements but from geographic mean IQs from neighboring nations. Thus, L&V02+ indicates the national IQ estimates published by Lynn and Vanhanen in 2002, including nations for which IQs were calculated by averaging the IQs of neighboring nations. Two levels are observed in the analyses: The cross-national level and the cross-sample level, by which numbers at the the cross-national level based on calculations (mostly means) from data at the cross-sample level.

Database

At the time of writing, the database consists of IQs for 203 nations [DB+], of which 125 were calculated from psychometric measurements of intelligence [DB] from 325 scientific sources (see Table 1), and 78 were calculated from geographic means when no data was available for the nations themselves. The World Fact Book of the CIA (2017) currently lists 267 “world entities”; the [DB] estimates include 46.82% of these, and the [DB+] estimates include 76.03%. This includes also such entities as Hong Kong, the Gaza Strip, Puerto Rico, Greenland, and others; these are not “nations” in the sense of wholly autonomous political entities, but they are often listed separately in cross-national comparisons and their IQs should therefore also be of interest.

Sources are mostly publications in scientific periodicals, reports or books. Sources with samples of doubtful origin or methods were not used; neither were sources that were missing necessary data, such as sample size (N of individuals), mean age, geographic origin of a sample and year of test-administration. A huge amount of additional data was collected from the sources for more accurate tracking of methods and eventually necessary corrections. For example, nearly all of the IQs from Raven’s Progressive Matrices were recalculated from raw scores to avoid irregularities caused by different standardizations. Some of the sources gave more than one sample, and some samples were split into more homogeneous sub-samples to allow specific analyses.

Overall, there are 566 samples included (one overall IQ estimate per sample) encompassing 627,098 individuals. The number may seems high; however, it only accounts for <0.01% of the world’s total population of 7,405,107,650 (CIA, 2017). Possible multiple inclusion of individuals in different samples cannot be ruled out. However, care was taken to not include the same sample twice if published in more than one source, and the geographical and temporal distances between the different test-administrations should make such overlaps unlikely.

Table 1: Sample sizes in V1.3 on the national and individual levels. *Number of world entities and world’s total population from CIA (2017).

Descriptive statistics

Fig. 1 and 2 survey the geographic distribution of all national IQ scores [DB; DB+], which pattern is similar to that found by Lynn and Vanhanen (2002; 2012) and described in the earlier post. The purpose of Table 2 is to give a more detailed look into the data at the cross-national level. According to the dataset, the mean IQ weighted by national populations is 88.17 [DB] or 88.16 [DB+]. These means are similar to those found by Lynn and Vanhanen in 2002 (89.30 or 88.00) and 2012 (83.71 or 89.73), and the standard deviations are also similar.

Fig.1: National IQs [DB] from THE NIQ-DATASET (V1.3) (M=84.74; SD=12.79; N=125). Data from psychometric tests only.

Fig.2: National IQs [DB+] from THE NIQ-DATASET (V1.3) (M=82.57; SD=12.50; N=199). Data from psychometric tests and geographic means.

For IQ[DB] and IQ[DB+], the nation with the lowest estimated score is Nicaragua (NIC) with 53.69, an implausible value that should be treated with caution. It does not make sense considering the IQs of nearby and culturally or ethnically similar nations; nor does it fit Nicaragua’s economic and educational status. Unfortunately, Nicaragua is neither included as a case with psychometric measured IQ in one of the L&V estimations, nor into the international school assessment tests PISA or TIMSS, and can therefore not be validated. The highest IQ score for Nicaragua found within a sample is 63.63, which seems more realistic but more should not be speculated here. In the L&V estimations, the nation with the lowest estimated IQ is Equatorial Guinea (GNQ) with 59.00 in 2002 or Malawi (MWI) with 60.10 in 2012. It should be noted, that the score of GNQ was wrongly attributed to GNQ and the value thus a mistake, as noted by Wicherts, Dolan & van der Maas (2010). On the other hand, the East Asian political entities Hong Kong (HKG) and Singapore (SGP) obtained the highest IQ scores in both the present database and the L&V12 database. For Singapore, these are 113.67 [DB; DB+] and 107.10 [L&V12; L&V12+]. For Hong Kong, these are 105.90 [DB; DB+] and … [L&V12; L&V12+]. In 2002, Lynn and Vanhanen estimated an IQ for Hong Kong of 107.00, which is very close to DB’s estimate of 105.90. The last line of Table 2 shows a combination of the two estimations DB and L&V12, for which a simple mean was calculated if scores were available for both variables; otherwise, the only available score was taken.

In lines 2 to 10 a breakdown of IQ[DB] is made for different tests. These are the Raven’s Progressive Matrices (RPM) in their standard (SPM), standard-plus (SPM+), colored (CPM) and advanced (APM) version. For Wechsler Intelligence Scales (WIS), only the two specific tests for children WISC-R (Wechsler Intelligence Scale for Children––Revised, the second edition) and (WISC-III) (the third edition) are listed. At least, the Culture Fair Test (CFT) is also separated. Results from many other tests like the Kaufman Assessment Battery for Children (K-ABC), the Stanford-Binet Intelligence Scales (SBIS), or other Wechsler tests (including the WAIS, the WPPSI, and other editions of the WISC) are included in the dataset but not shown below due to the low number of estimates that use these tests. Overall, separated results are very close to each other, except those from the CFT, which are around 10 IQ points higher. Deviations might be caused by case selection. There are 26 national IQs measured by the CFT from which 22 are for Western or European nations plus Hong Kong. For this reason, the standard deviation of the CFT scores is comparatively low.

Table 2: Descriptive statistics for national IQs by different estimations at the cross-national level. Country-codes: ISO 3166-1 ALPHA-3; *Unweighted means of both estimates or the only available estimate. Only tests with N(nations)≥10 separated: RPM=Raven’s Progressive Matrices (all); SPM=Raven’s Standard Progressive Matrices; SPM+=Raven’s Standard Progressive Matrices Plus; CPM=Raven’s Coloured Progressive Matrices; APM=Raven’s Advanced Progressive Matrices; WIS= Wechsler Intelligence Scales (all); WISC-R=Wechsler Intelligence Scale for Children revisited ed.; WISC-III=Wechsler Intelligence Scale for Children 3^rd ed.; CFT=Culture Fair Test.

Similar pattern can be seen in Table 3 for the cross-sample level. Means and standard deviations are close to those from the cross-national level. The differences between the global means from DB and L&V (available for 450 samples up to 2016) is only 0.03, and the mean of the absolute value differences between both estimations is only 5.00 with a standard deviation of 5.15. This means that a majority of 64% of the DB estimations are less than 5 IQ points away from the L&V estimations.

Sources and References (Part – I and II)

Central Intelligence Agency (CIA) (2017). The World Fact Book. From: https://www.cia.gov/library/publications/the-world-factbook/

Lynn, R., & Vanhanen, T. (2002). IQ and the Wealth of Nations. Westport, CT: Praeger.

Lynn, R., & Vanhanen, T. (2012). Intelligence: A Unifying Construct for the Social Sciences. London: Ulster Institute for Social Research.

Wicherts, J. M., Dolan, C. V., & van der Maas, H. L. J. (2010). A systematic literature review of the average IQ of sub-Saharan Africans. Intelligence, 38, 1-20. doi:10.1016/j.intell.2009.05.002

A Warm Welcome

Scientific conferences often start with a greeting to the participants and audience who have traveled long distances around the globe to communicate with, or just listen to, their colleagues. I would like to adhere to this tradition, even if the Internet has significantly shortened the travel time, because this blog has as its aim to communicate with interested people from around the globe about a topic that has a global scale: The IQ of the nations of the world.

IQ is a parameter for intelligence, which can be defined as a “very general mental capability that, among other things, involves the ability to reason, plan, solve problems, think abstractly, comprehend complex ideas, learn quickly and learn from experience.” It “can be measured, and intelligence tests measure it well” and it is of “great practical and social importance” (Gottfredson, 1994).

A trait so important for a huge variety of individual and social phenomena might be able to explain differences existing in many social, political, economic, cultural, and/or biological areas between nations and geographic regions. However, to observe such associations requires the existence of valid figures for the IQs of the world’s nations. This blog aims to present work on national IQ to an unlimited audience.

Ground already covered

In 2002, a book was published that had both a considerable impact on intelligence research and a substantial public reception. The name of this book was “IQ and the Wealth of Nations“, written by the British psychologist Richard Lynn, Professor Emeritus at the University of Ulster, UK, and the Finnish professor of political science at the University of Tampere, Finland, Tatu Vanhanen (†2015), well known for creating the Vanhanen Index of Democratization (Lynn & Vanhanen, 2002; Lynn, 2017; Vanhanen, 2003). One highlight of this book was a list of average national IQs for 185 nations, of which 81 national IQs were collected and calculated directly from studies, the remaining 104 national IQs being estimated from the IQs of nearby and ethnically comparable countries. This list was expanded and improved over time, first in “Race Differences in Intelligence: An Evolutionary Analysis” (Lynn, 2006) next in “The Global Bell Curve: Race, IQ, and Inequality Worldwide” (Lynn, 2008), and finally in “Intelligence: A Unifying Construct for the Social Sciences” (Lynn & Vanhanen, 2012a).

To calculate the national IQs, Lynn and Vanhanen collected IQ test results from samples that were as representative or applicable as possible of/to their respective national populations, and set these results on a scale with the IQ of the UK at 100. This comparison IQ is often called the Greenwich IQ Mean, after Greenwich Mean Time. Some corrections were needed, in the main to adjust for the FLynn Effect, the increase of IQs ascending over time during the 20^th century in many nations (Lynn, 1982; Flynn, 1984; Pietschnig & Voracek, 2015). If more than one measurement of IQ was available for a nation, the mean or median was calculated. IQs for other nations for which no measurements could be found were calculated as means of their adjacent nations.

Fig. 1 is based on the 2012 version and shows how the national IQ levels form geographic clusters, with the lowest scores, between 60 and 80, in Sub-Saharan Africa; middling scores, between 80 and 95, mostly in Arabian or Middle Eastern, South Asian, Oceanian and South American nations; high scores, between 95 and 102, in European nations or those with a population descended mostly from Europe; and finally very high scores, above 102, in Northeast Asian nations.

Fig. 1: National IQs as calculated or estimated by Lynn and Vanhanen (2012).

Such a non-random global pattern suggests that national IQs might be related to other unequally distributed factors; indeed, a large amount of significant correlates, summarized by Lynn and Vanhanen (2012b) have been found between national IQs and other international variables. Statistical relations were found between national IQs and variables in the fields of education (e.g. results from school assessment, literacy, etc.); cognitive outputs (e.g. scientific publications, patents, Nobel prizes, etc.); economics (e.g. GDP/c in varying designs, economic growth, income, etc.); politics (e.g. democracy, freedom, corruption, etc.); social behavior (e.g. crime, suicide rate, happiness, modernization, etc.); medicine (e.g. infant mortality, HIV rates, fertility, etc.); geography (temperature, latitude); and anthropology (skin color).

In most cases, correlations were positive for those variables for which higher scores are usually considered desirable, and negative for variables for which lower scores are usually considered undesirable. On average, nations with higher IQs had stronger economies; were healthier, wealthier, and more educated; and were more modernized, politically stable, and peaceful (national suicide rates are an exception: suicides are less common in low-IQ nations). As the map suggests, there are strong positive correlations between national IQs and average skin brightness of a nation’s population (.69 to .92), and between national IQ and a nation’s mean latitude (.68 to .72). More recently, variables from population genetics joined in, such as population frequencies of intelligence-associated SNPs (e.g. Piffer, 2013, 2015), Y-chromosomal haplogroups (Rindermann, Woodley, & Stratford, 2012) or genetic distances in general (León & Burga-León, 2015; Becker & Rindermann, 2016).

Hunt and Sternberg (2006) called the list of national IQs “far from ideal” in a review of Templer and Arikawa’s (2006) paper of correlations between national IQs, skin color, and income. They (Hunt and Sternberg) criticized the non-representativeness of the included samples; Lynn and Vanhanen’s failure to weight the studies for sample size; and the practice of estimating some countries’ IQs from the IQs of nearby and ethnically comparable countries.

Malloy (2013-2014) addressed Lynn and Vanhanen’s work in great detail at the Human Varieties blog. A huge amount of different measurements of IQ were collected for 15 nations or insular areas in Southeast Asia or the Caribbean. Some national means were very close to those from Lynn and Vanhanen; others showed large differences, especially Cambodia. I compared the national IQs from both sources (some numbers listed in the Malloy column were calculated by me from Malloy’s data) in Table 1 and got a correlation of .47 (N = 12; p > 0.050) with and .79 (N = 11; p < 0.001) without Cambodia. On the one hand, these numbers suggest considerable reliability of Lynn and Vanhanen’s results; on the other hand, some ambiguities, errors, and deviations were reported by Malloy, suggesting that Lynn and Vanhanen’s IQs are not as reliable as have sometimes been believed.

Nation	IQ		Dif.
Nation	Lynn and Vanhanen (2012)	Malloy (2013-2014)	Dif.
Bahamas, The	84.00	92.52	-8.52
Bermuda	90.00	92.00	-2.00
Burma	85.00	97.00	-12.00
Cayman Islands	n.d.	75.00	n.d.
Cuba	85.00	90.00	-5.00
Dominican Republic	82.00	73.14	8.86
Haiti	67.00	68.00	-1.00
Jamaica	71.00	79.00	-8.00
Laos	89.00	90.60	-1.60
Puerto Rico	83.50	84.60	-1.10
Thailand	89.90	93.90	-4.00
Turks and Caicos Isl.	n.d.	89.40	n.d.
U.S. Virgin Islands	n.d.	82.20	n.d.
Vietnam	94.00	91.40	2.60
Cambodia	92.00	66.56	25.44
r	0.47
p	>0.050
r (KHM* excluded)	0.79
p (KHM* excluded)	<0.001

Table 1: National IQs from Lynn and Vanhanen (2012) and Malloy (2013-2014) compared. *KHM = Cambodia.

Notwithstanding all criticism, Lynn and Vanhanen’s dataset remained the only comprehensive cross-national collection of psychometric IQ measurements. In 2015, a group of scientists corresponded about how to make it more accessible to a larger audience and preserve it for future times. As a participant in this correspondence, I welcomed this idea and saw a lot of potential to expand this intention to a comprehensive open access database of global IQs; however, at the time, I could not guess what scale the project would get. It all started with a simple search for digital copies of all the (or as many as possible of the) sources Lynn and Vanhanen had used, and the cataloguing of these digital copies. As we tried to get a deeper understanding of the methods, we obtained more and more IQ data, so that it became possible to re-estimate some of the IQs Lynn and Vanhanen had calculated.

In May 2017, the first version (0.9) of this project was presented at the London Conference on Intelligence (Becker, 2017). This revision included around 340 recalculations of IQs for 115 nations, around half of the nations in the original dataset. At the time, the revision included only IQs measured by Raven’s Matrices but the correlations to Lynn and Vanhanen’s IQs were .95 at the sample level and .83 at the cross-national level (Fig. 2). In the following months, data from other IQ tests such as the Wechsler tests for children and adults (WPPSI; WISC; WAIS; WASI); the Stanford-Binet Intelligence Scale (SBIS); the Cattell Cultural Fair Tests (CFT); and some others, were added to the dataset. On July 9^th, 2017, the NIQ-DATASET version 1.0 was born, followed by version 1.1 on October 22^nd with improved calculations and corrections.

Fig. 2: Scatterplot between original national IQs (horizontal axis) and the new national IQs (vertical axis) (r = .83; N(nat.) = 92; N(samp.) = 691(L&V)|347(rev.); p < 0.001). Red marks debatable cases.

New insights and old problems made the current version 1.2 necessary, which contains some structural changes. This version has calculated IQs for 124 nations (presented in Fig. 3), based on 543 datapoints. A comparison with the national IQs from Lynn and Vanhanen (2012), which unlike the IQs in the present dataset were partially based on international school assessment tests, showed great similarities but also differences. Using version 1.2, a correlation of .87 (N = 445; p < 0.001) was calculated between Lynn and Vanhanen’s estimated IQs and my estimated IQs. For the same datasets; a correlation of .86 (N = 123; p < 0.001) was calculated between Lynn and Vanhanen’s estimated IQs and my estimated IQs for the same nations. Furthermore, I calculated a mean IQ for 124 nations of 84.90 (SD = 12.72) or 88.12 if the nations’ IQs were weighted by population.

Since one of my intentions was to make all the methods for calculations and corrections comprehensible, I started to write a manual, which helped for understanding but turned out to be not suitable to go into all details, to include all ideas, or to help springboard discussions. So, when a post on James Thompson’s blog (Thompson, 2017) was published about this project and gained a lot of attention, praise, and criticism, it gave me the incentive to start a blog of my own. The blog aims to present, explain, and discuss this project in a more detailed way, and to document its progress step by step.

Fig.3: National IQs from the NIQ-DATASET (V1.2).

About the Dataset and Blog

This blog is divided into two sections.

(1) The database section contains the current, as well as all earlier, versions of the National IQ dataset, starting with V1.1. The folders contain the main dataset in an Excel file together with a few notes about methods and updates, and also some additional and changing material, like maps, graphs, etc.

(2) The blog will be used to take a closer look at important issues, findings, and thematically related things, and to allow open discussion. Posts will be published irregularly to document the current state of the national IQ project.

Everything here should follow the principle of an open source project, and not a single number or calculation should be left in the “black box”. Another principle is the use of standardized methods. Special methods for individual cases should be avoided. All uploaded data are available for free use. All this is intended to guarantee clarity and traceability. I hope to meet the needs of interested people as much as possible, and I look forward to readers’ comments and discussions.

With best regards,

David Becker

Sources and References

Becker, D. (2017). National IQs revisited: The first steps of a long-term project. London Conference on Intelligence, 12-14 May 2017.

Becker, D., & Rindermann, H. (2016). The relationship between cross-national genetic distances and IQ-differences. Personality and Individual Differences, 98, 300-310. doi:10.1016/j.paid.2016.03.050

Flynn, J. R. (1984). The mean IQ of Americans: Massive gains 1932 to 1978. Psychological Bulletin, 95, 29-51. doi:10.1037/0033-2909.95.1.29

Gottfredson, L. S. (1994). Mainstream science on intelligence: An editorial with 52 signatories, history, and bibliography. Intelligence, 24, 13-23. doi:10.1016/S0160-2896(97)90011-8

Hunt, E., & Sternberg, R. J. (2006). Sorry, wrong numbers: An analysis of a study of a correlation between skin color and IQ. Intelligence, 34, 131-137. doi:10.1016/j.intell.2005.04.004

León, F. R., & Burga-León, A. (2015). How geography influences complex cognitive ability. Intelligence, 50, 221-227. doi:10.1016/j.intell.2015.04.011

Lynn, R. (1982). IQ in Japan and the United States shows a growing disparity. Nature, 297, 222-223. doi:10.1038/297222a0

Lynn, R. (2006). Race Differences in Intelligence: An Evolutionary Analysis. Augusta, GA: Washington Summit Publishers.

Lynn, R. (2008). The Global Bell Curve: Race, IQ, and Inequality Worldwide. Augusta, GA: Washington Summit Publishers.

Lynn, R. (2017). Personal Homepage. From: http://www.rlynn.co.uk/

Lynn, R., & Vanhanen, T. (2002). IQ and the Wealth of Nations. Westport, CT: Praeger.

Lynn, R., & Vanhanen, T. (2012a). Intelligence: A Unifying Construct for the Social Sciences. London: Ulster Institute for Social Research.

Lynn, R., & Vanhanen, T. (2012b). National IQs: A review of their educational, cognitive, economic, political, demographic, sociological, epidemiological, geographic and climatic correlates. Intelligence, 40, 226-234. doi:10.1016/j.intell.2011.11.004

Malloy, J. (2013-2014). HVGIQ. From: http://humanvarieties.org/

Piffer, D. (2013). Factor analysis of population allele frequencies as a simple, novel method of detecting signals of recent polygenic selection: The example of educational attainment and IQ. The Mankind Quarterly, 65, 168-184.

Piffer, D. (2015). A review of intelligence GWAS hits: Their relationship to country IQ and the issue of spatial autocorrelation. Intelligence, 53, 43-50. doi:10.1016/j.intell.2015.08.008

Pietschnig, J., & Voracek, M. (2015). One century of global IQ gains: A formal meta-analysis of the Flynn Effect (1909–2013). Perspectives on Psychological Science, 10, 282-306. doi:10.1177/1745691615577701

Rindermann, H., Woodley, M. A., & Stratford, J. (2012). Haplogroups as evolutionary markers of cognitive ability. Intelligence, 40, 362-375. doi:10.1016/j.intell.2012.04.002

Templer, D. I., & Arikawa, H. (2016). Temperature, skin color, per capita income, and IQ: An international perspective. Intelligence, 34, 121-139. doi:10.1016/j.intell.2005.04.002

Thompson, J. (2017). The World’s IQ = 86. Test results of 550,492 individuals in 123 countries. From: http://www.unz.com/jthompson/the-worlds-iq-86/

Vanhanen, T. (2003). Democratization: A comparative analysis of 170 countries. London; New York: Routledge.

Wicherts, J. M., Dolan, C. V., & van der Maas, H. L. J. (2010). A systematic literature review of the average IQ of sub-Saharan Africans. Intelligence, 38, 1-20. doi:10.1016/j.intell.2009.05.002