Data Fabrication Retraction Rates 2026: Country Rankings
Data fabrication retraction rates per 10,000 publications, ranked across 39 countries. China leads globally; Nordic and Q1 countries score lowest. Built from Retraction Watch data.
TL;DR
Data fabrication (D6) rankings from AMI v1.5: China 100 (highest), Russia 78, India 70, Iran 65, Pakistan 65, Egypt 60. Lowest: New Zealand 12, Sweden 15, Norway 15, Ireland 15, Netherlands 15. Built from Retraction Watch 69,911 records normalised by publication volume.
TL;DR
D6 (data fabrication) rankings from AMI v1.5. Built from Retraction Watch (69,911 records, 5,390 misconduct-linked) normalised by publication volume from OpenAlex. China (D6=100) anchors the top; Nordic and Q1 countries score lowest. Detection-incidence confound applies.
Full D6 rankings
Top — highest D6 scores
| Country | D6 | Note |
|---|---|---|
| China | 100 | Highest globally |
| Russia | 78 | Dissernet documentation |
| India | 70 | Major research producer |
| Iran | 65 | Post-2010s output growth |
| Pakistan | 65 | Documented HEC cases |
| Egypt | 60 | Cairo/Ain Shams retractions |
| Nigeria | 55 | African leader |
| South Korea | 55 | Post-Hwang context |
| Malaysia | 50 | Regional pattern |
| Turkey | 50 | YÖK reforms ongoing |
Middle — moderate D6 scores
| Country | D6 |
|---|---|
| Indonesia | 45 |
| Saudi Arabia | 45 |
| Brazil | 40 |
| Ukraine | 35 |
| Italy | 35 |
| Mexico | 32 |
| Japan | 30 |
| Poland | 30 |
| Philippines | 30 |
| South Africa | 30 |
| US | 30 |
| Thailand | 30 |
Lower — lower D6 scores
| Country | D6 |
|---|---|
| Spain | 28 |
| France | 25 |
| Canada | 22 |
| Vietnam | 22 |
| Germany | 20 |
| Kenya | 20 |
| Singapore | 20 |
| Australia | 18 |
| UK | 18 |
Bottom — lowest D6 scores
| Country | D6 |
|---|---|
| New Zealand | 12 |
| Sweden | 15 |
| Norway | 15 |
| Netherlands | 15 |
| Ireland | 15 |
| Colombia | 0 |
| Argentina | 0 |
| Greece | 0 |
Note: Colombia, Argentina, and Greece show 0 because their absolute Retraction Watch presence is very low relative to publication volume. The 0 reflects the rescaling rather than literal zero retractions.
Methodology
The D6 dimension is built from Retraction Watch data:
Step 1 — Filter to misconduct-linked retractions
Retraction Watch records include retractions for many reasons (errors, duplicate publication, ethics issues, requests, misconduct). The AMI methodology filters to misconduct-linked retractions:
- Fabrication
- Falsification
- Image manipulation
- Plagiarism (in research context)
- Fraud
- Manipulation of peer review
The filter retains ~5,390 of the 69,911 total retractions.
Step 2 — Country attribution
Each retraction is attributed to a country based on author affiliations. Multi-country papers are attributed proportionally.
Step 3 — Normalise by publication volume
Retraction counts are divided by total publications from OpenAlex for the same country and time period (2016–2025 [verify exact window]). The result is retractions per 10,000 publications.
Step 4 — Rescale
The rates are rescaled to 0–100 across the 39-country set, with China's rate (the highest) anchoring 100.
Why China scores 100
China's D6 score of 100 reflects:
- Largest research output globally — China is now the world's largest producer of academic papers
- Documented paper mill industry — Fang, Steen & Casadevall (2012, PNAS) established systematic patterns
- Liang et al. (2024) — Nature study found 6.3–17.5% of Chinese papers contain detectable AI-generated content
- High absolute retraction count — even after normalising by publication volume, China leads
The score reflects both the scale of the problem and the relative effectiveness of detection. The actual misconduct rate may be higher still given undetected cases.
Why Russia scores 78
Russia's D6 score of 78 reflects:
- Dissernet documentation — 10,000+ plagiarised dissertations identified
- Retraction Watch entries — Russian-affiliated retractions per publication are elevated
- Limited institutional response — even when misconduct is identified, consequences are often limited
Why some low-output countries score 0
Colombia, Argentina, and Greece score D6=0. This does not mean zero misconduct — it means their absolute retraction count is very small relative to publication volume, and the rescaling produces 0 at the bottom of the distribution.
For these countries, the methodology limitation is that the Retraction Watch dataset is partly biased by:
- Higher-volume publishers being more represented (typically English-language)
- Sting operations and systematic detection efforts being concentrated on high-volume producers
Latin American and Greek research output is smaller in absolute terms; detection effort is correspondingly less concentrated. Future methodology iterations may address this through different normalisation approaches.
The detection-incidence confound
Countries with strong post-publication review culture (Netherlands, Sweden, Germany) catch more cases. The Macchiarini case (Karolinska, Sweden) demonstrated how a country can have very public misconduct that produces strong response — improving R-Score — while also having very low underlying base rate.
Countries with weak post-publication review may have undetected misconduct. Their low D6 scores partly reflect under-detection rather than genuine absence of misconduct.
The AMI methodology applies a detection correction in the dimension scoring but the fundamental confound remains. Cross-country comparison of D6 scores should weight this consideration.
Time trends
The Retraction Watch database has grown substantially over the past decade:
- 2010: ~5,000 records
- 2015: ~10,000 records
- 2020: ~30,000 records
- 2024: ~70,000 records [verify exact 2024 count]
Growth reflects both increasing actual misconduct detection and increasing database coverage. The acceleration in retractions in 2020–2024 was partly driven by systematic paper mill detection efforts following Crossref initiatives.
Sources
- Retraction Watch Database, Crossref/GitLab (April 2026)
- OpenAlex publication counts
- Fang, Steen & Casadevall (2012), PNAS
- Liang et al. (2024), Nature
- AMI v1.5 methodology
Full methodology | Download dataset
Related
Frequently asked questions
Which country has the highest data fabrication retraction rate?
China scores 100 on the AMI's D6 dimension — the highest rate of misconduct-linked retractions per 10,000 publications in the dataset. Russia follows at 78, India at 70, Iran and Pakistan at 65 each, Egypt at 60. The pattern correlates with both research output volume and the strength of institutional integrity infrastructure.
How is the AMI's D6 dimension calculated?
The Retraction Watch database is filtered to misconduct-linked retractions (fabrication, falsification, fraud, image manipulation). Each retraction is country-attributed by author affiliation. Counts are then divided by total publications from OpenAlex for the same country and time period, producing a retractions-per-10,000-publications rate. The rates are rescaled to 0–100 across the 39-country set, with China's rate anchoring 100.
Does a low D6 score mean no research misconduct?
No — D6 measures *detected and retracted* misconduct. Countries with strong post-publication review culture (Netherlands, Sweden, Norway) detect and retract more, but also produce less of the underlying misconduct. Countries with weak post-publication review may have undetected misconduct that does not appear in retraction data. The methodology applies a detection correction but the fundamental confound remains.
How to cite this article
APA: Booth, F. (2026). Data Fabrication Retraction Rates 2026: Country Rankings. Academic Misconduct Index. https://academicmisconductindex.com/blog/data-fabrication-retraction-rates
BibTeX: @misc{booth2026data, author={Booth, Francisco}, title={Data Fabrication Retraction Rates 2026: Country Rankings}, year={2026}, url={https://academicmisconductindex.com/blog/data-fabrication-retraction-rates}}
Francisco Booth
Independent researcher, founder of the Academic Misconduct Index
Related posts