Data20 May 2026

Data Fabrication Retraction Rates 2026: Country Rankings

Data fabrication retraction rates per 10,000 publications, ranked across 39 countries. China leads globally; Nordic and Q1 countries score lowest. Built from Retraction Watch data.

TL;DR

Data fabrication (D6) rankings from AMI v1.5: China 100 (highest), Russia 78, India 70, Iran 65, Pakistan 65, Egypt 60. Lowest: New Zealand 12, Sweden 15, Norway 15, Ireland 15, Netherlands 15. Built from Retraction Watch 69,911 records normalised by publication volume.

data fabricationRetraction WatchD6datarankings 2026

TL;DR

D6 (data fabrication) rankings from AMI v1.5. Built from Retraction Watch (69,911 records, 5,390 misconduct-linked) normalised by publication volume from OpenAlex. China (D6=100) anchors the top; Nordic and Q1 countries score lowest. Detection-incidence confound applies.

Full D6 rankings

Top — highest D6 scores

Country	D6	Note
China	100	Highest globally
Russia	78	Dissernet documentation
India	70	Major research producer
Iran	65	Post-2010s output growth
Pakistan	65	Documented HEC cases
Egypt	60	Cairo/Ain Shams retractions
Nigeria	55	African leader
South Korea	55	Post-Hwang context
Malaysia	50	Regional pattern
Turkey	50	YÖK reforms ongoing

Middle — moderate D6 scores

Country	D6
Indonesia	45
Saudi Arabia	45
Brazil	40
Ukraine	35
Italy	35
Mexico	32
Japan	30
Poland	30
Philippines	30
South Africa	30
US	30
Thailand	30

Lower — lower D6 scores

Country	D6
Spain	28
France	25
Canada	22
Vietnam	22
Germany	20
Kenya	20
Singapore	20
Australia	18
UK	18

Bottom — lowest D6 scores

Country	D6
New Zealand	12
Sweden	15
Norway	15
Netherlands	15
Ireland	15
Colombia	0
Argentina	0
Greece	0

Note: Colombia, Argentina, and Greece show 0 because their absolute Retraction Watch presence is very low relative to publication volume. The 0 reflects the rescaling rather than literal zero retractions.

Methodology

The D6 dimension is built from Retraction Watch data:

Step 1 — Filter to misconduct-linked retractions

Retraction Watch records include retractions for many reasons (errors, duplicate publication, ethics issues, requests, misconduct). The AMI methodology filters to misconduct-linked retractions:

Fabrication
Falsification
Image manipulation
Plagiarism (in research context)
Fraud
Manipulation of peer review

The filter retains ~5,390 of the 69,911 total retractions.

Step 2 — Country attribution

Each retraction is attributed to a country based on author affiliations. Multi-country papers are attributed proportionally.

Step 3 — Normalise by publication volume

Retraction counts are divided by total publications from OpenAlex for the same country and time period (2016–2025 [verify exact window]). The result is retractions per 10,000 publications.

Step 4 — Rescale

The rates are rescaled to 0–100 across the 39-country set, with China's rate (the highest) anchoring 100.

Why China scores 100

China's D6 score of 100 reflects:

Largest research output globally — China is now the world's largest producer of academic papers
Documented paper mill industry — Fang, Steen & Casadevall (2012, PNAS) established systematic patterns
Liang et al. (2024) — Nature study found 6.3–17.5% of Chinese papers contain detectable AI-generated content
High absolute retraction count — even after normalising by publication volume, China leads

The score reflects both the scale of the problem and the relative effectiveness of detection. The actual misconduct rate may be higher still given undetected cases.

Why Russia scores 78

Russia's D6 score of 78 reflects:

Dissernet documentation — 10,000+ plagiarised dissertations identified
Retraction Watch entries — Russian-affiliated retractions per publication are elevated
Limited institutional response — even when misconduct is identified, consequences are often limited

Why some low-output countries score 0

Colombia, Argentina, and Greece score D6=0. This does not mean zero misconduct — it means their absolute retraction count is very small relative to publication volume, and the rescaling produces 0 at the bottom of the distribution.

For these countries, the methodology limitation is that the Retraction Watch dataset is partly biased by:

Higher-volume publishers being more represented (typically English-language)
Sting operations and systematic detection efforts being concentrated on high-volume producers

Latin American and Greek research output is smaller in absolute terms; detection effort is correspondingly less concentrated. Future methodology iterations may address this through different normalisation approaches.

The detection-incidence confound

Countries with strong post-publication review culture (Netherlands, Sweden, Germany) catch more cases. The Macchiarini case (Karolinska, Sweden) demonstrated how a country can have very public misconduct that produces strong response — improving R-Score — while also having very low underlying base rate.

Countries with weak post-publication review may have undetected misconduct. Their low D6 scores partly reflect under-detection rather than genuine absence of misconduct.

The AMI methodology applies a detection correction in the dimension scoring but the fundamental confound remains. Cross-country comparison of D6 scores should weight this consideration.

Time trends

The Retraction Watch database has grown substantially over the past decade:

2010: ~5,000 records
2015: ~10,000 records
2020: ~30,000 records
2024: ~70,000 records [verify exact 2024 count]

Growth reflects both increasing actual misconduct detection and increasing database coverage. The acceleration in retractions in 2020–2024 was partly driven by systematic paper mill detection efforts following Crossref initiatives.

Sources

Retraction Watch Database, Crossref/GitLab (April 2026)
OpenAlex publication counts
Fang, Steen & Casadevall (2012), PNAS
Liang et al. (2024), Nature
AMI v1.5 methodology

Full methodology | Download dataset

Explore the full dataset

Frequently asked questions

Which country has the highest data fabrication retraction rate?

China scores 100 on the AMI's D6 dimension — the highest rate of misconduct-linked retractions per 10,000 publications in the dataset. Russia follows at 78, India at 70, Iran and Pakistan at 65 each, Egypt at 60. The pattern correlates with both research output volume and the strength of institutional integrity infrastructure.

How is the AMI's D6 dimension calculated?

The Retraction Watch database is filtered to misconduct-linked retractions (fabrication, falsification, fraud, image manipulation). Each retraction is country-attributed by author affiliation. Counts are then divided by total publications from OpenAlex for the same country and time period, producing a retractions-per-10,000-publications rate. The rates are rescaled to 0–100 across the 39-country set, with China's rate anchoring 100.

Does a low D6 score mean no research misconduct?

No — D6 measures *detected and retracted* misconduct. Countries with strong post-publication review culture (Netherlands, Sweden, Norway) detect and retract more, but also produce less of the underlying misconduct. Countries with weak post-publication review may have undetected misconduct that does not appear in retraction data. The methodology applies a detection correction but the fundamental confound remains.

How to cite this article

APA: Booth, F. (2026). Data Fabrication Retraction Rates 2026: Country Rankings. Academic Misconduct Index. https://academicmisconductindex.com/blog/data-fabrication-retraction-rates

BibTeX: @misc{booth2026data, author={Booth, Francisco}, title={Data Fabrication Retraction Rates 2026: Country Rankings}, year={2026}, url={https://academicmisconductindex.com/blog/data-fabrication-retraction-rates}}

Francisco Booth

Independent researcher, founder of the Academic Misconduct Index

Country Profile

China: Academic Misconduct Index Country Profile

Data

Academic Plagiarism Statistics by Country 2026

Data

ChatGPT and Academic Cheating: What the Data Actually Shows

← Back to all posts

Data Fabrication Retraction Rates 2026: Country Rankings

TL;DR

Full D6 rankings

Top — highest D6 scores

Middle — moderate D6 scores

Lower — lower D6 scores

Bottom — lowest D6 scores

Methodology

Step 1 — Filter to misconduct-linked retractions

Step 2 — Country attribution

Step 3 — Normalise by publication volume

Step 4 — Rescale

Why China scores 100

Why Russia scores 78

Why some low-output countries score 0

The detection-incidence confound

Time trends

Sources

Related

Frequently asked questions

Which country has the highest data fabrication retraction rate?

How is the AMI's D6 dimension calculated?

Does a low D6 score mean no research misconduct?