Guardian FOI Investigation: 7,000 UK Students Caught Using AI
The Guardian's 2025 FOI investigation produced the largest single confirmed-case dataset on AI academic misconduct globally. Nearly 7,000 UK students caught in one academic year. Here is what the data showed and how the AMI uses it.
TL;DR
The Guardian Freedom of Information investigation (June 2025) showed nearly 7,000 UK university students were formally caught using AI tools in 2023–24 — a rate of 5.1 per 1,000 students. The data is the largest single confirmed-case dataset for AI misconduct globally and is incorporated into the AMI's D2 dimension methodology.
TL;DR
The Guardian Freedom of Information investigation (June 2025) showed nearly 7,000 UK university students were formally caught using AI tools in 2023–24 — 5.1 per 1,000 students. The data is the largest single confirmed-case dataset on AI misconduct globally and contributes to the AMI's D2 dimension methodology.
What the Guardian did
In early 2025, The Guardian submitted Freedom of Information requests to UK universities asking for statistics on formal academic misconduct cases involving AI tools for the 2023–24 academic year. The investigation:
- Scope: most UK universities responded
- Output: nearly 7,000 confirmed cases across the responding institutions
- Rate: 5.1 per 1,000 students (formally caught and processed)
- Categories: ChatGPT, Bard, Claude, Gemini, and other AI tool use treated as academic misconduct
The investigation was published in June 2025 as a long-form piece with institution-level breakdowns and editorial analysis.
Why the data matters
First systematic national-level confirmed-case dataset
Before the Guardian investigation, AI misconduct data was fragmented — individual institutions reported numbers, anecdotal evidence circulated, but no systematic national picture existed.
The Guardian data provided:
- A national-level confirmed-case rate for the first time
- Institution-level breakdowns enabling cross-institutional comparison
- A baseline for measuring whether incidence is growing or falling
Detection-incidence reasoning
The 5.1/1000 confirmed rate combined with the Scarfe et al. (2024) detection finding (94% of AI submissions undetected) suggests:
- 5.1/1000 detected × (1/0.06) detection factor = 85/1000 true rate estimate
- This implies an ~8.5% true incidence rate
The AMI methodology applies similar detection corrections in calculating D2 scores from observed signals.
Institution-level variation
The Guardian data showed substantial variation across UK universities:
- Some institutions reported very high rates (e.g. above 10/1000)
- Others reported very low rates (below 1/1000)
The variation likely reflects differences in:
- Detection investment and deployment
- Institutional reporting practices
- Specific course types and assessment design
- Student population composition
The variation is itself informative — it suggests detection capability is the principal determinant of reported rates, with actual incidence relatively more uniform across institutions.
Russell Group FOI follow-up
Times Higher Education published a parallel investigation focused specifically on UK Russell Group universities. The Russell Group data:
- Showed lower per-student rates than the broader UK pattern [verify specific Russell Group rate]
- Reflected stronger detection infrastructure at research-intensive institutions
- Provided cross-institutional comparison among peer institutions
The Russell Group pattern is consistent with the broader hypothesis: stronger detection capability produces more reported cases (in jurisdictions where detection drives reporting), while actual incidence varies less.
How the AMI uses the data
Direct UK D2 contribution
The AMI's D2 dimension for the UK is partially anchored on the Guardian confirmed-case rate. The UK D2 score of 44 reflects:
- Moderate Google Trends signal
- Confirmed-case rate from FOI
- Detection capability adjustment
The UK D2 is lower than the maxed-D2 countries (Colombia, Argentina, etc.) partly because of the confirmed-case data providing actual rates rather than relying on demand signals alone.
Cross-country calibration
The Guardian data provides a calibration point for D2 across the dataset. Countries without confirmed-case data are scored on demand signals (Google Trends), but the Guardian baseline lets the methodology estimate the demand-to-incidence translation more confidently.
Methodology improvement
Future AMI versions will incorporate similar FOI data as it becomes available in other jurisdictions:
- Australia (FOI submissions reportedly in progress)
- US (state-level FOI variability)
- Other Anglophone systems
The Guardian investigation effectively demonstrated FOI as a viable data source for AI misconduct measurement.
Why the UK could produce this data
The UK FOI Act applies to universities (most are public-funded). Combined with:
- Active investigative journalism (the Guardian's higher education team)
- Substantial UK university sector size enabling meaningful statistical analysis
- Standardised institutional misconduct reporting categories
The UK is structurally well-positioned to produce this kind of national dataset. Other countries with weaker FOI laws, less centralised university systems, or less active education journalism cannot easily replicate.
What the data does not show
Undetected misconduct
The 5.1/1000 rate is *confirmed-case* — students formally identified and processed for misconduct. The Scarfe correction (94% undetected) suggests the actual rate is substantially higher.
Severity gradient
Confirmed cases include a wide range of misconduct severity — from minor undisclosed AI assistance through wholesale AI-generated work submitted as original. The Guardian data does not differentiate.
Course type variation
Different subject areas likely have very different AI misconduct rates. Essay-based humanities and social science courses are likely more affected than mathematics or laboratory-based STEM courses. The Guardian data aggregates across course types.
Trend data
The Guardian data covers 2023–24. The 2024–25 data (still being collected) will be more informative for trend analysis.
Broader significance
The Guardian investigation established that:
- FOI is a viable data source for AI misconduct measurement
- Institutional variation in reporting reflects detection capability
- Confirmed-case rates substantially understate true incidence (Scarfe correction)
- AI misconduct is a measurable population-level phenomenon, not an anecdotal one
The reporting accelerated UK policy discussion of AI in education and contributed to detection tool investment decisions at institutions across the sector.
Sources
- The Guardian (June 2025), FOI investigation [verify specific article reference]
- Times Higher Education Russell Group FOI reporting [verify]
- Scarfe, P., et al. (2024), University of Reading detection study
- AMI v1.5 D2 dimension methodology
Frequently asked questions
What did the Guardian FOI investigation find?
The Guardian (June 2025) submitted Freedom of Information requests to UK universities asking for formal academic misconduct case statistics involving AI tools for 2023–24. The combined data showed nearly 7,000 confirmed cases — a rate of 5.1 per 1,000 UK higher education students. The data was the largest single confirmed-case dataset on AI misconduct globally.
How does the AMI use the Guardian FOI data?
The Guardian data provides one of the few confirmed-case (rather than demand-signal) data sources for the AMI's D2 dimension. The UK score is partially anchored on this confirmed-case rate. The 5.1/1000 rate, combined with the Scarfe et al. (2024) finding that 94% of AI submissions go undetected, suggests true rates above 8%. Both data points contribute to the methodology.
Have other countries done similar FOI investigations?
Times Higher Education has published similar FOI data specifically for Russell Group universities in the UK. Other Anglophone countries have had similar journalism — Australian, New Zealand, and US investigations of varying scale — but the Guardian's 2025 investigation remains the largest single dataset covering most institutions in a national higher education system.
How to cite this article
APA: Booth, F. (2026). Guardian FOI Investigation: 7,000 UK Students Caught Using AI. Academic Misconduct Index. https://academicmisconductindex.com/blog/guardian-foi-investigation
BibTeX: @misc{booth2026guardian, author={Booth, Francisco}, title={Guardian FOI Investigation: 7,000 UK Students Caught Using AI}, year={2026}, url={https://academicmisconductindex.com/blog/guardian-foi-investigation}}
Francisco Booth
Independent researcher, founder of the Academic Misconduct Index
Related posts