The policy does not prohibit researchers from using AI tools entirely. Authors can still use language models for drafting, editing, or analytical support. What triggers enforcement are visible failures that indicate a paper was submitted without meaningful human oversight, including hallucinated citations, fabricated references, placeholder notes, or chatbot instructions accidentally left inside the manuscript.
Examples cited by Dietterich include references to publications that do not exist, unfinished instructions such as “fill in with the real numbers from your experiments,” or meta-comments from AI systems that were never removed before submission. If moderators identify those issues and a section chair confirms the finding, the author can receive a one-year submission ban. After that period, the researcher’s future submissions must first be accepted by a peer-reviewed journal before appearing on arXiv.
The move reflects growing pressure on academic publishing systems as generative AI tools make it easier to rapidly produce scientific-looking papers with minimal human verification. Although arXiv does not conduct formal peer review, it serves as one of the most important distribution channels for early-stage research in fields including machine learning, computer science, mathematics, and physics. Papers posted to arXiv are frequently cited and circulated before formal publication, meaning fabricated references or inaccurate claims can spread quickly through research communities.
A recent study published in The Lancet by researchers at Columbia University highlighted the scale of the problem. The researchers analyzed 2.5 million biomedical papers and found fabricated citations rising sharply since 2023. According to the study, the rate increased from roughly one paper in 2,828 containing fake references in 2023 to one in 458 by 2025. During the opening weeks of 2026, the rate climbed further to one in 277 papers.
The study linked the increase to wider use of AI writing systems, noting earlier research estimating that between 30% and 69% of LLM-generated biomedical references are fabricated. ArXiv’s moderation system was not originally designed to screen large volumes of machine-generated submissions. The repository processes thousands of papers each month and relies heavily on volunteer moderators.
Dietterich described the new enforcement approach as a “one-strike” rule, though decisions can be appealed and require confirmation from section leadership before penalties are imposed. Rather than attempting to ban AI-assisted writing broadly, arXiv’s policy focuses on provable negligence. The platform’s existing rules already state that authors retain “full responsibility” for submitted content regardless of how it was produced. The new penalties effectively formalize that principle for the generative AI era.
That narrower scope also avoids the technical and practical problems associated with AI detection systems, which remain unreliable and prone to false positives. Instead of trying to determine whether AI tools were used at all, moderators only need to identify obvious evidence that the author failed to review the material before submission.
The issue extends beyond arXiv. Major computer science conferences including NeurIPS and ICML have reported rising volumes of submissions that appear to rely heavily on automated text generation with limited human oversight. Nature also warned recently that AI-generated “slop” is overwhelming reviewers and reducing the overall quality of submissions in some technical fields.
Even peer-reviewed journals are struggling. The Lancet study found fabricated citations inside papers that had already passed review, raising concerns that reviewers themselves may be unable to reliably identify AI-generated inaccuracies at current submission volumes. Lead author Maxim Topaz warned that clinicians and guideline developers may unknowingly rely on evidence that does not actually exist.
ArXiv is also undergoing broader institutional changes that could shape how it responds to the issue going forward. After operating under Cornell University for more than two decades, the platform is transitioning into an independent nonprofit organization. The shift could give arXiv greater flexibility to develop moderation policies and fund additional infrastructure focused on quality control.
The platform has already added new submission safeguards, including requiring first-time authors to obtain endorsements from established contributors before posting papers. Still, the new rule has clear limits. It targets the most careless forms of AI misuse rather than deeper scientific misconduct. Researchers who use AI systems to generate plausible but inaccurate claims, fabricate data, or produce technically polished but scientifically weak work may still evade detection if the final paper appears coherent.
What the policy ultimately establishes is a stricter accountability standard around authorship. ArXiv is signaling that regardless of how heavily researchers rely on AI systems during the writing process, authors remain responsible for verifying every citation, claim, and sentence attached to their names.
This analysis is based on reporting from TNW | Artificial-Intelligence.
Image courtesy of Unsplash.
This article was generated with AI assistance and reviewed for accuracy and quality.