ArXiv bans authors 1 year for unchecked AI-generated papers
One-strike rule: hallucinated references or LLM comments = immediate ban.
ArXiv, the widely used open repository for preprint research, is intensifying its crackdown on the careless use of large language models in scientific papers. In a move announced Thursday by Thomas Dietterich, chair of ArXiv's computer science section, the policy targets papers that show "incontrovertible evidence" that authors did not check the results of LLM generation. Such evidence includes hallucinated references, inappropriate language, plagiarized content, and even stray comments to or from the LLM. Authors caught violating this rule will receive a one-year ban from ArXiv, after which they must first have their subsequent submissions accepted by a reputable peer-reviewed venue before posting. Dietterich described it as a "one-strike" rule, though moderators must flag issues and section chairs confirm the evidence before imposing the penalty; authors can appeal.
This policy is not an outright prohibition on using LLMs—rather, it insists that authors take "full responsibility" for content irrespective of how it is generated. The move comes as ArXiv, hosted by Cornell for over 20 years, transitions to an independent nonprofit structure to raise more funds to address issues like AI-generated slop. Recent peer-reviewed research has already documented a rise in fabricated citations in biomedical research, likely due to LLMs. By enforcing accountability, ArXiv aims to preserve trust in its repository, which serves as a primary distribution channel for cutting-edge research in fields like computer science, mathematics, and physics. The policy underscores a growing tension between leveraging AI tools for efficiency and maintaining scientific rigor.
- One-year ban for authors with incontrovertible evidence of unchecked LLM use (e.g., hallucinated references).
- After ban, authors must have papers accepted by a reputable peer-reviewed venue before posting on ArXiv again.
- Policy is a one-strike rule with moderator flags, section chair confirmation, and an appeal process.
Why It Matters
Cracks down on AI-slurry papers, preserving preprint credibility for the global research community.