Study Finds a Third of New Websites Are AI-Generated
A study by researchers from Stanford, Imperial College London, and the Internet Archive found that 35% of new websites since 2022 are AI-generated or AI-assisted. The research analyzed web content from August 2022 to May 2025 and found AI-generated text is less diverse semantically and more positive in tone. Contrary to concerns, the study did not find evidence that AI-generated content increases false statements or fails to cite sources.
- ▪Researchers found that 35% of newly published websites by mid-2025 were AI-generated or AI-assisted.
- ▪The study used Pangram v3, an AI-detection tool, to identify AI-generated websites from archived snapshots via the Wayback Machine.
- ▪AI-generated text was found to be more positive and less semantically diverse, but did not show increased falsehoods or lack of source citations.
- ▪Human fact-checkers were employed to verify factual claims in AI-generated content as part of the study.
- ▪The research team plans to develop a continuous monitoring tool with the Internet Archive to track AI's impact on the web.
Opening excerpt (first ~120 words) tap to expand
Researchers working with data from the Internet Archive have discovered that a third of websites created since 2022 are AI-generated. The team of researchers—which includes people from Stanford, the Imperial College London, and the Internet Archive—published their findings online in a paper titled “The Impact of AI-Generated Text on the Internet.” The research also found that all this AI-generated text is making the web more cheery and less verbose.Inspired by the Dead Internet Theory—the idea that much of the internet is now just bots talking back and forth—the team set out to find out how ChatGPT and its competitors had reshaped the internet since 2022.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at 404 Media.