US publishers tell Common Crawl to stop scraping and delete archive (pressgazette.co.uk) AI
US publisher trade group Digital Content Next has sent a lawyer’s cease-and-desist letter to the Common Crawl Foundation, demanding it stop scraping protected publisher content and delete previously archived material, citing concerns over copyright and delays in honoring opt-out requests. The article says Common Crawl has denied claims of misleading publishers and argues it initiates removal processes and does not “go behind paywalls,” while the dispute is framed against Common Crawl’s role in training major AI systems.
June 09, 2026 14:05
Source: Hacker News