Weekly Dispatch · archived
Weekly Dispatch · Week 17 of 2026
Crawling & Publisher Controls
This week's key developments include Cloudflare's new 'Redirects for AI Training' feature, which enforces canonical content for AI crawlers, and a ProcessWire module enabling direct management of robots.txt to block various AI training bots.
- Redirects for AI Training enforces canonical content - The Cloudflare Blog
Cloudflare introduced a new feature to redirect AI crawlers to canonical content, preventing them from ingesting deprecated information, supported by server-log analysis.
"In the first seven days, 100% of AI training crawler requests to pages with non-self-referencing canonical tags were redirected and were not served with deprecated content."
- RobotsTxt — Manage robots.txt from the admin UI - Modules/Plugins - ProcessWire
A new ProcessWire module allows administrators to directly manage robots.txt, including specific rules to block various AI training bots like GPTBot and ClaudeBot.
"Block AI training bots (GPTBot, CCBot, anthropic-ai, Google-Extended, FacebookBot, Omgilibot)"
Agents
This week's reporting highlights the emergence of self-improving open-source AI agent infrastructure for enterprise deployment, alongside critical warnings regarding AI-driven security vulnerabilities and supply chain risks. A practical example showcases an AI agent's application in security auditing and risk identification.
- Hermes Agent v0.10: Self-Improving Open-Source AI Agent
This report details an open-source, self-improving AI agent framework, highlighting its persistent memory, learning, and enterprise deployment advantages over hosted or orchestration-only solutions.
"Hermes is the first fully MIT-licensed Tier 3 runtime."
- Chrome Skills: Getting Started with Google Cloud examples
This article demonstrates an AI mentor agent designed for SREs and Security Auditors to identify risks, breaking changes, and deprecations in Google Cloud release notes, suggesting immediate mitigation actions.
"Your goal is to identify risks in the release notes. Scan the text specifically for the following keywords: "Breaking," "Deprecation," "Security," "Vulnerability," "CVE," and "Action Required.""
Copyright & Legal
This week saw significant developments in AI copyright, including a major $1.5 billion settlement by Anthropic for using pirated training data, ongoing legal battles involving OpenAI and other AI companies, and policy shifts in the UK against broad AI text and data mining exceptions. Regulatory and academic discussions also focused on clarifying secondary liability for online platforms and the broader legal indeterminacy in AI governance.
- AI Music Copyright: Legal Risks Content Creators Must Know (2026)
Analyzes legal risks for AI music creators, citing US Copyright Office rulings, UK policy shifts against opt-out, and major industry settlements.
"The US Copyright Office released its definitive guidance on AI-generated content, and the news isn't good for AI music generators. The ruling is crystal clear: 100% AI-generated content cannot be copyrighted and falls into the public domain."
- Court Rules in Favor of Public Access to Information
Reports on a US appeals court ruling supporting public access to building standards incorporated into law, highlighting the importance of the government edicts doctrine.
"In a win for information advocates and the public, the US Court of Appeals for the Third Circuit ruled that the American Society for Testing & Materials (ASTM) was unlikely to succeed on its claim that online research platform UpCodes infringed copyright by posting ASTM building standards on its website."
- Anthropic settlement draws nearly 120,000 author claims
Details the Anthropic $1.5 billion settlement with authors for using pirated books, emphasizing the distinction between data acquisition and AI training legality.
"Nearly 120,000 authors and other copyright holders have filed claims to share in Anthropic's US$1.5 billion class-action settlement over the unauthorized use of books in AI training, with claims covering 91% of more than 480,000 works, court filings in California show."
- Developer policy update: Intermediary liability, copyright, and transparency
Discusses a US Supreme Court decision clarifying secondary copyright liability for online services and its implications for developer platforms and DMCA reviews.
"The Court's opinion reinforced that service providers are not automatically liable for copyright infringement by users without evidence of intent to encourage or materially contribute to infringement."
- EPIC Files Amicus Brief Countering Big Tech Claim that Surveillance-Based Feeds Are Protected by the First Amendment
Reports on EPIC's amicus brief defending a South Carolina law against tech industry claims that behavioral profiling in algorithmic feeds is protected speech.
"Our amicus brief explains that behavioral profiling in recommendation systems is not expressive, and companies cannot claim First Amendment protection for their non-expressive conduct simply by combining it with something that is expressive (e.g., content moderation)."
Web Ecosystem & AI Impact
This week's discourse highlights the ongoing challenge of AI Overviews severely impacting publisher traffic and revenue, prompting calls for new monetization strategies like content licensing and regulatory intervention. Concerns are also rising regarding AI's effect on indigenous content, minority languages, and the viability of small publishers.
- Google AI Overviews Impact on Publishers Explained
Analysis of AI Overviews reducing publisher traffic and revenue, advocating for collective content licensing deals.
"When an AI Overview appeared, users clicked a traditional search result just 8% of the time. Without an AI Overview, they clicked 15% of the time, nearly twice as often."
- AI Overviews & the Decline of Organic Traffic: What Smart Marketers Need to Do Now
Examines how AI Overviews accelerate organic traffic decline, necessitating new strategies beyond last-click organic sessions.
"Ahrefs analyzed 300,000 keywords and reported that the presence of an AI Overview correlated with a 34.5% lower average click-through rate for the top-ranking page."
- Le Monde CEO urges publishers to sign AI partnerships to stay competitive
Le Monde's CEO advocates for AI licensing deals, citing significant new revenue and subscriber conversions from their partnerships.
"The French daily newspaper and website has seen a “significant amount of new revenue” including via conversions to paid subscriptions after signing three AI partnership agreements."
- AI generating fake Indigenous content
Experts express concern over AI generating false Indigenous content, potentially harming language revitalization and cultural reconnection efforts.
"AI generated Indigenous content is drawing attention away from authentic Indigenous context due to the flood of misinformation AI can generate."
- AI Search Engine Statistics 2026: Market Share Data
Presents 2026 statistics on AI Overviews' asymmetric traffic impact on publishers, with significant drops for informational queries.
"non-branded informational query traffic has dropped 15-30% across content sites year over year. Transactional and branded query traffic has held up far better."
- Inside TIME's rollout of its TIMEAI interactive agent
TIME is building its own AI agent to leverage its archives, deepen engagement, and ensure content attribution beyond licensing deals.
"TIME isn't just licensing its journalism to AI platforms; it has built its own agent to keep users on-site, deepen engagement, and prove that trusted archives can power differentiated AI experiences."
- OpenAI Is Not a Lab Anymore. It's a Media Company. | The Gradient | Daily AI Mail
Argues OpenAI is now a media company, critiquing its rapid content licensing, advertising, and owned media strategy that displaces publishers.
"OpenAI is paying publishers for content while building a platform that redirects the audience that content depends on. Those two activities are not in tension — they are structurally connected."
- What is Brief History of Criteo Company? - Matrix BCG
Criteo's evolution to a commerce media platform, leveraging first-party data monetization and generative AI for ad creative and bidding.
"Criteo's shift to first-party data and publisher partnerships aims to sustain double-digit growth in retail media despite cookie deprecation."
- Wired Italia is closing down – a sign of a bigger problem
Commentary on Wired Italia's closure highlighting broader challenges for niche media, including AI-generated content and the struggle for financial support.
"Today, however, readers are put off by content that is clearly written by AI, as fewer and fewer people are willing to devote their attention and time to reading content that no human has had the time or inclination to produce."
- GEO drives new brand-media deals - Axios
Brands are investing in generative engine optimization (GEO) to gain organic media coverage and reach consumers through AI tools.
"The rise of chatbots and AI search engines is forcing big brands to invest in new generative engine optimization that favors third-party validation, as opposed to traditional search engine optimization."