ARC Prize has released details for “ARC-AGI-3,” a new stage of its benchmark/challenge aimed at evaluating progress toward more general AI systems.
Show HN: A plain-text cognitive architecture for Claude Code (lab.puga.com.br) AI
A developer blog post describes a plain-text cognitive architecture concept intended to work with Claude Code.
Show HN: Optio – Orchestrate AI coding agents in K8s to go from ticket to PR (github.com) AI
Show HN introduces Optio, a tool for orchestrating AI coding agents on Kubernetes to turn tickets into pull requests.
“Disregard That” Attacks (calpaterson.com) AI
The post discusses “disregard”/instruction-following attack techniques that can cause systems (e.g., LLMs) to ignore or override intended instructions.
From zero to a RAG system: successes and failures (en.andros.dev) AI
The post explains the process of building a RAG (retrieval-augmented generation) system and shares lessons from both successes and failures.
Elevated error rates on Opus 4.6 (status.claude.com) AI
A status-page incident reports elevated error rates affecting Claude’s Opus 4.6 model/service.
Judge blocks Pentagon effort to 'punish' Anthropic with supply chain risk label (cnn.com) AI
A judge blocks the Pentagon from using a supply-chain risk label to “punish” Anthropic, after the company challenged the move.
Order Granting Preliminary Injunction – Anthropic vs. U.S. Department of War [pdf] (storage.courtlistener.com) AI
A court order grants a preliminary injunction in a legal dispute involving Anthropic and the U.S. Department of War.
Agent-to-agent pair programming (axeldelafosse.com) AI
The post discusses using agent-to-agent collaboration for pair programming using AI agents.
Chroma Context-1: Training a Self-Editing Search Agent (trychroma.com) AI
Chroma publishes research on Context-1, a self-editing search agent designed to improve its own search behavior over time.
HyperAgents: Self-referential self-improving agents (github.com) AI
Facebook Research has released HyperAgents, a framework for self-referential self-improving AI agents.
$500 GPU outperforms Claude Sonnet on coding benchmarks (github.com) AI
A GitHub project claims a $500 GPU setup that outperforms Claude Sonnet on coding benchmarks.
Show HN: I put an AI agent on a $7/month VPS with IRC as its transport layer (georgelarson.me) AI
The author describes deploying an AI agent on a low-cost VPS, using IRC as the communication/transport layer.
Pretraining Language Models via Neural Cellular Automata (hanseungwook.github.io)
A research post exploring whether neural cellular automata can help with language model pretraining.
Astral to Join OpenAI (astral.sh)
Astral's own write-up on joining OpenAI and what it expects for Ruff, uv, and open source tooling.
OpenAI to Acquire Astral (openai.com)
OpenAI says it plans to acquire Astral, the team behind fast Python tooling like uv and Ruff.