AI News

ARC-AGI-3 (arcprize.org) AI

ARC Prize has released details for “ARC-AGI-3,” a new stage of its benchmark/challenge aimed at evaluating progress toward more general AI systems.

12 days ago Source: Hacker News

Show HN: A plain-text cognitive architecture for Claude Code (lab.puga.com.br) AI

A developer blog post describes a plain-text cognitive architecture concept intended to work with Claude Code.

12 days ago Source: Hacker News

Show HN: Optio – Orchestrate AI coding agents in K8s to go from ticket to PR (github.com) AI

Show HN introduces Optio, a tool for orchestrating AI coding agents on Kubernetes to turn tickets into pull requests.

12 days ago Source: Hacker News

“Disregard That” Attacks (calpaterson.com) AI

The post discusses “disregard”/instruction-following attack techniques that can cause systems (e.g., LLMs) to ignore or override intended instructions.

12 days ago Source: Hacker News

From zero to a RAG system: successes and failures (en.andros.dev) AI

The post explains the process of building a RAG (retrieval-augmented generation) system and shares lessons from both successes and failures.

12 days ago Source: Hacker News

Elevated error rates on Opus 4.6 (status.claude.com) AI

A status-page incident reports elevated error rates affecting Claude’s Opus 4.6 model/service.

12 days ago Source: Hacker News

Judge blocks Pentagon effort to 'punish' Anthropic with supply chain risk label (cnn.com) AI

A judge blocks the Pentagon from using a supply-chain risk label to “punish” Anthropic, after the company challenged the move.

12 days ago Source: Hacker News

Order Granting Preliminary Injunction – Anthropic vs. U.S. Department of War [pdf] (storage.courtlistener.com) AI

A court order grants a preliminary injunction in a legal dispute involving Anthropic and the U.S. Department of War.

12 days ago Source: Hacker News

Agent-to-agent pair programming (axeldelafosse.com) AI

The post discusses using agent-to-agent collaboration for pair programming using AI agents.

12 days ago Source: Hacker News

Chroma Context-1: Training a Self-Editing Search Agent (trychroma.com) AI

Chroma publishes research on Context-1, a self-editing search agent designed to improve its own search behavior over time.

12 days ago Source: Hacker News

HyperAgents: Self-referential self-improving agents (github.com) AI

Facebook Research has released HyperAgents, a framework for self-referential self-improving AI agents.

12 days ago Source: Hacker News

$500 GPU outperforms Claude Sonnet on coding benchmarks (github.com) AI

A GitHub project claims a $500 GPU setup that outperforms Claude Sonnet on coding benchmarks.

12 days ago Source: Hacker News

Show HN: I put an AI agent on a $7/month VPS with IRC as its transport layer (georgelarson.me) AI

The author describes deploying an AI agent on a low-cost VPS, using IRC as the communication/transport layer.

12 days ago Source: Hacker News

Pretraining Language Models via Neural Cellular Automata (hanseungwook.github.io)

A research post exploring whether neural cellular automata can help with language model pretraining.

13 days ago

Astral to Join OpenAI (astral.sh)

Astral's own write-up on joining OpenAI and what it expects for Ruff, uv, and open source tooling.

13 days ago

OpenAI to Acquire Astral (openai.com)

OpenAI says it plans to acquire Astral, the team behind fast Python tooling like uv and Ruff.

13 days ago