AI news

EU Commission looking at practical consequences of Anthropic decision (reuters.com) AI

The EU Commission is reportedly reviewing the practical consequences of an Anthropic decision, according to Reuters; further details were not available in the provided article text.

9 minutes ago Source: Hacker News

Rio de Janeiro's city government model Rio3.5 beats Qwen3.7 in recent benchmarks (twitter.com) AI

A post claims that Rio de Janeiro’s city government model, “Rio3.5,” outperforms “Qwen3.7” on recent benchmarks.

19 minutes ago Source: Hacker News

Formal Methods and the Future of Programming (blog.janestreet.com) AI

Jane Street’s Yaron Minsky argues that although the firm previously found full formal methods too costly relative to their benefits, recent advances in agentic coding are making the tradeoff more favorable: they increase the need to verify messy, invariant-breaking agent output, and they let formal methods serve as a powerful feedback mechanism alongside testing and type systems. He says the company is now building a team focused on formal methods, leveraging its control over the language (including OxCaml) and a user base that can support both near-term improvements and longer-term proof-oriented directions, with hiring planned in London and New York.

24 minutes ago Source: Hacker News

KPMG pulls report on AI usage due to apparent hallucinations (techcrunch.com) AI

KPMG has pulled a 2025 report on AI usage after multiple organizations said its claims were untrue or misleading, with a research group attributing the inaccuracies to AI hallucinations in the writing process; KPMG said it removed the report while investigating and reiterated expectations for responsible AI use with human oversight.

about 1 hour ago Source: Hacker News

Reinventing Control Theory One Feature at a Time: The Fallacy of Agentic Loops (medium.com) AI

The article argues that “agentic loops” in AI coding—adding agents to monitor, review, and iterate over each other’s work—amount to a fragmented, hype-driven rediscovery of control theory without the full methodology needed for safe, reliable operation. It warns that probabilistic agents validating one another are not automatically a robust control system unless stop conditions, trusted signals, authority, boundaries, and fallback paths are explicitly designed, and it urges teams and leadership to address hard operational and financial questions before deploying such loops.

about 2 hours ago Source: Hacker News

UK police officer under criminal investigation over alleged use of AI (ft.com) AI

The FT reports that a UK police officer is under criminal investigation over allegations involving the use of AI, though details were not available in the provided article text.

about 4 hours ago Source: Hacker News

Frontier AI companies will never exceed the capability frontier again (andrewtrask.substack.com) AI

The Substack post argues that “frontier” AI companies will no longer be able to surpass today’s capability frontier, claiming that ensembles and decentralized networks of smaller models increasingly outperform single top-tier systems on speed, accuracy, and cost, thanks to scaling/ensemble effects and inference-efficiency techniques such as caching and indexing.

about 7 hours ago Source: Hacker News

Don't trust large context windows (garrit.xyz) AI

Garrit argues that LLMs have a “smart” attention region and a “dumb” region within the context window, so advertised context sizes (100k+ to millions of tokens) are often mostly marketing, and effective performance drops as the window fills, especially for coding agents. The post suggests avoiding the degraded region by restarting sessions and handing off stable written artifacts (specs/plans/skills) rather than relying on auto-compaction summaries, which are generated only after degradation has already set in.

about 8 hours ago Source: Hacker News

Show HN: I run a vision model on every screenshot, locally, on a 4GB GPU (github.com) AI

ScreenMind (open source) is presented as a privacy-first “screen memory” that captures screenshots when the screen changes, analyzes them locally with Gemma 4 multimodal capabilities (plus OCR and semantic embeddings), and lets users search and chat over their screen history. The project claims all processing runs on-device with no telemetry after the initial model download, offers modes for faster vs deeper analysis, and includes features like voice memo/meeting transcription, analytics, and integrations via an MCP server and other tools.

about 9 hours ago Source: Hacker News

Making Claude a Chemist (anthropic.com) AI

Anthropic says its Claude models are increasingly useful for chemistry, citing tests on NMR spectroscopy tasks that compared predictions from multiple Claude versions (Opus 4.7/4.6, Sonnet 4.6) against dedicated NMR tools using data from 20 recently published compounds. The company reports that Opus 4.7 produced notably accurate 1D NMR peak positions and splitting patterns, and also performed “inverse” structure elucidation from NMR peak lists plus formula (and, for harder cases, an added starting-material hint), reaching correct structures in all simpler cases and in most harder ones.

about 10 hours ago Source: Hacker News

The future of Siri, or: why private inference isn’t private enough (blog.cryptographyengineering.com) AI

Cryptography engineer Matthew Green argues that Apple’s planned “private” Siri/AI via Private Cloud Compute and confidential inference may limit direct access by Apple and Google, but privacy is not assured once Siri-style agents must interact with external services for real-world actions, creating new avenues for data leakage through queries and the agent’s discretion.

about 11 hours ago Source: Lobsters

The whirlwind 24 hours that led to export controls on Anthropic (politico.com) AI

Politico reports on the rapid, 24-hour sequence of events that culminated in the White House imposing export controls on Anthropic.

about 12 hours ago Source: Hacker News

'Tell Him He's a Piece of Shit': Meta's New AI Unit Is a Total Mess (wired.com) AI

WIRED reports that Meta’s newly formed Applied AI unit is marked by deep internal frustration tied to the company’s broader AI restructuring, including accounts of a chaotic employee-only livestream incident and widespread complaints that assigned tasks feel menial or soul-crushing.

about 12 hours ago Source: Hacker News

LLMs Pre-Commodify Ideas (summerlightning.substack.com) AI

The post argues that ideas generated and shared through LLMs are increasingly “pre-commodified,” arriving at around the same time to multiple people with unclear provenance, because the models recombine temporally deep training data into a shared latent space. The author contrasts “Boomers” (legal/slow, consistent origin claims) with “Sooners” (front-running ideas and profiting from later diffusion), and suggests that establishing provenance, and dealing with new threats like data poisoning, will become central complements to AI deployment and distribution.

about 14 hours ago Source: Hacker News

Rio 3.5 Open 397B – from Rio de Janeiro's city government (huggingface.co) AI

A Hugging Face model card describes Rio 3.5 Open 397B, an open, multimodal “frontier-class” AI model from Rio de Janeiro’s municipal IT company IplanRIO, post-trained from Qwen 3.5 397B and released under the MIT license. The card highlights its SwiReasoning framework for dynamically switching between latent and explicit reasoning to improve the accuracy/efficiency trade-off, lists key model specs (Mixture-of-Experts with ~397B total/~17B active parameters and a ~1M token context window), and provides benchmark results plus implementation examples for Transformers, vLLM, and SGLang.

about 15 hours ago Source: Hacker News

Chatbot teddies for three‑year‑olds? Why AI toys are risky for kids (rnz.co.nz) AI

RNZ reports on research and concerns that AI-powered toys like chatbot teddies may be especially risky for very young children, because their “human-like” and overly validating language can build strong emotional trust and attachment. The article also warns about “infinite chat” driving prolonged engagement, potential exposure to adult topics, and privacy/data-collection issues from seemingly personal conversations, particularly when toys are used without adult supervision.

about 15 hours ago Source: Hacker News

Amazon security research reportedly led to the White House's Anthropic Fable ban (theverge.com) AI

A Wall Street Journal report, echoed by The Verge, says Amazon cybersecurity research and discussions between Andy Jassy and the White House helped trigger an export control directive that led Anthropic to cut off access to its Fable 5 and Mythos 5 models for foreign nationals.

about 15 hours ago Source: Hacker News

State Attorneys General Are Investigating OpenAI (nytimes.com) AI

The article says multiple U.S. state attorneys general are investigating OpenAI, though no further details are available from the provided text.

about 15 hours ago Source: Hacker News
