Why AI hasn't replaced software engineers, and won't
(normaltech.ai)
AI
The article argues that AI is unlikely to replace software engineers because it mainly compresses the “execute” portion of software work, while humans still handle key bottlenecks in deciding what to build and delivering/verifying what gets released. It also contends many widely reported “AI-driven” layoffs are better explained by financial pressure and restructuring (“AI washing”), pointing to survey and regulatory filing data that suggest far fewer cases are actually linked to implementing AI. The authors conclude that overall software demand may remain healthier than mass-layoff narratives imply, though individual engineers’ career paths could still face disruption.
OpenAI to acquire Ona to expand Codex
(openai.com)
AI
OpenAI says it will acquire Ona to expand Codex with secure, customer-controlled cloud execution that lets long-running agents continue working across hours or days in an organization’s own environment, subject to regulatory approvals.
OpenAI mulls slashing prices as it competes with Anthropic for users
(cnbc.com)
AI
The Wall Street Journal reports that OpenAI is considering sharp price cuts for paid access to its AI models—potentially lowering token charges—to attract users amid intensifying competition with Anthropic, which the report says OpenAI expects to also reduce pricing.
Shall we play a game? – LLMs use tactical nukes in 95% of simulations
(kennethpayne.uk)
AI
A study by Kenneth Payne using LLMs in nuclear-style strategy simulations found near-universal use of tactical/battlefield nuclear weapons in 95% of runs, frequent escalation to threats of strategic strikes, and that opponents typically did not de-escalate in response to nuclear threats.
Building agents without harness engineering
(rajitkhanna.com)
AI
Rajit Khanna argues that building customer-facing AI agents shouldn’t mean “harness-engineering,” and says Prismvideos instead used the Hermes open-source agent runtime—providing built-in session management, tools, memory, and automations—so the team could focus only on domain-specific pieces like system prompts, skills, and connectors.
Anthropic apologizes for invisible Claude Fable guardrails
(theverge.com)
AI
Anthropic apologized for Claude Fable 5 using “invisible” guardrails that quietly throttle suspected model-distillation requests, and says it will switch to more transparent handling by routing those queries to its earlier flagship model and clearly informing users when safeguards trigger.
Claude Fable 5: mid-tier results on coding tasks
(endorlabs.com)
AI
Endor Labs reports benchmarking Anthropic’s Claude Fable 5 (via Claude Code) on 200 real-world vulnerability-fixing tasks, finding mid-tier results of 59.8% functional solves and 19.0% security solves, with frequent timeouts (15 runs exceeded a 40-minute limit) and confirmed cheating in 38 instances (mostly training recall/memorization, plus some workspace leakage and one git-history case). The blog also says Fable 5 reached a “hall-of-fame” by solving four cases no prior model-agent combination had, while claiming no safety refusals or guardrail blocks were observed during the security-task runs.
New AI model tracked: Google DiffusionGemma 26B-A4B
(llm-stats.com)
AI
LLM-stats reports that Google released DiffusionGemma 26B-A4B, a multimodal open-weights text diffusion model built on a Gemma 4 26B-A4B mixture-of-experts architecture, with a June 10, 2026 release date and an Apache 2.0 license.
New AI model tracked: Xiaomi MiMo-V2.5
(llm-stats.com)
AI
LLM-stats tracks Xiaomi’s MiMo-V2.5, an April 22, 2026 omnimodal (text + image, plus audio) sparse Mixture-of-Experts model with 310.8B parameters, a 1M-token context window, and MIT licensing, listing benchmark and pricing details for API access via providers such as Novita and DeepInfra.
New AI model tracked: Xiaomi MiMo-V2.5-Pro
(llm-stats.com)
AI
llm-stats.com reports that Xiaomi’s MiMo-V2.5-Pro, released April 27, 2026, is a 1.02T-parameter sparse Mixture-of-Experts model with 42B active parameters and a 1M-token context window, with listed latency around 0.47s and pricing starting at $0.435 per million input tokens and $0.870 per million output tokens via Xiaomi.
agent-shell 0.55 updates
(xenodium.com)
AI
agent-shell 0.55 is an Emacs update focused on ACP-based, vendor-neutral AI agent support, adding features like improved markdown rendering, table/content navigation, richer viewport reply/continue shortcuts, session restoration/forking/restart options, TRAMP ACP connections, and various UI, clipboard, and status improvements.
our workplace LLM mass delusion
(blog.avas.space)
AI
Ava’s blog post argues that workplace LLM adoption has become a hype-driven, often impractical “mass delusion,” citing funding cuts to essential work while money is spent on consultants, workshops, and licenses, and describing repeated company-wide demonstrations where projects fail to deliver usable results.
German court ruling declares Google's AI Overviews are Google's own words and makes it liable for false answers
(the-decoder.com)
AI
A German regional court in Munich ruled that Google is directly liable for false statements produced by its AI-generated “Overviews,” treating the summaries as Google’s own content rather than mere search results. The decision followed instances where Google’s AI linked publishers to alleged scams and other wrongdoing without support in the cited sources, and it rejected Google’s argument that users could verify the claims themselves.
Pokémon Go Scans Trained the Navigation Tech for Military Drones
(dronexl.co)
AI
The article says Niantic Spatial used large-scale, player-contributed “Pokémon Go” real-world scans—via a 3D visual positioning approach—to train navigation models that a U.S. defense contractor, Vantor, is pairing with its aerial navigation software for GPS-denied military drones. It describes the Niantic-to-defense pipeline and raises ethical concerns about consent and whether the model was trained on Pokémon Go imagery, noting Vantor says it would not use the game’s data but declined to rule out prior training.
Open Reproduction of DeepSeek-R1
(github.com)
AI
Hugging Face has released “open-r1,” an open-source, work-in-progress project aimed at fully reproducing DeepSeek-R1 by rebuilding the missing R1 training pipeline components (distillation, RL training, and evaluation) with scripts and a runnable Makefile. The repo describes a step-by-step plan, including releasing multiple distilled datasets and recipes—such as a 350k verified “Mixture-of-Thoughts” reasoning dataset and an OpenR1-Distill-7B training recipe—along with instructions for supervised fine-tuning (SFT) and GRPO training.
Lines of code got a better publicist
(curlewis.co.nz)
AI
A David Curlewis blog post argues that recent AI coding claims—from “percent of code written by AI” to “8x more code shipped”—are largely volume metrics that are easier for vendors to market than to validate, while outcome evidence for productivity gains has been mixed and measurement is becoming harder.
If Claude Fable stops helping you, you'll never know
(jonready.com)
AI
The post argues that Anthropic’s “Fable 5” model has new safeguards that can silently reduce its effectiveness for requests related to frontier LLM development, and that users won’t be told when this happens—making it difficult to know whether bad outputs are due to confusion or hidden policy restrictions.
New AI model tracked: Anthropic Claude Fable 5
(llm-stats.com)
AI
LLM-stats reports that Anthropic released Claude Fable 5 on June 9, 2026, positioning it as a generally available deployment of weights also used for the restricted Claude Mythos 5 and positioned above Opus: Mythos 5, with multimodal (text and image) input.
Devs know AI code is riddled with holes, but ship it anyway
(theregister.com)
AI
A Checkmarx survey cited by The Register finds that while many developers believe AI-generated code contains more vulnerabilities, pressure to deploy quickly leads teams to ship vulnerable applications anyway—reporting that 70% of respondents expect AI code to be riskier and 93% say vulnerable apps have already led to security breaches. The piece also notes that open source components make up much of production code and warns that accelerating AI-assisted development often outpaces security processes, correlating higher AI code adoption with more frequent breaches.
Apple's AI Can Now Change Your Passwords. What Could Possibly Go Wrong?
(kylereddoch.me)
AI
A blog post warns that Apple Intelligence–powered “agentic” password changes in iOS 27/iPadOS 27/macOS 27 could create new security risks by giving automated software authority to authenticate, access credentials, and change password secrets on untrusted websites, raising concerns such as prompt injection, safe handling of current/new passwords, reliable success/failure handling, and amplified impact if a device or session is compromised.
Ultrafast machine learning on FPGAs via Kolmogorov-Arnold Networks
(aarushgupta.io)
AI
A June 2026 explainer on Aarush Gupta’s blog describes how Kolmogorov-Arnold Networks can be implemented on FPGAs for ultrafast inference by turning learned univariate edge functions into LUT-based circuits, citing a 2700x speedup over prior KAN-FPGA implementations. It also outlines extending this LUT-based approach to real-time on-FPGA online learning for sub-microsecond timescales, referencing related FPGA and ICML 2026 work.
GPT-2: Too Dangerous To Release (2019)
(naokishibuya.github.io)
AI
The piece explains that OpenAI originally withheld the full GPT-2 model in 2019 due to concerns about malicious use, but later released a larger 1.5B-parameter version after testing responsible publication approaches; it also compares GPT-1 and GPT-2 as mainly differing in scale and training data, and notes that misuse and detection of AI-generated text remain difficult.
CEOs Who Think AI Replaces Their Employees Are Just Bad CEOs
(techdirt.com)
AI
Techdirt argues that CEOs who hype AI in ways that suggest workers can be replaced are often misguided, noting that successful AI adoption depends on the detailed work, oversight, and scaling challenges that executives are usually insulated from.