Zain Naboulsi

Originally published at dailyairundown.substack.com

Daily AI Rundown - February 23, 2026

This is the February 23, 2026 edition of the Daily AI Rundown newsletter. Subscribe on Substack for daily AI news.



Tech News

Anthropic

Anthropic has publicly accused Chinese AI labs DeepSeek, Moonshot AI, and MiniMax of orchestrating an industrial-scale campaign to "siphon" proprietary capabilities from its Claude models through unauthorized distillation. The San Francisco-based company alleges the labs used approximately 24,000 fraudulent accounts to generate over 16 million exchanges, effectively training their own models on Claude's outputs in violation of Anthropic's terms of service and regional access restrictions and leapfrogging years of research. Beyond the commercial harm, Anthropic warns that illicit distillation poses severe national security risks, since it allows foreign actors to strip away essential safety safeguards and repurpose frontier AI for offensive cyber operations, disinformation, and mass surveillance, while also undermining U.S. chip export controls by letting foreign competitors close technological gaps using stolen American innovation. The disclosure significantly escalates tensions between American and Chinese developers, and Anthropic is calling for rapid, coordinated intervention from policymakers and the global AI community to protect billions of dollars in research investment.

Anthropic researchers have introduced the "persona selection model" to explain why artificial intelligence assistants frequently exhibit human-like traits and emotions by default. The theory suggests that during the pretraining process, AI models learn to function as sophisticated simulators by recreating the diverse characters, or "personas," found in vast datasets of human text and dialogue. Rather than being explicitly programmed for humanity, these systems derive their behavior from the necessity of accurately predicting how a specific character would respond in a given context. Consequently, AI assistants effectively act as roleplaying engines that simulate a helpful human persona rather than operating as traditional, logic-based software.


Other News

Treasure Data has launched Treasure Code, an AI-native command-line interface developed by a single engineer in just 60 minutes using Claude Code. While the rapid development highlights the potential of agentic coding, the company emphasizes that the project’s success relied on a rigorous governance framework and multi-week planning phase to de-risk the production environment. This system utilizes upstream guardrails to inherit platform-level security permissions, ensuring that AI-generated code cannot bypass access controls or expose sensitive data. To maintain quality, the company implemented a three-tier pipeline where an AI-based reviewer validates all code for architectural alignment and security compliance before deployment.

Artificial intelligence is dismantling the long-standing financial and technical barriers to COBOL modernization by automating the complex reverse-engineering of decades-old legacy systems. While COBOL remains vital to global infrastructure, powering an estimated 95% of U.S. ATM transactions, the critical shortage of specialized developers and outdated documentation has historically made system updates prohibitively expensive. Modern AI tools now streamline this transition by mapping hidden dependencies and documenting intricate data flows, transforming what were once multi-year consulting projects into initiatives completed in a matter of quarters. This shift allows organizations to finally migrate mission-critical logic to modern frameworks while maintaining the reliability and data integrity of their original legacy code.

Elon Musk’s xAI has launched Grok 4.20, introducing a multi-agent architecture where four specialized AI models debate in real time to reach a consensus, a process that has reportedly reduced hallucinations by 65%. This release coincides with significant industry shifts, including Meta’s integration of Manus AI into its Ads Manager and Apple’s expansion into AI-powered hardware such as smart glasses and wearable pendants. The tech sector also faced scrutiny as OpenAI CEO Sam Altman cautioned against "AI washing," criticizing companies that falsely attribute routine layoffs to technological displacement. Collectively, these developments highlight an intensifying competitive landscape focused on increasing model reliability and expanding AI's presence in consumer hardware.

San Francisco-based startup Guide Labs has launched Steerling-8B, an open-source 8-billion-parameter large language model featuring a novel architecture designed for inherent interpretability. Unlike traditional deep learning models that function as "black boxes," this system utilizes a specialized concept layer that allows every generated token to be traced back to specific origins in its training data. This foundational engineering approach aims to solve persistent industry challenges such as hallucinations and bias by providing developers with precise, auditable control over the model's outputs. The company anticipates the technology will be particularly valuable for regulated sectors like finance and scientific research, where the ability to verify and steer AI behavior is critical for safe deployment.

Prefer to listen? ReallyEasyAI on YouTube


Biz News

Google

Google is reportedly in talks to invest $100 million in Fluidstack Ltd., an AI cloud startup that could be valued at $7.5 billion following the deal. The startup specializes in the rapid provisioning of large-scale graphics card clusters for artificial intelligence training, utilizing advanced hardware such as Nvidia’s Blackwell-series chips. This investment is viewed as a strategic move by Google to accelerate the adoption of its proprietary Tensor Processing Unit (TPU) chips through Fluidstack's infrastructure network. Furthermore, the deal underscores Fluidstack’s growing industry footprint, which includes a $10 billion credit line for expansion and a $50 billion data center collaboration with AI developer Anthropic.

Google has restricted access to its Antigravity AI platform and suspended associated accounts, citing "malicious usage" that allegedly degraded service quality for other users. The crackdown specifically targets those using OpenClaw, an open-source autonomous agent tool recently linked to rival OpenAI, which Google claims was being used to exploit Gemini token limits. In response to the sweeping enforcement of terms of service, OpenClaw creator Peter Steinberger announced the project will remove Google support entirely. This move effectively severs a strategic pipeline that allowed an OpenAI-adjacent tool to leverage Google’s advanced models while addressing ongoing security and governance concerns surrounding autonomous agents.


Anthropic

The traditional venture capital norm of exclusive loyalty is rapidly eroding as at least a dozen major investors in OpenAI, including Sequoia Capital and Founders Fund, have also backed rival Anthropic’s recent $30 billion funding round. This shift toward dual-backing challenges long-standing industry practices where VCs typically protect confidential startup data and avoid supporting direct competitors in exchange for board influence. High-profile firms like BlackRock are participating in this trend despite leadership overlaps with OpenAI’s board, signaling a transition toward hedge-fund-style diversification to meet the sector's massive capital demands. While OpenAI CEO Sam Altman has reportedly pressured investors to avoid "non-passive" support of rivals, the unprecedented scale of AI financing is forcing a fundamental reevaluation of traditional fiduciary boundaries.

Anthropic’s new AI Fluency Index introduces a specialized benchmark for measuring how effectively individuals collaborate with artificial intelligence using a framework of 24 distinct behaviors. The report, which analyzed nearly 10,000 anonymized interactions from early 2026, found that the most proficient users treat AI as an augmentative "thought partner" rather than a tool for simple delegation. While these collaborative sessions exhibit high levels of fluency, the data reveals a concerning trend where users are significantly less likely to scrutinize AI reasoning or identify missing context when the technology generates tangible outputs like code or documents. These findings establish a baseline for tracking the evolution of AI literacy as the global workforce shifts toward more complex, integrated human-AI workflows.


Other News

Shares in several major sectors plummeted Monday as growing fears regarding artificial intelligence’s disruptive potential triggered a broad market selloff. International Business Machines Corp. suffered its sharpest single-day decline in 25 years, while payment processors and delivery services like American Express and DoorDash saw significant losses following a bearish research report outlining future economic risks. The downturn was further fueled by Anthropic’s release of an AI tool targeting legacy programming languages and warnings from author Nassim Taleb about impending volatility in the software industry. Investors are increasingly reevaluating the stability of current market leaders as emerging AI capabilities threaten to displace traditional business models across the global economy.

India is hosting a four-day AI Impact Summit designed to attract global investment, drawing approximately 250,000 visitors and top executives from industry giants like OpenAI, Alphabet, and Anthropic. The high-profile event features participation from Prime Minister Narendra Modi and French President Emmanuel Macron, signaling significant geopolitical backing for the country’s technological expansion. Key leaders in attendance include Sam Altman, Sundar Pichai, and Mukesh Ambani, who are gathering to discuss the future of the sector and witness major domestic launches such as Sarvam AI’s new line of India-built models and devices. This summit reinforces India's position as a central hub for the next wave of artificial intelligence development and international collaboration.

Prefer to listen? ReallyEasyAI on YouTube


Podcasts

The Shape of AI: Jaggedness, Bottlenecks and Salients

The evolution of artificial intelligence is characterized by a jagged frontier of capabilities, meaning that while AI can perform highly complex tasks like medical diagnoses or statistical analysis at a superhuman level, it often struggles with seemingly simpler functions such as memory retention or navigating edge cases. This uneven development creates critical bottlenecks where a single deficiency prevents the complete automation of comprehensive tasks, ensuring that human intervention remains essential for the foreseeable future. However, as AI laboratories actively target these specific weaknesses, termed reverse salients, breakthroughs can trigger sudden and massive leaps forward in overall system functionality, as demonstrated when recent advancements in image generation suddenly unlocked new capacities for AI to create complex visual presentations. Ultimately, while successfully overcoming these bottlenecks will cause AI capabilities to advance in unpredictable lurches, the inherent jaggedness of its ongoing development suggests that the technology will continue to complement rather than entirely replace human expertise, particularly in areas requiring nuanced judgment, institutional navigation, and real-world collaboration.

https://www.oneusefulthing.org/p/the-shape-of-ai-jaggedness-bottlenecks


Real-Time Adaptive Tracking of Fluctuating Relaxation Rates in Superconducting Qubits

Superconducting qubits are essential components of quantum computers, but their performance is severely limited by unpredictable environmental noise that causes their energy relaxation times to fluctuate. Traditionally, scientists measured these relaxation times using slow, nonadaptive methods that took seconds or minutes, which unfortunately averaged out and masked any rapid changes in the qubit's environment. To solve this problem, researchers developed a real-time, adaptive tracking protocol using a specialized hardware controller equipped with a field-programmable gate array, which uses Bayesian statistics to update its estimations continuously based on single-shot measurements. This new technique measures qubit relaxation times up to one hundred times faster than previous methods, completing estimates in just a few milliseconds. By operating at this unprecedented speed, the researchers discovered that a qubit's relaxation time can drastically change in mere tens of milliseconds due to environmental defects, known as two-level systems, that switch on and off at rates up to 10 Hertz. Ultimately, this rapid detection method provides a deeper understanding of quantum decoherence and offers a new pathway for dynamically calibrating quantum processors and reducing errors in real time.

https://journals.aps.org/prx/pdf/10.1103/gk1b-stl3
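
For intuition, here is a minimal sketch of the core idea in plain Python rather than on an FPGA: maintain a posterior over candidate T1 values, update it with Bayes' rule after each single-shot measurement, and probe at delays near the current estimate. The grid, the forgetting step, and the simulated TLS jump are illustrative assumptions, not the paper's actual protocol.

```python
import numpy as np

# Minimal sketch (not the paper's FPGA protocol) of grid-based Bayesian
# T1 tracking from single-shot measurements, assuming ideal readout with
# no state-preparation or measurement errors.

t1_grid = np.linspace(1e-6, 200e-6, 400)          # candidate T1 values (s)
posterior = np.ones_like(t1_grid) / t1_grid.size  # flat prior

def update(posterior, delay, outcome):
    """Bayes update after one shot: qubit prepared excited, measured after
    `delay`; outcome=1 means still excited, with probability exp(-delay/T1)."""
    p_excited = np.exp(-delay / t1_grid)
    likelihood = p_excited if outcome == 1 else 1.0 - p_excited
    posterior = posterior * likelihood
    return posterior / posterior.sum()

def next_delay(posterior):
    """Adaptive heuristic: probe near the current T1 estimate, where a
    single shot carries the most information about the decay."""
    return float(np.sum(posterior * t1_grid))

rng = np.random.default_rng(0)
true_t1 = 60e-6
for shot in range(2000):
    if shot == 1000:
        true_t1 = 20e-6          # abrupt jump, as a switching TLS would cause
    delay = next_delay(posterior)
    outcome = int(rng.random() < np.exp(-delay / true_t1))
    posterior = update(posterior, delay, outcome)
    # small forgetting step keeps the tracker responsive to fluctuations
    posterior = 0.999 * posterior + 0.001 / t1_grid.size

print(f"estimated T1 ≈ {np.sum(posterior * t1_grid) * 1e6:.1f} µs")
```

Because every shot takes only microseconds, this style of estimator converges in milliseconds, which is what lets the protocol catch T1 jumps that slower averaged measurements wash out.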


Shadow Mode, Drift Alerts and Audit Logs: Inside the Modern Audit Loop

As artificial intelligence systems continuously evolve and adapt in real time, traditional, intermittent compliance reviews are no longer sufficient to mitigate risks or ensure reliable performance. To address this challenge, organizations must implement an inline audit loop, which integrates continuous governance directly into the lifecycle of AI development and deployment. This proactive framework relies on three foundational pillars: shadow mode rollouts, which safely test new models alongside existing systems without impacting live operations; real-time monitoring, which immediately detects and escalates issues like data drift, algorithmic bias, or user misuse; and meticulously engineered, immutable audit logs, which record both the outcomes and the underlying rationales of AI decisions to guarantee legal defensibility. Ultimately, rather than hindering progress, this continuous approach to compliance accelerates innovation by automating oversight, preventing catastrophic failures, and fostering profound trust among developers, regulators, and the public.

https://venturebeat.com/orchestration/shadow-mode-drift-alerts-and-audit-logs-inside-the-modern-audit-loop
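
As a concrete (and entirely hypothetical) illustration of those three pillars, the sketch below serves production traffic while scoring a shadow model silently, keeps a rolling disagreement rate as a crude drift alert, and hash-chains each audit record so tampering is detectable. The function names, window size, and threshold are invented.

```python
import json, time, hashlib
from collections import deque

# Hypothetical sketch of the three pillars: shadow-mode serving, a rolling
# drift alert, and a hash-chained (tamper-evident) audit log.

audit_log = []                       # in practice: write-once storage
disagreements = deque(maxlen=500)    # rolling window for drift detection

def audit_record(entry):
    """Append a tamper-evident record: each entry hashes its predecessor."""
    prev = audit_log[-1]["hash"] if audit_log else ""
    payload = json.dumps(entry, sort_keys=True) + prev
    entry["hash"] = hashlib.sha256(payload.encode()).hexdigest()
    audit_log.append(entry)

def serve(request, prod_model, shadow_model, drift_threshold=0.2):
    prod_out = prod_model(request)
    shadow_out = shadow_model(request)        # never shown to the user
    disagreements.append(prod_out != shadow_out)

    audit_record({
        "ts": time.time(),
        "request": request,
        "prod": prod_out,
        "shadow": shadow_out,
        "rationale": "prod served; shadow evaluated silently",
    })

    if len(disagreements) >= 50:              # wait for a meaningful sample
        drift = sum(disagreements) / len(disagreements)
        if drift > drift_threshold:
            print(f"DRIFT ALERT: shadow disagrees on {drift:.0%} of recent traffic")
    return prod_out                           # only production output is served

# usage with toy stand-in models:
print(serve("classify: invoice", lambda r: "finance", lambda r: "legal"))
```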


Stack Overflow: 2025 Developer Survey

The 2025 Stack Overflow Developer Survey, which gathered insights from over 49,000 global respondents, highlights a complex landscape of technological adoption and evolving professional sentiments. While artificial intelligence tools have seen widespread integration, with 84 percent of developers utilizing them, overall enthusiasm has notably waned, largely due to persistent frustrations with inaccurate outputs and the time-consuming nature of debugging AI-generated solutions that are "almost right." Concurrently, foundational technologies continue to solidify their dominance, evidenced by Python's accelerated growth, Rust's enduring status as the most admired programming language, and the near-universal reliance on tools like Docker and Visual Studio Code. In the workplace, general job satisfaction has marginally improved to 24 percent, driven primarily by desires for autonomy, competitive compensation, and the opportunity to solve tangible problems, even as confidence slips slightly regarding the long-term threat of artificial intelligence to job security (in a meta twist, 42 percent of respondents also felt the survey itself was excessively lengthy). Furthermore, developers continue to rely heavily on Stack Overflow for human-verified knowledge, particularly when troubleshooting complex or AI-related issues, underscoring the enduring necessity of human expertise in an increasingly automated industry.

https://survey.stackoverflow.co/2025/


Cinder: A Fast and Fair Matchmaking System

Cinder is a novel, two-stage matchmaking system designed to improve the fairness and speed of multiplayer online game pairings, particularly when pre-made teams exhibit wide or skewed skill distributions that traditional mean or median metrics fail to accommodate. To achieve this, the system first employs a rapid preliminary filter that utilizes the Ruzicka similarity index to evaluate the overlap of non-outlier skill ranges between potential opposing teams, immediately discarding fundamentally incompatible matches. Lobbies that pass this initial filter undergo a more rigorous secondary evaluation where individual player ratings are sorted into a non-linear series of skill buckets, formulated using an inverted normal distribution to provide greater precision around average skill levels. Finally, the system calculates the Wasserstein distance between these sorted bucket indices to generate a quantifiable Sanction Score, representing the overall dissimilarity or unfairness between the teams. By establishing maximum acceptable thresholds for this score, as demonstrated through extensive large-scale simulations of over 140 million pairings, game developers can efficiently balance queue wait times with optimal match fairness.

https://arxiv.org/pdf/2602.17015
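
Reading the two stages as code helps: the sketch below uses an interval intersection-over-union as a Ruzicka-style overlap check on trimmed skill ranges, then scores surviving pairings with SciPy's one-dimensional Wasserstein distance over bucket indices. The bucket edges, quantile cutoffs, and gate threshold are placeholders; the paper derives its buckets from an inverted normal distribution and tunes thresholds through large-scale simulation.

```python
import numpy as np
from scipy.stats import wasserstein_distance

# Illustrative sketch only: bucket edges, quantile cutoffs, and thresholds
# are invented, not taken from the Cinder paper.

def trimmed_range(skills, lo_q=0.1, hi_q=0.9):
    """Non-outlier skill range: drop the tails before comparing teams."""
    return np.quantile(skills, lo_q), np.quantile(skills, hi_q)

def ruzicka_overlap(a, b):
    """Ruzicka-style similarity of two intervals: |intersection| / |union|."""
    inter = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    union = max(a[1], b[1]) - min(a[0], b[0])
    return inter / union if union > 0 else 1.0

def sanction_score(team_a, team_b, bucket_edges):
    """Stage 2: map ratings to bucket indices, then measure team
    dissimilarity as the 1-D Wasserstein (earth mover's) distance."""
    return wasserstein_distance(np.digitize(team_a, bucket_edges),
                                np.digitize(team_b, bucket_edges))

team_a = np.array([1490, 1510, 1530, 1550, 1800])   # one high outlier
team_b = np.array([1500, 1520, 1540, 1560, 1580])
edges = np.array([1200, 1400, 1480, 1520, 1560, 1640, 1800])  # denser mid-range

overlap = ruzicka_overlap(trimmed_range(team_a), trimmed_range(team_b))
if overlap > 0.3:   # stage-1 gate (placeholder threshold)
    print("sanction score:", round(sanction_score(team_a, team_b, edges), 2))
else:
    print("discarded at stage 1, overlap =", round(overlap, 2))
```

The cheap interval check discards hopeless pairings before the costlier distributional comparison runs, which is what lets the system keep queue times down at scale.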


AI GAMESTORE: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games

In an effort to rigorously evaluate the progress of artificial intelligence toward human-like general cognitive capabilities, researchers have introduced the AI GAMESTORE, a scalable and open-ended benchmarking platform that tests models against a diverse array of human-designed games. Arguing that traditional, static benchmarks are too narrow to capture true cognitive versatility, the authors propose evaluating systems on the Multiverse of Human Games, a conceptual space encompassing all conceivable games created for human enjoyment. To operationalize this, the researchers utilized large language models alongside human-in-the-loop refinement to automatically source, synthesize, and standardize 100 playable digital games based on popular titles from commercial platforms like the Apple App Store and Steam. When seven state-of-the-art vision-language models were tested against human participants on these environments, the models exhibited a profound performance deficit, achieving less than ten percent of the median human score while requiring significantly more computational time. Specifically, the artificial intelligence systems demonstrated severe limitations in tasks demanding long-term memory, sophisticated forward planning, and the dynamic acquisition of world models, ultimately highlighting that current machine intelligence still falls considerably short of human adaptability in complex, interactive scenarios.

https://arxiv.org/pdf/2602.17594
https://aigamestore.org/


MedClarify: AI Agent for Medical Diagnosis With Case-Specific Follow-Up Questions

Current medical artificial intelligence models frequently struggle to accurately diagnose patients when initial clinical information is incomplete, a limitation that often results in significant diagnostic errors. To address this challenge, researchers developed MedClarify, an information-seeking artificial intelligence agent that mirrors the iterative reasoning of human clinicians by proactively asking targeted follow-up questions to resolve uncertainty. The system operates by first generating a list of candidate diagnoses and then selecting the most informative follow-up questions using a novel mathematical metric called diagnostic expected information gain. This metric calculates how effectively a potential question will reduce diagnostic uncertainty, specifically utilizing standardized medical codes to rule out entire branches of related diseases at once. By combining this advanced question-selection strategy with a Bayesian updating framework that continuously adjusts the probability of each diagnosis as new patient evidence is gathered, the agent systematically zeroes in on the correct condition. Experimental evaluations across multiple comprehensive medical datasets demonstrated that MedClarify substantially outperforms standard single-prediction models, improving diagnostic accuracy by approximately 27 percentage points on incomplete patient cases and proving the critical necessity of active information retrieval in automated clinical decision-making.

https://arxiv.org/pdf/2602.17308
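
The question-selection math is easy to see in miniature. In this toy sketch, expected information gain is the entropy of the current diagnostic posterior minus the expected posterior entropy after hearing a yes/no answer; the diagnoses, likelihood table, and simulated answer are fabricated, and MedClarify's real metric additionally exploits medical code hierarchies to prune whole disease branches at once.

```python
import numpy as np

# Toy sketch: diagnoses, likelihoods, and the simulated answer are fabricated.

diagnoses = ["flu", "pneumonia", "covid"]
posterior = np.array([0.5, 0.3, 0.2])

# P(answer is "yes" | diagnosis) for each candidate follow-up question
likelihoods = {
    "fever over 39C?": np.array([0.6, 0.8, 0.7]),
    "chest pain?":     np.array([0.1, 0.9, 0.3]),
    "loss of smell?":  np.array([0.1, 0.1, 0.8]),
}

def entropy(p):
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

def expected_information_gain(posterior, p_yes_given_dx):
    """H(posterior) minus the expected entropy after hearing the answer."""
    p_yes = np.sum(posterior * p_yes_given_dx)
    post_yes = posterior * p_yes_given_dx / p_yes
    post_no = posterior * (1 - p_yes_given_dx) / (1 - p_yes)
    return entropy(posterior) - (p_yes * entropy(post_yes)
                                 + (1 - p_yes) * entropy(post_no))

# ask the most informative question, then Bayes-update on the answer
best_q = max(likelihoods,
             key=lambda q: expected_information_gain(posterior, likelihoods[q]))
lik = likelihoods[best_q]                       # simulate answer: "yes"
posterior = posterior * lik / np.sum(posterior * lik)
print(best_q, "->", dict(zip(diagnoses, posterior.round(3))))
```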


Enhancing LLMs for Telecom using Dynamic Knowledge Graphs and Explainable RAG

Large language models often struggle to provide accurate and reliable information in the telecommunications sector due to the domain's complex standards, specialized terminology, and rapidly evolving network states. To overcome these limitations, researchers have introduced KG-RAG, a novel framework that integrates dynamic knowledge graphs with retrieval-augmented generation to enhance the performance of these models in telecom-specific applications. Instead of retrieving unstructured text chunks like standard systems, KG-RAG extracts information from authoritative documents, such as 3GPP specifications, to build a structured knowledge graph composed of interconnected entities and relationships. When a user poses a query, the system retrieves relevant, schema-aligned facts from this continuously updated graph to ground the language model's generated response. This structured approach ensures that the output is highly accurate, explicitly traceable to original regulatory guidelines, and capable of reflecting real-time network configurations. Experimental evaluations demonstrate that KG-RAG significantly outperforms traditional language models and standard retrieval frameworks in both text summarization and question-answering tasks by substantially reducing hallucinations and improving factual consistency across complex telecom scenarios.

https://arxiv.org/pdf/2602.17529
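
A stripped-down version of the retrieval step might look like the following: rather than fetching text chunks, the system pulls schema-aligned (subject, relation, object) facts, each tagged with its source document, and splices them into the prompt. The triples, the naive term-overlap scorer, and the prompt template are placeholder assumptions, not the paper's implementation.

```python
# Placeholder sketch, loosely styled after 3GPP concepts; the paper uses
# entity linking and a continuously updated graph, not word overlap.

knowledge_graph = [
    ("gNB", "connects_to", "AMF", "3GPP TS 23.501 §6.2"),
    ("AMF", "handles", "registration management", "3GPP TS 23.501 §5.3"),
    ("SMF", "manages", "PDU sessions", "3GPP TS 23.501 §5.6"),
]

def retrieve_facts(query, graph, k=2):
    """Score triples by word overlap with the query (a real system would
    traverse the graph from linked entities)."""
    terms = set(query.lower().rstrip("?").split())
    def score(triple):
        words = set(" ".join(triple[:3]).lower().split())
        return len(terms & words)
    return sorted(graph, key=score, reverse=True)[:k]

def build_prompt(query, graph):
    facts = retrieve_facts(query, graph)
    context = "\n".join(f"- {s} {r} {o} (source: {src})" for s, r, o, src in facts)
    return ("Answer using only the facts below; cite the source of each claim.\n"
            f"Facts:\n{context}\n\nQuestion: {query}")

print(build_prompt("Which function handles registration management?", knowledge_graph))
```

Grounding on explicit triples rather than raw text is what makes each claim in the answer traceable back to a specific clause of the source specification.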


Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents

Large language models increasingly interact with external environments to gather information for complex tasks, but every exploratory action, such as running a unit test or searching a database, incurs a cost in time or computing resources. Because standard models often rely on rigid exploration strategies that fail to weigh these costs against the potential benefits of new information, researchers have introduced a new framework called Calibrate-Then-Act. This approach treats environment exploration as a sequential decision-making problem and explicitly feeds the model prior probabilities, or estimates of its own uncertainty, before it selects an action. By prompting the model to explicitly compare the cost of further exploration against the risk of committing to a premature answer, Calibrate-Then-Act enables the agent to dynamically adapt its strategy. Across synthetic decision-making scenarios, open-domain question answering, and interactive coding tasks, providing these explicit uncertainty estimates allowed models to consistently achieve optimal trade-offs between accuracy and resource expenditure, demonstrating superior performance over both standard prompting and traditional reinforcement learning techniques.

https://arxiv.org/pdf/2602.16699
https://github.com/Wenwen-D/env-explorer
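
The underlying decision rule boils down to a few lines: with a calibrated probability of being correct, the agent explores only while the expected reduction in loss from one more probe exceeds the probe's cost. The penalty, cost, and confidence-gain numbers in this sketch are purely illustrative, not drawn from the paper.

```python
# Toy sketch of a cost-aware stopping rule; all numbers are illustrative.

def expected_loss_if_commit(p_correct, wrong_penalty=10.0):
    return (1 - p_correct) * wrong_penalty

def should_explore(p_correct, probe_cost, p_correct_after_probe):
    """Explore iff the probe is expected to cut loss by more than it costs."""
    gain = (expected_loss_if_commit(p_correct)
            - expected_loss_if_commit(p_correct_after_probe))
    return gain > probe_cost

# Agent loop: each unit-test run costs 1.0 and (by assumption) lifts the
# calibrated confidence; the agent stops when probing is no longer worth it.
p, spent = 0.60, 0.0
while should_explore(p, probe_cost=1.0, p_correct_after_probe=min(0.99, p + 0.15)):
    spent += 1.0
    p = min(0.99, p + 0.15)          # confidence after the probe
print(f"committed at confidence {p:.2f} after spending {spent:.0f} units")
```

Raising the wrong-answer penalty or cheapening probes makes the same rule explore longer, which is exactly the accuracy-versus-cost trade-off the framework tunes.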


GPSBench: Do Large Language Models Understand GPS Coordinates?

Large Language Models are increasingly integrated into location-aware applications, yet their capacity to perform genuine spatial reasoning using GPS coordinates has remained largely underexplored. To systematically assess these capabilities, researchers developed GPSBENCH, a comprehensive evaluation framework consisting of 57,800 samples across 17 tasks that test both pure geometric coordinate computations and applied geographic reasoning. An evaluation of 14 advanced models demonstrates that while they possess robust coarse-grained geographic knowledge, particularly at the country level, they lack the dense coordinate-to-city mappings required for precise localization and struggle significantly with complex spherical geometry. Furthermore, the models exhibit robustness to coordinate noise, suggesting they rely on generalized spatial representations rather than merely memorizing training data. Ultimately, although augmenting prompts with GPS coordinates enhances performance on downstream spatial tasks, attempting to finetune models specifically for GPS reasoning introduces a capability trade-off that improves mathematical geometric computation at the expense of integrated real-world geographic knowledge.

https://arxiv.org/pdf/2602.16105
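
For context on what the "pure geometric" tasks demand, the canonical great-circle computation is the haversine formula, sketched below; this is standard geodesy rather than anything specific to the benchmark itself.

```python
import math

# Great-circle distance between two GPS coordinates via the haversine
# formula, assuming a spherical Earth of radius 6371 km.

def haversine_km(lat1, lon1, lat2, lon2, radius_km=6371.0):
    """Distance in km between two (lat, lon) points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * radius_km * math.asin(math.sqrt(a))

# San Francisco to New York, roughly 4,130 km:
print(f"{haversine_km(37.7749, -122.4194, 40.7128, -74.0060):.0f} km")
```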


Probability-Aware Parking Selection

Current navigation systems typically calculate travel times based solely on the driving duration to a destination, completely ignoring the time required to search for a parking spot and walk to the final location. This omission not only frustrates drivers and contributes to urban congestion, but it also creates a misleading comparison between driving and public transit, which inherently includes walk times. To resolve this, researchers have introduced a probability-aware parking selection model that uses a dynamic programming framework to minimize the total expected time-to-arrive. By evaluating parking availability as a lot-level probability and modeling the decision-making process as a Markov decision process, the system intelligently directs drivers to optimal parking locations rather than straight to their destinations. Even when parking availability is estimated through intermittent data from connected vehicles, the model maintains a low mean absolute error rate of under seven percent. Simulations using real-world parking data from Seattle demonstrate that these probability-aware navigation strategies can reduce total travel time by up to 66 percent compared to traditional routing methods. Ultimately, establishing time-to-arrive as a unified metric provides a much more accurate reflection of personal vehicle travel, revealing that real-world trips take up to 123 percent longer than naive time-to-drive estimates currently suggest.

https://arxiv.org/pdf/2601.00521
https://github.com/chickert/
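
The expected time-to-arrive objective reduces to a simple backward recursion once the visiting order of lots is fixed: at each lot you either find a space (with probability p) and walk in, or drive on to the next option. The lot probabilities, drive and walk times, and fallback below are invented; the paper's Markov decision process additionally optimizes the visiting order itself.

```python
# Back-of-the-envelope sketch of the expected time-to-arrive recursion.
# Lot data is invented for illustration.

lots = [  # (p_available, drive_to_lot_min, walk_to_destination_min)
    (0.3, 2.0, 1.0),
    (0.6, 4.0, 3.0),
    (0.9, 6.0, 8.0),
]
FALLBACK_MIN = 25.0   # guaranteed garage if every lot is full

def expected_tta(lots, fallback=FALLBACK_MIN):
    """Backward induction: E[time] from each lot onward."""
    e_next = fallback
    for p, drive, walk in reversed(lots):
        # drive to this lot, then either park and walk, or continue on
        e_next = drive + p * walk + (1 - p) * e_next
    return e_next

print(f"expected time-to-arrive: {expected_tta(lots):.1f} min")
# Compare orderings by permuting `lots`; the best order minimizes E[TTA].
```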


Revolutionizing Long-Term Memory in AI: New Horizons With High-Capacity and High-Speed Storage

This paper investigates advanced memory architectures essential for developing artificial superintelligence by challenging the prevailing "extract then store" paradigm, which inherently risks losing valuable information during the extraction process. Instead, the authors advocate for the "Store Then ON-demand Extract" approach, which involves retaining raw, unfiltered experiences in their entirety to allow flexible, cross-task information retrieval without data loss. Furthermore, the researchers propose two complementary strategies to enhance artificial intelligence learning: "deeper insight discovery," which applies statistical processing to multiple probabilistic experiences to improve decision-making accuracy in uncertain environments, and "experience memory sharing" across multiple agents, which drastically reduces the computational burden and time required for individual trial-and-error learning. Although these intuitive methods demonstrate significant performance improvements in preliminary experiments, realizing their full potential will require overcoming substantial technological hurdles, including the need for unprecedented storage capacities, faster inference processing, and more sophisticated infrastructure for comprehensive data recall and secure memory sharing.

https://arxiv.org/pdf/2602.16192


How AI Coding Agents Communicate: A Study of PR Descriptions and Human Review Responses

The rapid integration of artificial intelligence in software development has introduced autonomous coding agents that not only write code but also generate complete pull requests, necessitating a closer look at how these tools communicate with human reviewers. A recent empirical study analyzed over 33,000 pull requests generated by five distinct AI agents, including GitHub Copilot, OpenAI Codex, and Claude Code, to determine how their stylistic differences influence human evaluation. Researchers discovered that these agents exhibit highly diverse communication strategies, ranging from highly structured descriptions utilizing headers and lists to verbose, code-centric explanations that lack organizational formatting. Consequently, these stylistic variations significantly impact reviewer engagement, feedback sentiment, and ultimate project outcomes. Specifically, agents that generate well-structured and concise pull request descriptions, such as OpenAI Codex, achieve considerably higher merge rates and faster review times compared to those producing less organized submissions. Ultimately, the findings emphasize that in human-AI collaborative programming, an agent's ability to clearly articulate and organize its modifications is just as critical for successful project integration as the functional accuracy of the underlying code itself.

https://arxiv.org/pdf/2602.17084

More AI paper summaries: AI Papers Podcast Daily on YouTube


Stay Connected

If you found this useful, share it with a friend who's into AI!

Subscribe to Daily AI Rundown on Substack

Follow me here on Dev.to for more AI content!
