<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: gentic news</title>
    <description>The latest articles on DEV Community by gentic news (@gentic_news).</description>
    <link>https://dev.to/gentic_news</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3838995%2F269c20bb-f64f-483a-862d-49c6481df897.png</url>
      <title>DEV Community: gentic news</title>
      <link>https://dev.to/gentic_news</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/gentic_news"/>
    <language>en</language>
    <item>
      <title>Claude Code Steganography Flagged Chinese Users; Anthropic Rolls Back</title>
      <dc:creator>gentic news</dc:creator>
      <pubDate>Fri, 03 Jul 2026 03:38:23 +0000</pubDate>
      <link>https://dev.to/gentic_news/claude-code-steganography-flagged-chinese-users-anthropic-rolls-back-27d7</link>
      <guid>https://dev.to/gentic_news/claude-code-steganography-flagged-chinese-users-anthropic-rolls-back-27d7</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;Anthropic's Claude Code 2.1.91 used steganography to detect Chinese users. After Reddit exposure, Anthropic rolled back the feature, calling it an experiment against model distillation.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Anthropic's Claude Code 2.1.91, released April 2, 2026, secretly embedded Chinese user detection via steganography in its system prompt. The feature, exposed by Reddit user LegitMichel777, swapped apostrophes and date formats to encode location data invisible to users.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key facts&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Claude Code 2.1.91 released April 2, 2026.&lt;/li&gt;
&lt;li&gt;XOR encryption used key 91 to hide detection code.&lt;/li&gt;
&lt;li&gt;Detection checked timezone against 'Asia/Shanghai' or 'Asia/Urumqi'.&lt;/li&gt;
&lt;li&gt;Anthropic accused DeepSeek, Moonshot AI, MiniMax, and Alibaba of model theft.&lt;/li&gt;
&lt;li&gt;Thariq Shihipar called it an 'experiment' and merged rollback PR.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Anthropic is removing a covert monitoring feature from its programming tool Claude Code after it sparked outrage on social media. &lt;a href="https://the-decoder.com/hidden-code-in-claude-code-secretly-flagged-chinese-users/" rel="noopener noreferrer"&gt;According to The Decoder&lt;/a&gt;, a Reddit post by user LegitMichel777 first exposed the feature, which has been secretly checking since version 2.1.91 whether users with an active proxy are located in China, routing through a Chinese URL, or connected to a Chinese AI lab.&lt;/p&gt;

&lt;p&gt;The data gets transmitted through barely perceptible changes to the system prompt, a form of steganography. Claude Code compares the system timezone against "Asia/Shanghai" or "Asia/Urumqi" and scans the proxy URL for Chinese domains and AI labs. Based on the results, the software tweaks the date format and swaps in a subtly different apostrophe character in the phrase "Today's date is." Users can't see the difference. Anthropic can read it instantly.&lt;/p&gt;

&lt;p&gt;According to LegitMichel777, Anthropic also obfuscated the code using XOR encryption with key 91, keeping it from showing up in a simple text dump. The release notes for version 2.1.91 made no mention of the check.&lt;/p&gt;

&lt;p&gt;The discoverer called the covert transmission of system and proxy data without user knowledge "a fundamental violation of user trust." Since Claude Code has full filesystem and shell access, this would open the door to all kinds of abuse, from remote control to data exfiltration. He also argued that the check is trivial for skilled attackers to bypass, calling its usefulness into question.&lt;/p&gt;

&lt;h2&gt;
  
  
  Key Takeaways
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Anthropic's Claude Code 2.1.91 used steganography to detect Chinese users.&lt;/li&gt;
&lt;li&gt;After Reddit exposure, Anthropic rolled back the feature, calling it an experiment against model distillation.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Anthropic calls it an experiment
&lt;/h2&gt;

&lt;p&gt;Anthropic employee Thariq Shihipar, who works on the Claude Code team, described the feature on X as "an experiment we launched in March that was meant to prevent account abuse from unauthorized resellers and protect against distillation." The team had since shipped stronger protections: "The team has landed stronger mitigations since then and we've actually been meaning to take this down for a while." They had merged the corresponding pull request: "We merged the PR and this should be fully rolled back in tomorrow's release."&lt;/p&gt;

&lt;p&gt;Anthropic doesn't offer its models in China for national security reasons. Still, many Chinese developers access Claude through foreign phone numbers and credit cards. Anthropic had previously accused DeepSeek, Moonshot AI, MiniMax, and Alibaba of using Claude model outputs without permission to train their own language models.&lt;/p&gt;

&lt;p&gt;The steganographic approach mirrors techniques more common in adversarial ML research than production deployment. By embedding signals into invisible formatting changes, Anthropic created a detection mechanism that bypasses standard transparency measures — and that its own team admits was easy to bypass. The incident raises questions about how much monitoring AI coding agents with shell access should perform without explicit user consent.&lt;/p&gt;

&lt;h2&gt;
  
  
  What to watch
&lt;/h2&gt;

&lt;p&gt;Watch for the next Claude Code release to confirm the rollback is complete. Also track whether Anthropic discloses any future monitoring experiments in release notes — and whether regulators in the EU or China probe the data transmission practice.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fr27t46n3jsq244xvfugu.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fr27t46n3jsq244xvfugu.png" alt=" " width="800" height="446"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;Source:&lt;/strong&gt; &lt;a href="https://the-decoder.com/hidden-code-in-claude-code-secretly-flagged-chinese-users/" rel="noopener noreferrer"&gt;the-decoder.com&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published on &lt;a href="https://gentic.news/article/claude-code-steganography-flagged" rel="noopener noreferrer"&gt;gentic.news&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>programming</category>
      <category>tech</category>
      <category>product</category>
    </item>
    <item>
      <title>Nvidia Renting Back GPU Capacity from Neoclouds Signals Demand Softening</title>
      <dc:creator>gentic news</dc:creator>
      <pubDate>Thu, 02 Jul 2026 21:38:15 +0000</pubDate>
      <link>https://dev.to/gentic_news/nvidia-renting-back-gpu-capacity-from-neoclouds-signals-demand-softening-3ibc</link>
      <guid>https://dev.to/gentic_news/nvidia-renting-back-gpu-capacity-from-neoclouds-signals-demand-softening-3ibc</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;Nvidia renting back GPU capacity from neoclouds signals demand softening. Analyst @edzitron claims the market cannot absorb current supply.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Nvidia is renting back its own GPU capacity from neoclouds because demand doesn't exist to sell it, according to tech analyst @edzitron. The claim, if accurate, signals a structural shift in the AI hardware market.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key facts&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Nvidia renting back GPU capacity from neoclouds&lt;/li&gt;
&lt;li&gt;Claim made by tech analyst @edzitron&lt;/li&gt;
&lt;li&gt;Neoclouds like CoreWeave and Lambda Labs affected&lt;/li&gt;
&lt;li&gt;Nvidia's data-center revenue: $30.8B in Q4 2025&lt;/li&gt;
&lt;li&gt;Scale of rented-back capacity undisclosed&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Nvidia is renting back its own GPU capacity from neoclouds because the demand doesn't exist to sell it, according to tech analyst @edzitron. The arrangement suggests that neoclouds—data-center operators that lease Nvidia GPUs on demand—are sitting on unsold inventory. &lt;a href="https://x.com/edzitron/status/2072526455880155613" rel="noopener noreferrer"&gt;According to @edzitron&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;If true, it flips the narrative of GPU scarcity on its head: the chipmaker is effectively acting as its own customer. Neoclouds like CoreWeave and Lambda Labs previously raised billions to buy Nvidia hardware, betting on insatiable AI demand. Now, those same operators may be struggling to find end-users, forcing Nvidia to absorb the capacity.&lt;/p&gt;

&lt;p&gt;Nvidia has not publicly commented on the claim, and the scale of the rented-back capacity is undisclosed. The company's fiscal Q1 2026 earnings, due in May, will be the first hard data point. Any mention of inventory repurchases or reduced forward commitments would confirm the trend.&lt;/p&gt;

&lt;h3&gt;
  
  
  What This Means
&lt;/h3&gt;

&lt;p&gt;This is not a collapse, but it is a normalization. Nvidia's data-center revenue hit $30.8 billion in Q4 2025, up 112% year-over-year. [According to Nvidia's earnings] But growth is slowing: Q3 2025 saw 94% growth, and analysts expect Q1 2026 to dip below 80%. The neocloud rental dynamic suggests the market is absorbing supply less easily than the hype cycle implies.&lt;/p&gt;

&lt;h3&gt;
  
  
  The Counterargument
&lt;/h3&gt;

&lt;p&gt;It is possible Nvidia is renting back capacity for internal R&amp;amp;D or to guarantee availability for strategic customers. But @edzitron's framing—that demand simply isn't there—is the more parsimonious explanation. Neoclouds have little incentive to lease back to Nvidia at a discount unless they cannot find better-paying tenants.&lt;/p&gt;

&lt;h2&gt;
  
  
  Key Takeaways
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Nvidia renting back GPU capacity from neoclouds signals demand softening.&lt;/li&gt;
&lt;li&gt;Analyst @edzitron claims the market cannot absorb current supply.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  What to watch
&lt;/h2&gt;

&lt;p&gt;Watch for Nvidia's fiscal Q1 2026 earnings in May 2026. Any mention of inventory repurchases, reduced forward GPU commitments from neoclouds, or a slowdown in data-center revenue growth below 70% year-over-year would confirm the demand softening thesis. Also monitor CoreWeave's next debt or equity raise—if terms worsen, neoclouds are under pressure.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published on &lt;a href="https://gentic.news/article/nvidia-renting-back-gpu-capacity" rel="noopener noreferrer"&gt;gentic.news&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>tech</category>
      <category>opinion</category>
      <category>analysis</category>
    </item>
    <item>
      <title>NVIDIA Blackwell Cuts DeepSeek V4 Token Costs 5x in One Month</title>
      <dc:creator>gentic news</dc:creator>
      <pubDate>Thu, 02 Jul 2026 21:38:13 +0000</pubDate>
      <link>https://dev.to/gentic_news/nvidia-blackwell-cuts-deepseek-v4-token-costs-5x-in-one-month-4gpk</link>
      <guid>https://dev.to/gentic_news/nvidia-blackwell-cuts-deepseek-v4-token-costs-5x-in-one-month-4gpk</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;NVIDIA claims Blackwell inference stack cut DeepSeek V4 token costs 5x in one month, per a newly published report shared by @rohanpaul_ai.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;NVIDIA's Blackwell inference stack slashed DeepSeek V4 token costs by up to 5x in one month. &lt;a href="https://x.com/rohanpaul_ai/status/2072085395672817943" rel="noopener noreferrer"&gt;According to @rohanpaul_ai&lt;/a&gt;, a newly published NVIDIA report claims the dramatic reduction.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key facts&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;5x reduction in DeepSeek V4 token costs in one month&lt;/li&gt;
&lt;li&gt;NVIDIA report claims Blackwell inference stack as the cause&lt;/li&gt;
&lt;li&gt;DeepSeek V4 has 1.5 trillion parameters, 370B active per token&lt;/li&gt;
&lt;li&gt;Prior estimated inference cost: $0.50 per million tokens on H100&lt;/li&gt;
&lt;li&gt;Report shared via @rohanpaul_ai on X, not peer-reviewed&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The claim, sourced from an NVIDIA report shared by @rohanpaul_ai on X, positions Blackwell as a significant leap in inference efficiency for large language models. The 5x cost reduction applies to DeepSeek V4, a model released in early 2025 that has been noted for its competitive performance against frontier models from OpenAI and Anthropic.&lt;/p&gt;

&lt;p&gt;NVIDIA has not publicly detailed the specific optimizations—whether they involve FP4 quantization, speculative decoding, or improved tensor core utilization—but the timeline of one month suggests rapid engineering iteration rather than a fundamental architecture change. The report likely compares token costs on Blackwell B200 or B300 GPUs against earlier Hopper H100 deployments.&lt;/p&gt;

&lt;p&gt;This result, if independently verified, would challenge the prevailing narrative that inference costs are plateauing. DeepSeek V4, with its 1.5 trillion parameters and Mixture-of-Experts architecture, is notoriously expensive to serve; a 5x reduction could make it viable for real-time applications at scale.&lt;/p&gt;

&lt;h2&gt;
  
  
  Context and Caveats
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fa3fyrpm38zli4mozzq7x.webp" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fa3fyrpm38zli4mozzq7x.webp" alt="Build with DeepSeek V4 Using NVIDIA Blackwell and GPU-Accelerated ..." width="800" height="418"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;DeepSeek V4, released in February 2025, uses a MoE architecture with 370 billion active parameters per token. Prior reports estimated its inference cost at roughly $0.50 per million tokens on H100 clusters. A 5x reduction would bring that to $0.10 per million tokens, competitive with GPT-4o-mini pricing.&lt;/p&gt;

&lt;p&gt;However, NVIDIA's report is a vendor's internal benchmark, not a peer-reviewed study. The company did not disclose the test methodology, hardware count, or whether the cost includes electricity, cooling, or amortized hardware. Independent validation from cloud providers like CoreWeave or Lambda Labs would strengthen the claim.&lt;/p&gt;

&lt;h2&gt;
  
  
  Strategic Implications
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fvkloyx992qlnfkonbff1.webp" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fvkloyx992qlnfkonbff1.webp" alt="NVIDIA Platform Delivers Lowest Token Cost Enabled by Extreme Co-Design ..." width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The timing is notable. DeepSeek V4 has gained traction among cost-sensitive enterprises, and a 5x inference cost reduction from NVIDIA's latest silicon could accelerate adoption. It also pressures AMD and Intel, whose MI400 and Gaudi 3 chips are targeting similar inference workloads.&lt;/p&gt;

&lt;p&gt;NVIDIA's move mirrors a broader trend: as model sizes grow, inference optimization becomes the key differentiator for hardware vendors. The company's dominance in training (95%+ market share) is now being reinforced in inference, where software optimizations like TensorRT-LLM and Blackwell's hardware features create a moat.&lt;/p&gt;

&lt;h2&gt;
  
  
  What to watch
&lt;/h2&gt;

&lt;p&gt;Watch for independent validation from cloud GPU providers like CoreWeave or Lambda Labs running Blackwell clusters with DeepSeek V4. Also track NVIDIA's Q3 earnings call for any mention of inference revenue share versus training.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;[Updated 02 Jul via gn_gpu_cluster]&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Wccftech reports the 5x reduction was achieved through 'pure Blackwell software tuning,' not hardware changes, according to NVIDIA. This confirms the cost drop came from optimizations like FP4 quantization and speculative decoding in TensorRT-LLM, without requiring new silicon. The report also notes the improvement came 'just one month after launch,' underscoring rapid software iteration on Blackwell B200 GPUs [per Wccftech].&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published on &lt;a href="https://gentic.news/article/nvidia-blackwell-cuts-deepseek-v4" rel="noopener noreferrer"&gt;gentic.news&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>machinelearning</category>
      <category>research</category>
      <category>deeplearning</category>
    </item>
    <item>
      <title>Wayve Launches $85M Employee Tender at $8.5B Valuation</title>
      <dc:creator>gentic news</dc:creator>
      <pubDate>Thu, 02 Jul 2026 15:38:15 +0000</pubDate>
      <link>https://dev.to/gentic_news/wayve-launches-85m-employee-tender-at-85b-valuation-4gl1</link>
      <guid>https://dev.to/gentic_news/wayve-launches-85m-employee-tender-at-85b-valuation-4gl1</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;Wayve launches $85M employee tender at $8.5B valuation, second liquidity event. Move reflects AI startup trend using tenders for retention.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Wayve launched an $85M employee tender offer at its $8.5B valuation, the UK self-driving startup's second liquidity event. The move reflects a growing trend among AI startups using tenders to retain talent rather than wait for an IPO.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key facts&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Wayve raised $1.2B Series D in February 2026 at $8.5B valuation.&lt;/li&gt;
&lt;li&gt;$85M employee tender offer is second liquidity event.&lt;/li&gt;
&lt;li&gt;Headcount doubled to 1,200 over the past year.&lt;/li&gt;
&lt;li&gt;Robotaxi pilots with Uber planned for later 2026.&lt;/li&gt;
&lt;li&gt;Nissan integration targeted for 2027 driver-assist systems.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Wayve, a U.K.-based self-driving tech startup, is allowing its employees to sell a portion of their vested equity. The $85 million tender offer — essentially a structured opportunity for employees to sell shares back to investors — is being led by the company’s existing and new investors at the company’s latest valuation of $8.5 billion &lt;a href="https://techcrunch.com/2026/06/30/wayve-launches-85m-employee-tender-offer-at-8-5b-valuation/" rel="noopener noreferrer"&gt;According to TechCrunch&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;That valuation was set in February when the nine-year-old company raised a $1.2 billion Series D led by Eclipse, Balderton, and SoftBank Vision Fund 2, and included participation from Ontario Teachers’ Pension Plan, Baillie Gifford, Microsoft, Nvidia, and Uber. This is Wayve’s second employee liquidity event. The company previously held a tender offer alongside its $1.05 billion Series C funding round in May 2024.&lt;/p&gt;

&lt;h2&gt;
  
  
  Key Takeaways
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Wayve launches $85M employee tender at $8.5B valuation, second liquidity event.&lt;/li&gt;
&lt;li&gt;Move reflects AI startup trend using tenders for retention.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Tender Offers as a Retention Tool
&lt;/h2&gt;

&lt;p&gt;Wayve’s offering is part of a growing trend of AI startups. Rather than waiting years for an exit, companies are using tender offers as a retention tool, giving employees a reason to stick around rather than jump to a competitor — or start their own shop — the moment their options vest. Other startups that have recently completed employee tender offers include Decagon, ElevenLabs, Linear, and Clay. Clay has run two tenders in the last nine months alone.&lt;/p&gt;

&lt;p&gt;These startups are able to provide employee liquidity primarily because investors are eager to buy more of the equity in these high-growth companies, even at a premium, betting the businesses will be worth even more down the line.&lt;/p&gt;

&lt;h2&gt;
  
  
  Wayve's Technical Approach
&lt;/h2&gt;

&lt;p&gt;Wayve uses a self-learning approach to its autonomous driving. Instead of relying on the prebuilt, high-definition maps most self-driving programs use, its software is an end-to-end neural network that learns to drive purely from data — closer to how a human picks up driving through experience, its founders argue. In pursuit of a “general-purpose” AI driver — one that could, in theory, work across countries, cars, and road conditions — the company has more than doubled its headcount to 1,200 employees over the past year.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F49at1t5uejzqumfo4281.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F49at1t5uejzqumfo4281.jpg" alt="Image Credits:Wayve" width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Wayve is targeting robotaxi pilot launches in partnership with Uber later this year, while separately planning to integrate its AI software into Nissan’s next-generation driver-assist systems starting in 2027.&lt;/p&gt;

&lt;h2&gt;
  
  
  What to watch
&lt;/h2&gt;

&lt;p&gt;Watch for Wayve's robotaxi pilot launch with Uber later this year, and whether the tender's $8.5B valuation holds or gets tested by the public market. Also track if the company files for an IPO in 2027 as a potential exit signal for investors.&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;Source:&lt;/strong&gt; &lt;a href="https://techcrunch.com/2026/06/30/wayve-launches-85m-employee-tender-offer-at-8-5b-valuation/" rel="noopener noreferrer"&gt;techcrunch.com&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published on &lt;a href="https://gentic.news/article/wayve-launches-85m-employee-tender" rel="noopener noreferrer"&gt;gentic.news&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>startup</category>
      <category>business</category>
      <category>funding</category>
    </item>
    <item>
      <title>How to Use MCP Servers for Financial Data</title>
      <dc:creator>gentic news</dc:creator>
      <pubDate>Thu, 02 Jul 2026 15:38:13 +0000</pubDate>
      <link>https://dev.to/gentic_news/how-to-use-mcp-servers-for-financial-data-4d6g</link>
      <guid>https://dev.to/gentic_news/how-to-use-mcp-servers-for-financial-data-4d6g</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;MCP servers turn financial data sources into auditable, replaceable protocol endpoints. For Claude Code users building agentic BFSI systems, this means 90% fewer custom integrations and regulator-ready logging.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  Key Takeaways
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;MCP servers turn financial data sources into auditable, replaceable protocol endpoints.&lt;/li&gt;
&lt;li&gt;For Claude Code users building agentic BFSI systems, this means 90% fewer custom integrations and regulator-ready logging.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  What Changed — MCP Is Now the Standard for Financial Agent Infrastructure
&lt;/h2&gt;

&lt;p&gt;MCP hit 97 million monthly SDK downloads by late 2025. Every major AI provider — Anthropic, OpenAI, Google, Microsoft, Amazon — has adopted it. The IMF published a formal note in April 2026 citing MCP and A2A as the technical foundation for agentic payments.&lt;/p&gt;

&lt;p&gt;For Claude Code users building financial systems, this isn't abstract. The three-layer AI protocol stack — MCP for tools, A2A for agents, WebMCP for web access — is now the consensus architecture for enterprise deployments. If you're writing agents that touch banking data, you need MCP servers.&lt;/p&gt;

&lt;h2&gt;
  
  
  What It Means For You — The N×M Integration Problem Solved
&lt;/h2&gt;

&lt;p&gt;A typical bank with 15 AI agents and 40 financial data sources needs 600 custom integrations. Each breaks when either side updates. Each is a compliance surface that must be independently audited.&lt;/p&gt;

&lt;p&gt;MCP reduces this to 55 connections — 15 agents plus 40 MCP servers. Each server is built and audited once. Agents connect through a standardized JSON-RPC 2.0 wire format.&lt;/p&gt;

&lt;p&gt;For your Claude Code workflows, this means:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Replace 40 brittle API calls&lt;/strong&gt; with one MCP client that talks to 40 servers&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Audit at the protocol layer&lt;/strong&gt; — every agent interaction follows a defined format&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Swap vendors without rewrites&lt;/strong&gt; — A2A makes each agent replaceable&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Try It Now — The BFSI MCP Server Taxonomy
&lt;/h2&gt;

&lt;p&gt;Here's the server taxonomy a typical tier-two bank implementation uses. Each becomes a Claude Code MCP server:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Ffen8iya2ms4arnbcpvrt.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Ffen8iya2ms4arnbcpvrt.png" alt="Cover image for # MCP and A2A in Agentic BFSI Systems: The Complete Implementation Guide" width="800" height="337"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Core Banking MCP Server&lt;/strong&gt; — account balances, transaction history, product holdings. Requires row-level security so each agent only sees authorized customer records.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Credit Intelligence MCP Server&lt;/strong&gt; — credit bureau data, internal scores, debt-to-income calculations. Flag as high-risk under EU AI Act.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Sanctions and AML MCP Server&lt;/strong&gt; — real-time sanctions screening, PEP checks, adverse media. Every call must have immutable timestamps.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Market Data MCP Server&lt;/strong&gt; — real-time prices, volatility surfaces, yield curves. Sub-100ms latency required.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Document Intelligence MCP Server&lt;/strong&gt; — OCR, entity extraction, KYC document verification.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Regulatory Capital MCP Server&lt;/strong&gt; — Basel III/IV calculations, RWA data, LCR/NSFR metrics. Read-heavy — agents query before acting.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Communication MCP Server&lt;/strong&gt; — email, SMS, secure messaging. No agent sends customer comms without routing through this server.&lt;/p&gt;

&lt;h2&gt;
  
  
  Implementation Pattern: Credit Decision Automation
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight json"&gt;&lt;code&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"jsonrpc"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"2.0"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"method"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"tool.call"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"params"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"tool"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"core_banking_api"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"action"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"get_transaction_history"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"arguments"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="nl"&gt;"customer_id"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"CUST-8847291"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="nl"&gt;"period_months"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;24&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="nl"&gt;"include_categories"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="s2"&gt;"income"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"regular_commitments"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"irregular_debits"&lt;/span&gt;&lt;span class="p"&gt;],&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="nl"&gt;"requesting_agent"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"credit_decision_agent"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="nl"&gt;"authorization_context"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{}&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Your Claude Code agent sends this to the Core Banking MCP server. The server handles auth, row-level security, and returns structured JSON. The agent then calls the Credit Intelligence server, the Sanctions server, and the Regulatory Capital server — all through the same protocol. No custom API wrappers. No brittle orchestration code.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Compliance Layer
&lt;/h2&gt;

&lt;p&gt;DORA (effective Jan 17, 2025) requires EU financial institutions to continuously monitor ICT systems. Every agent decision affecting customers or transactions must be logged, classified, and reportable.&lt;/p&gt;

&lt;p&gt;MCP makes this tractable: when every agent interaction follows a defined wire format, audit logging happens at the protocol layer. You don't bolt compliance onto each application — it's built into the connection.&lt;/p&gt;

&lt;h2&gt;
  
  
  For Claude Code Users
&lt;/h2&gt;

&lt;p&gt;Set up your MCP servers in &lt;code&gt;CLAUDE.md&lt;/code&gt;:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;## MCP Servers
- core-banking: http://localhost:3001
- credit-intelligence: http://localhost:3002
- sanctions-aml: http://localhost:3003
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Then your Claude Code agent can call &lt;code&gt;tool.call&lt;/code&gt; on any server without custom integration code. The protocol handles auth, logging, and error handling. You focus on the agent logic, not the plumbing.&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;Source:&lt;/strong&gt; &lt;a href="https://dev.to/nikhil_ramank_152ca48266/-mcp-and-a2a-in-agentic-bfsi-systems-the-complete-implementation-guide-1egp"&gt;dev.to&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;[Updated 02 Jul via gn_mcp_protocol]&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;X (formerly Twitter) has released its own MCP server, enabling AI agents to directly query trending topics, user profiles, and post data via the protocol [per TechCrunch]. This marks the first major social media platform to offer a native MCP endpoint, expanding the protocol's reach beyond traditional BFSI use cases. For Claude Code users, this means a single MCP client can now bridge financial data servers with real-time social sentiment analysis from X, all under the same JSON-RPC 2.0 format.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published on &lt;a href="https://gentic.news/article/how-to-use-mcp-servers-for" rel="noopener noreferrer"&gt;gentic.news&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>programming</category>
      <category>machinelearning</category>
    </item>
    <item>
      <title>Realty Income Launches $6B Data Center JV with Cloud Capital</title>
      <dc:creator>gentic news</dc:creator>
      <pubDate>Thu, 02 Jul 2026 09:38:16 +0000</pubDate>
      <link>https://dev.to/gentic_news/realty-income-launches-6b-data-center-jv-with-cloud-capital-4nap</link>
      <guid>https://dev.to/gentic_news/realty-income-launches-6b-data-center-jv-with-cloud-capital-4nap</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;Realty Income formed a $6B data center JV with Cloud Capital and an institutional investor, signaling REITs are normalizing hyperscale AI infrastructure as core assets.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Realty Income launched a $6 billion data center joint venture with Cloud Capital and an unnamed global institutional investor. The programmatic partnership targets hyperscale data center investments to meet surging AI infrastructure demand.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key facts&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Seed assets valued at over $6 billion&lt;/li&gt;
&lt;li&gt;Programmatic JV with Cloud Capital and institutional investor&lt;/li&gt;
&lt;li&gt;Realty Income is a $50B+ market cap REIT&lt;/li&gt;
&lt;li&gt;Google Cloud spends $11B/year on SpaceX compute&lt;/li&gt;
&lt;li&gt;Data center REITs have surged on AI demand&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Realty Income, the net-lease REIT giant, is pivoting hard into AI infrastructure. The company announced a programmatic joint venture with Cloud Capital and a global institutional investor to invest in hyperscale data centers, with initial seed assets valued at over $6 billion &lt;a href="https://news.google.com/rss/articles/CBMi2wJBVV95cUxNT2tZWVJrbUx3WllWWHBhOWZEbFkwQVo4RkxOS21XTlZjcERCajIyTzhRNG9lQUpXbEFfdVpwZ1VHVDFkeEtHN0UzRFZ3SnFHNWM0YTVqdjREeGl4ckMtV3BEM3M1T1M1QkdIeEdnTnV1dVdVampFVExjc2xjRWpENzhyR2tITEpncVhCS19SaVJMTncySFp0ODFqaVQ1Q2dfQXh4MDZYelUyWDBSdUlEWDBraUlPTGVsQlljTUlTZklqamYxQTd1OTJGTm1XRWN3MVNkeXp1MDdRM3hXX2xNbzRFdnVSWURrUlpNeFQ4WWx1MzIybW55a3VBVm5kSHZPMlJYWDFNTmE1SjVMSkl5OFpQNXlvbG1meEUtS2ZKVExVclF6bWVIcWxWV1YzVXR4aFFJVjZvM0hSUU8zU3B4eVE4UF93Y0NKckZhakhrYVJ4a1FXYUhYbzUzNA?oc=5" rel="noopener noreferrer"&gt;According to the PR Newswire release&lt;/a&gt;.&lt;/p&gt;

&lt;h3&gt;
  
  
  Why This Matters More Than the Press Release Suggests
&lt;/h3&gt;

&lt;p&gt;Realty Income is best known for owning single-tenant retail properties like Walgreens and Dollar General. This deal signals that traditional REITs now view hyperscale data centers as core real estate assets, not speculative plays. The $6 billion seed valuation is small relative to the $200 billion+ annual AI infrastructure spend, but it marks a structural shift: institutional capital is normalizing data center ownership alongside office and industrial.&lt;/p&gt;

&lt;p&gt;Cloud Capital, a data center developer, brings the operational expertise that Realty Income lacks. The unnamed institutional investor—likely a pension fund or sovereign wealth fund—provides the patient capital that hyperscale projects require. The programmatic structure means this isn't a one-off; the JV can acquire additional assets over time.&lt;/p&gt;

&lt;h3&gt;
  
  
  Competitive Landscape
&lt;/h3&gt;

&lt;p&gt;Google Cloud, which has committed $11 billion per year to SpaceX compute as of June 2026, is a major tenant for hyperscale data centers. Realty Income's JV could compete with or partner with Google's data center expansion. Microsoft and Amazon are also building their own capacity, creating a landlord-tenant dynamic that REITs are eager to exploit.&lt;/p&gt;

&lt;p&gt;The deal comes amid a broader trend: data center REITs like Digital Realty and Equinix have seen valuations soar as AI workloads drive demand. Realty Income's entry adds a new, well-capitalized player to the mix, potentially compressing cap rates for stabilized assets.&lt;/p&gt;

&lt;h2&gt;
  
  
  What to watch
&lt;/h2&gt;

&lt;p&gt;Watch for the identity of the unnamed institutional investor, which could signal sovereign wealth fund appetite for AI infrastructure. Also monitor whether Realty Income discloses the JV's target return on equity in its Q3 2026 earnings call.&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;Source:&lt;/strong&gt; &lt;a href="https://news.google.com/rss/articles/CBMi2wJBVV95cUxNT2tZWVJrbUx3WllWWHBhOWZEbFkwQVo4RkxOS21XTlZjcERCajIyTzhRNG9lQUpXbEFfdVpwZ1VHVDFkeEtHN0UzRFZ3SnFHNWM0YTVqdjREeGl4ckMtV3BEM3M1T1M1QkdIeEdnTnV1dVdVampFVExjc2xjRWpENzhyR2tITEpncVhCS19SaVJMTncySFp0ODFqaVQ1Q2dfQXh4MDZYelUyWDBSdUlEWDBraUlPTGVsQlljTUlTZklqamYxQTd1OTJGTm1XRWN3MVNkeXp1MDdRM3hXX2xNbzRFdnVSWURrUlpNeFQ4WWx1MzIybW55a3VBVm5kSHZPMlJYWDFNTmE1SjVMSkl5OFpQNXlvbG1meEUtS2ZKVExVclF6bWVIcWxWV1YzVXR4aFFJVjZvM0hSUU8zU3B4eVE4UF93Y0NKckZhakhrYVJ4a1FXYUhYbzUzNA?oc=5" rel="noopener noreferrer"&gt;news.google.com&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published on &lt;a href="https://gentic.news/article/realty-income-launches-6b-data" rel="noopener noreferrer"&gt;gentic.news&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>startup</category>
      <category>business</category>
      <category>funding</category>
    </item>
    <item>
      <title>Kling AI Nears $3B Round at $18B Valuation, Tencent Investing</title>
      <dc:creator>gentic news</dc:creator>
      <pubDate>Thu, 02 Jul 2026 09:38:13 +0000</pubDate>
      <link>https://dev.to/gentic_news/kling-ai-nears-3b-round-at-18b-valuation-tencent-investing-f4m</link>
      <guid>https://dev.to/gentic_news/kling-ai-nears-3b-round-at-18b-valuation-tencent-investing-f4m</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;Kuaishou-backed Kling AI nears $3B round at $18B valuation, down from $20B target. Tencent invests as HK listing looms within 12 months.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Kuaishou-backed Kling AI is nearing a $3 billion round at an $18 billion valuation, with Tencent participating, per sources. The valuation was cut from an initial $20 billion target set in April, reflecting shifting market sentiment for the AI video generation startup.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key facts&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;$3 billion fundraising round near close&lt;/li&gt;
&lt;li&gt;$18 billion post-money valuation&lt;/li&gt;
&lt;li&gt;Valuation cut from $20 billion target in April&lt;/li&gt;
&lt;li&gt;Tencent among investors in the round&lt;/li&gt;
&lt;li&gt;HK listing process expected within 12 months&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Kuaishou-backed Kling AI is close to completing a US$3 billion fundraising round that would value the company at US$18 billion post-investment, according to people familiar with the matter [per SCMP]. The latest valuation was narrowed from an initial target of US$20 billion set in April, when Chinese short-video giant Kuaishou first planned to spin off Kling AI, according to a source, reflecting the shifting market sentiment for Kling AI. Investors backing this round include Chinese video gaming and social media giant Tencent, another source said.&lt;/p&gt;

&lt;p&gt;Kuaishou disclosed in a May 12 filing with the Hong Kong stock exchange that it was "assessing a proposal to restructure" Kling AI that could involve raising funds from external investors. Kuaishou also expects to start Kling AI's Hong Kong stock exchange listing process in the next 12 months, according to a source close to the deal, with fundraising from the initial public offering going towards the buildout of computing and data centres and the acquisition and retention of talent.&lt;/p&gt;

&lt;p&gt;The $18 billion valuation, while down from the initial $20 billion target, still places Kling AI among the most valuable AI startups globally, especially in the video generation segment. The round comes as competition intensifies in China's AI video generation sector, with rivals including Sora and other domestic players. Tencent's involvement is notable given its existing investments in other Chinese AI startups like Moonshot AI and DeepSeek [per entity relationships], signaling a consolidation play in the space.&lt;/p&gt;

&lt;p&gt;Kuaishou's Hong Kong-listed shares closed up 1.46 per cent at HK$41.60 on Wednesday. Kuaishou, Kling AI and Tencent did not immediately reply to a request for comment.&lt;/p&gt;

&lt;h2&gt;
  
  
  Key Takeaways
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Kuaishou-backed Kling AI nears $3B round at $18B valuation, down from $20B target.&lt;/li&gt;
&lt;li&gt;Tencent invests as HK listing looms within 12 months.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  What to watch
&lt;/h2&gt;

&lt;p&gt;Watch for Kuaishou's next HK exchange filing detailing the restructuring and investor names. The key metric to track is Kling AI's ARR growth trajectory, which the source says soared after its latest AI video model release. A formal listing application within 6-9 months would signal confidence.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F3xbsxdduu6ai30v9f65k.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F3xbsxdduu6ai30v9f65k.jpg" alt="Kling AI is said to be close to completing a US$3 billion fundraising round that would value the company at US$18 billion post-investment. Photo: Shut" width="800" height="533"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;Source:&lt;/strong&gt; &lt;a href="https://www.scmp.com/tech/big-tech/article/3359059/chinas-kling-ai-nears-us3-billion-round-us18-billion-valuation-sources" rel="noopener noreferrer"&gt;scmp.com&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published on &lt;a href="https://gentic.news/article/kling-ai-nears-3b-round-at-18b" rel="noopener noreferrer"&gt;gentic.news&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>startup</category>
      <category>business</category>
      <category>funding</category>
    </item>
    <item>
      <title>Google Cloud Joins MCP: How to Connect Claude Code to BigQuery</title>
      <dc:creator>gentic news</dc:creator>
      <pubDate>Thu, 02 Jul 2026 03:38:24 +0000</pubDate>
      <link>https://dev.to/gentic_news/google-cloud-joins-mcp-how-to-connect-claude-code-to-bigquery-5chh</link>
      <guid>https://dev.to/gentic_news/google-cloud-joins-mcp-how-to-connect-claude-code-to-bigquery-5chh</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;Google Cloud's MCP server lets Claude Code query BigQuery and manage GCS directly. Install it with &lt;code&gt;claude mcp add google-cloud&lt;/code&gt; and authenticate.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  Key Takeaways
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Google Cloud's MCP server lets Claude Code query BigQuery and manage GCS directly.&lt;/li&gt;
&lt;li&gt;Install it with &lt;code&gt;claude mcp add google-cloud&lt;/code&gt; and authenticate.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  What Changed
&lt;/h2&gt;

&lt;p&gt;Google Cloud has adopted the Model Context Protocol (MCP) natively, making it the first major cloud provider to ship an official MCP server for its platform. This means you can now connect Claude Code directly to BigQuery, Cloud Storage (GCS), and Vertex AI without third-party wrappers or custom scripts.&lt;/p&gt;

&lt;p&gt;The MCP ecosystem has exploded to over 13,000 servers as of late June 2026, but quality varies wildly — 54% of those have zero community adoption. Google Cloud's entry is a signal that enterprise-grade MCP servers are here, and they're built for production use.&lt;/p&gt;

&lt;h2&gt;
  
  
  What It Means For You
&lt;/h2&gt;

&lt;p&gt;If you're a Claude Code user working in GCP, this changes your daily workflow. Instead of context-switching to the GCP Console or running separate &lt;code&gt;gcloud&lt;/code&gt; commands, you can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Query BigQuery tables&lt;/strong&gt; directly from Claude Code: "Show me the top 10 customers by revenue from the last quarter"&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;List and manage Cloud Storage buckets&lt;/strong&gt;: "Upload this file to my &lt;code&gt;data-lake&lt;/code&gt; bucket"&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Invoke Vertex AI models&lt;/strong&gt;: "Run a prediction using my deployed model"&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;All of this happens inside your Claude Code session, with the model understanding your GCP resources and schema.&lt;/p&gt;

&lt;h2&gt;
  
  
  Try It Now
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Installation
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;claude mcp add google-cloud
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This adds the official Google Cloud MCP server to your Claude Code configuration. You'll be prompted to authenticate with your GCP credentials — the server uses Application Default Credentials (ADC), so if you already have &lt;code&gt;gcloud&lt;/code&gt; configured, it should work seamlessly.&lt;/p&gt;

&lt;h3&gt;
  
  
  Configure Access
&lt;/h3&gt;

&lt;p&gt;By default, the server has access to all projects your credentials can reach. To scope it down (recommended for production), edit your &lt;code&gt;~/.claude/settings.json&lt;/code&gt;:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight json"&gt;&lt;code&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"mcpServers"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"google-cloud"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="nl"&gt;"command"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"npx"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="nl"&gt;"args"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="s2"&gt;"@google-cloud/mcp-server"&lt;/span&gt;&lt;span class="p"&gt;],&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="nl"&gt;"env"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"GOOGLE_CLOUD_PROJECT"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"my-project-id"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"MCP_ALLOWED_SERVICES"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"bigquery,storage"&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Example Workflow
&lt;/h3&gt;

&lt;p&gt;Start a Claude Code session and try:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;&amp;gt; List my BigQuery datasets in project my-project-id
&amp;gt; What tables are in the `analytics` dataset?
&amp;gt; Write a SQL query to find the top 10 products by sales in the `orders` table and execute it
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Claude Code will use the MCP server to discover your datasets, inspect table schemas, run queries, and return results — all without you leaving the terminal.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why This Matters
&lt;/h2&gt;

&lt;p&gt;Google's adoption of MCP is significant. They've invested heavily in their own agent frameworks (ADK Go 2.0, released last week with graph-based workflows and human-in-the-loop). Choosing to support MCP alongside their own tools signals that MCP is becoming the universal connector for AI systems.&lt;/p&gt;

&lt;p&gt;It also means Claude Code users get first-class access to GCP without waiting for community-built servers that may or may not be maintained. Given that 54% of MCP servers have zero adoption, relying on official servers from major vendors is the safe bet.&lt;/p&gt;

&lt;h2&gt;
  
  
  Limitations
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Read-only by default&lt;/strong&gt;: The server starts in read-only mode for BigQuery and Storage. You need to explicitly enable write operations via environment variables.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Vertex AI is limited&lt;/strong&gt;: Currently supports model invocation but not training or deployment workflows.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;No IAM management&lt;/strong&gt;: You can't modify permissions through the MCP server — use the GCP Console or &lt;code&gt;gcloud&lt;/code&gt; for that.&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;&lt;strong&gt;Source:&lt;/strong&gt; &lt;a href="https://news.google.com/rss/articles/CBMipgFBVV95cUxNdXBzdl83SWZMSEVrb1RDb3M2cDU2aXNuVEVoTjB4ZU45aXBNRG5aaTBVZFFyc1R1MnBhZUJNbVp2SHBKWnlXWG5MNzBwdDVXVDM4LW9GaGx2WXdtLXF3dlJPY0paeWRQX0JEamkyaTF2RGRhRlRUdk9tWmxaYWJGVno4SUJtdmlBc0dIZng3Q1BvdjVOa0kyQjZkRjBpdnpPc05HdV93?oc=5" rel="noopener noreferrer"&gt;news.google.com&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published on &lt;a href="https://gentic.news/article/google-cloud-joins-mcp-how-to" rel="noopener noreferrer"&gt;gentic.news&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>programming</category>
      <category>machinelearning</category>
    </item>
    <item>
      <title>BayesBench: LLMs Match Bayesian Posteriors But Fail Downstream Prediction</title>
      <dc:creator>gentic news</dc:creator>
      <pubDate>Wed, 01 Jul 2026 21:38:16 +0000</pubDate>
      <link>https://dev.to/gentic_news/bayesbench-llms-match-bayesian-posteriors-but-fail-downstream-prediction-30aa</link>
      <guid>https://dev.to/gentic_news/bayesbench-llms-match-bayesian-posteriors-but-fail-downstream-prediction-30aa</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;BayesBench tests 7 LLMs on multi-turn Bayesian reasoning. Scaling improves latent inference but not prediction, exposing a critical gap for agentic deployment.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;BayesBench tests seven LLMs from 3B to 70B parameters on multi-turn Bayesian reasoning. Scaling improves latent inference but not downstream prediction, exposing a gap in rational belief updating.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key facts&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;BayesBench evaluates 7 LLMs (3B–70B) on multi-turn belief updating.&lt;/li&gt;
&lt;li&gt;Three tasks: Bayesian estimation, prediction, and latent-framed prediction.&lt;/li&gt;
&lt;li&gt;Scaling improves latent inference but not downstream prediction.&lt;/li&gt;
&lt;li&gt;Updates occasionally match Bayesian posterior but fail in prediction.&lt;/li&gt;
&lt;li&gt;Latent-framed prediction requires joint inference over persona and state.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The &lt;a href="https://arxiv.org/abs/2606.30850" rel="noopener noreferrer"&gt;BayesBench paper&lt;/a&gt;, published June 29, 2026, introduces a suite of simulation environments to evaluate how LLMs update beliefs across multiple conversation turns. The authors—Samanta, Magesh, Lancewicki, et al.—argue that most benchmarks score only final-turn answers, ignoring the trajectory of belief updates. BayesBench probes three tasks: Bayesian estimation (inferring an unknown parameter from sequential evidence), Bayesian prediction (turning latent beliefs into outcome forecasts), and latent-framed Bayesian prediction (joint inference over latent state and user persona).&lt;/p&gt;

&lt;p&gt;Results across seven LLMs (3B–70B) show that scaling improves latent inference and evidence accumulation, with updates “occasionally matching the Bayesian posterior.” However, the paper notes a critical failure: “These gains do not reliably carry over to downstream prediction, exposing a gap between inferring latent structure and using it to rationally update beliefs about the target outcome.” This mirrors a pattern seen in other recent evaluations—models can identify patterns but fail to apply them in dynamic contexts.&lt;/p&gt;

&lt;h3&gt;
  
  
  The Inference-Prediction Gap
&lt;/h3&gt;

&lt;p&gt;here is that scaling alone doesn't close the gap between latent inference and rational prediction. Larger models get better at inferring hidden parameters from evidence, but this doesn't translate to better forecasts. The latent-framed prediction task, which adds a user-persona layer, further degrades performance, suggesting that joint inference over multiple latent variables remains a challenge. This echoes findings from &lt;a href="https://gentic.news/rift-bench-tests-45-agentic" rel="noopener noreferrer"&gt;RIFT-Bench&lt;/a&gt; (published June 24, 2026), which showed that agentic systems struggle with dynamic red-teaming across turns.&lt;/p&gt;

&lt;h3&gt;
  
  
  Implications for Multi-Turn Deployment
&lt;/h3&gt;

&lt;p&gt;For AI engineers deploying LLMs in multi-turn agents—customer support, tutoring, or medical diagnosis—BayesBench highlights a concrete failure mode: models may correctly infer the environment but fail to act on that inference. The paper doesn't release code or data yet, but the methodology is reproducible. Watch for follow-up work that attempts to bridge the inference-prediction gap, possibly via chain-of-thought prompting or fine-tuning on Bayesian update trajectories.&lt;/p&gt;

&lt;h3&gt;
  
  
  Limitations
&lt;/h3&gt;

&lt;p&gt;The study tests only seven models (3B–70B), excluding frontier models like GPT-4 or Claude 3. The authors do not disclose model names beyond size ranges, which limits reproducibility. The benchmark's ecological validity is also unclear—real multi-turn conversations involve more complex evidence structures than the simulation environments.&lt;/p&gt;

&lt;h2&gt;
  
  
  Key Takeaways
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;BayesBench tests 7 LLMs on multi-turn Bayesian reasoning.&lt;/li&gt;
&lt;li&gt;Scaling improves latent inference but not prediction, exposing a critical gap for agentic deployment.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  What to watch
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Frptp65uqkuexbmi75nw1.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Frptp65uqkuexbmi75nw1.jpg" alt="Untitled (Match-Woman I) (1920) // Francis Picabia French, 1879–1953" width="800" height="1031"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Watch for follow-up work that releases code and data, enabling reproducible testing. Also monitor whether frontier models (e.g., GPT-4, Claude 3) show a similar gap or close it with larger scale or specialized training.&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;Source:&lt;/strong&gt; &lt;a href="https://arxiv.org/abs/2606.30850" rel="noopener noreferrer"&gt;arxiv.org&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published on &lt;a href="https://gentic.news/article/bayesbench-llms-match-bayesian" rel="noopener noreferrer"&gt;gentic.news&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>machinelearning</category>
      <category>research</category>
      <category>deeplearning</category>
    </item>
    <item>
      <title>Google ADK Go 2.0 Adds Graph Engine, Human-in-Loop for Agents</title>
      <dc:creator>gentic news</dc:creator>
      <pubDate>Wed, 01 Jul 2026 21:38:13 +0000</pubDate>
      <link>https://dev.to/gentic_news/google-adk-go-20-adds-graph-engine-human-in-loop-for-agents-426k</link>
      <guid>https://dev.to/gentic_news/google-adk-go-20-adds-graph-engine-human-in-loop-for-agents-426k</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;Google released ADK Go 2.0 on July 2, 2026, adding a graph-based workflow engine and human-in-the-loop for multi-agent orchestration, targeting production reliability.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Google released ADK Go 2.0 on July 2, 2026, adding a graph-based workflow engine and built-in human-in-the-loop for multi-agent orchestration. The update targets production reliability for enterprises deploying agent systems on Google Cloud.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key facts&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Released July 2, 2026, as ADK Go 2.0.&lt;/li&gt;
&lt;li&gt;Graph-based workflow engine replaces linear chain model.&lt;/li&gt;
&lt;li&gt;Built-in human-in-the-loop for approval/review steps.&lt;/li&gt;
&lt;li&gt;Dynamic orchestration enables runtime agent changes.&lt;/li&gt;
&lt;li&gt;Runs on Google Cloud with Vertex AI integration.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Google released ADK Go 2.0 on July 2, 2026, adding a graph-based workflow engine and built-in human-in-the-loop for multi-agent orchestration. The update targets production reliability for enterprises deploying agent systems on Google Cloud.&lt;/p&gt;

&lt;h2&gt;
  
  
  Graph-Based Workflow Engine
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F46kcsg5uiym2j04du5zd.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F46kcsg5uiym2j04du5zd.jpeg" alt="New Google ADK 2.0 Introduces Graph Based Workflows | by Kartik Marwah ..." width="799" height="451"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The core addition is a graph-based workflow engine that lets developers define agent execution as a directed acyclic graph (DAG) of steps. Each node can be an agent call, a conditional branch, or a parallel fork. This replaces the linear chain model from version 1.0, which struggled with complex branching logic &lt;a href="https://news.google.com/rss/articles/CBMiZkFVX3lxTE5EeFJFaWhEM3V4UjlyZFo3S0dtRTJSOHA4MUZMX1FmbmppWXdyVjZtQWg1QjBqRUdlRXlUUTAzdFRMcTZWRWxESUpxQUtMeVdhallBSEthWDFZalozdTN6dDhyZHpTUQ?oc=5" rel="noopener noreferrer"&gt;According to the source&lt;/a&gt;.&lt;/p&gt;

&lt;h2&gt;
  
  
  Human-in-the-Loop
&lt;/h2&gt;

&lt;p&gt;ADK Go 2.0 includes built-in human-in-the-loop (HITL) capabilities, allowing developers to inject approval or review steps at any point in the workflow. The HITL system pauses execution until a human operator approves or rejects an action, logs the decision, and resumes from the approved branch. This is critical for regulated industries like finance and healthcare where agent autonomy is restricted.&lt;/p&gt;

&lt;h2&gt;
  
  
  Dynamic Orchestration
&lt;/h2&gt;

&lt;p&gt;Dynamic orchestration enables agents to be added, removed, or reordered at runtime without redeploying the entire workflow. This allows teams to A/B test agent configurations, roll back failing agents, or scale specific agents based on load. Google claims this reduces mean time to recovery (MTTR) for agent failures from hours to minutes.&lt;/p&gt;

&lt;h2&gt;
  
  
  Production Focus
&lt;/h2&gt;

&lt;p&gt;The framework is designed to run on Google Cloud infrastructure, integrating with Vertex AI Agent Builder and Cloud Run. Google did not disclose performance benchmarks or adoption numbers for the 1.0 release. The company competes with Microsoft's AutoGen, Anthropic's Claude agent framework, and OpenAI's Agents SDK.&lt;/p&gt;

&lt;h2&gt;
  
  
  Unique Take
&lt;/h2&gt;

&lt;p&gt;Google is racing to catch up in the agent framework space after lagging behind Microsoft and OpenAI. AutoGen 0.4 from Microsoft already supports DAG-based workflows and HITL. ADK Go 2.0's dynamic orchestration is a differentiator, but the real test is whether enterprises will adopt it over AutoGen's larger ecosystem. The HITL implementation is also less mature than Anthropic's Claude agent framework, which has been in production at financial institutions since early 2026.&lt;/p&gt;

&lt;h2&gt;
  
  
  What to watch
&lt;/h2&gt;

&lt;p&gt;Watch for enterprise adoption metrics in Google Cloud's Q3 2026 earnings call. If ADK Go 2.0 drives a measurable increase in Vertex AI Agent Builder usage, it signals competitive traction against AutoGen. Also track whether Google releases performance benchmarks comparing ADK Go 2.0 to AutoGen 0.4 on latency and throughput.&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;Source:&lt;/strong&gt; &lt;a href="https://news.google.com/rss/articles/CBMiZkFVX3lxTE5EeFJFaWhEM3V4UjlyZFo3S0dtRTJSOHA4MUZMX1FmbmppWXdyVjZtQWg1QjBqRUdlRXlUUTAzdFRMcTZWRWxESUpxQUtMeVdhallBSEthWDFZalozdTN6dDhyZHpTUQ?oc=5" rel="noopener noreferrer"&gt;news.google.com&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published on &lt;a href="https://gentic.news/article/google-adk-go-2-0-adds-graph" rel="noopener noreferrer"&gt;gentic.news&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>programming</category>
      <category>tech</category>
      <category>product</category>
    </item>
    <item>
      <title>Pipedrive Ships Native MCP Server, CRM Joins AI Agent Protocol</title>
      <dc:creator>gentic news</dc:creator>
      <pubDate>Wed, 01 Jul 2026 15:38:14 +0000</pubDate>
      <link>https://dev.to/gentic_news/pipedrive-ships-native-mcp-server-crm-joins-ai-agent-protocol-3j7d</link>
      <guid>https://dev.to/gentic_news/pipedrive-ships-native-mcp-server-crm-joins-ai-agent-protocol-3j7d</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;Pipedrive launched a native MCP server for CRM agent access. The move follows Microsoft and X in adopting Anthropic's open standard, which crossed 13,000 servers in June.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Pipedrive launched a native Model Context Protocol (MCP) server on July 6, 2026, according to Business Wire. The CRM provider becomes the latest enterprise SaaS vendor to adopt Anthropic's open standard for connecting AI agents to business data.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key facts&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Pipedrive MCP server launched July 6, 2026&lt;/li&gt;
&lt;li&gt;MCP ecosystem crossed 13,000 servers in June 2026&lt;/li&gt;
&lt;li&gt;54% of 39,762 MCP servers have zero community adoption&lt;/li&gt;
&lt;li&gt;Microsoft and X also shipped MCP servers in June 2026&lt;/li&gt;
&lt;li&gt;Anthropic introduced MCP open standard in November 2024&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Pipedrive launched a native Model Context Protocol (MCP) server on July 6, 2026, according to &lt;a href="https://news.google.com/rss/articles/CBMi3AFBVV95cUxOQklYMHBvbXNCR0pfM283QXVwT18xYWppa2lmTjVXMDlFU1Rzektxa0tucWprVFpaZ0xXU0pEVWloRDFZT095ZFFyU2FnNFF0dXpXWW5jbzR3THVIWU9KV28zZnVvN0I3am1wU1ZwTXY5UzVfVDdVZ3MxUS1PS0dqako3amI1UTZuVjFhUkJaRmJJYnM1WlNyVEpBcW1tNUlTNmpETFgxTWw0RE9PQ180Z2xaY0sxQThYa3FHU1JwQWxMMnhvbUZ1X3BWRi1XS0dyeEI4WGpUVy1fWTYt?oc=5" rel="noopener noreferrer"&gt;Business Wire&lt;/a&gt;. The MCP server exposes Pipedrive's CRM data and actions — contacts, deals, activities, leads, and pipelines — as tools AI agents can call. This means an AI assistant like Claude can now read a Pipedrive deal, update its stage, log an activity, or create a contact without leaving the chat interface.&lt;/p&gt;

&lt;p&gt;The move places Pipedrive alongside a rapidly growing roster of SaaS platforms integrating MCP. The MCP ecosystem crossed 13,000 servers in late June 2026, per recent reporting — though a June 28 analysis found 54% of 39,762 MCP servers had zero community adoption, suggesting many are experimental or single-use deployments. Pipedrive's server targets production CRM workflows, not hobbyist projects.&lt;/p&gt;

&lt;h2&gt;
  
  
  Key Takeaways
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Pipedrive launched a native MCP server for CRM agent access.&lt;/li&gt;
&lt;li&gt;The move follows Microsoft and X in adopting Anthropic's open standard, which crossed 13,000 servers in June.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Enterprise SaaS MCP Adoption Accelerates
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fvkw5szku32qqlsmx2yid.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fvkw5szku32qqlsmx2yid.png" alt="Build Agents using Model Context Protocol on Azure | Microsoft Learn" width="799" height="392"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Pipedrive is not alone. Microsoft shipped a Dynamics 365 Commerce MCP server for agentic commerce workloads. X (formerly Twitter) released an MCP server in late June to expose its platform data to AI tools. The pattern is clear: enterprise SaaS vendors are racing to make their platforms addressable by AI agents via the MCP standard, rather than building proprietary agent integrations for each model provider.&lt;/p&gt;

&lt;p&gt;Anthropic introduced MCP in November 2024 as an open standard and open-source framework. The protocol standardizes how AI systems connect to external tools and data sources — analogous to how USB-C standardized peripheral connections, but for LLM tool use. &lt;a href="https://news.google.com/rss/articles/CBMipgFBVV95cUxNdXBzdl83SWZMSEVrb1RDb3M2cDU2aXNuVEVoTjB4ZU45aXBNRG5aaTBVZFFyc1R1MnBhZUJNbVp2SHBKWnlXWG5MNzBwdDVXVDM4LW9GaGx2WXdtLXF3dlJPY0paeWRQX0JEamkyaTF2RGRhRlRUdk9tWmxaYWJGVno4SUJtdmlBc0dIZng3Q1BvdjVOa0kyQjZkRjBpdnpPc05HdV93?oc=5" rel="noopener noreferrer"&gt;EE Times covered MCP's emergence as a common framework for enterprise AI systems&lt;/a&gt;.&lt;/p&gt;

&lt;h2&gt;
  
  
  What the Pipedrive MCP Server Actually Does
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fu9ans0jgafn97xkb2ee6.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fu9ans0jgafn97xkb2ee6.png" alt="Create MCP Servers Driven by Workflows - Azure Logic …" width="800" height="600"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The server lets AI agents perform CRM operations through natural language. A sales rep could tell an assistant: "Find all deals in Pipedrive that are stuck in negotiation for more than 30 days and update their priority to high." The MCP server handles authentication, data retrieval, and action execution. Pipedrive did not disclose whether the server is self-hosted or cloud-managed, nor did it detail pricing or rate limits.&lt;/p&gt;

&lt;p&gt;For developers, the MCP server is available via the standard MCP client configuration — typically a JSON file pointing to the server's endpoint. The server supports the full MCP specification including tool definitions, resource templates, and prompts. Pipedrive's documentation (not yet publicly linked in the announcement) will presumably cover setup steps and authentication flows.&lt;/p&gt;

&lt;h2&gt;
  
  
  What to watch
&lt;/h2&gt;

&lt;p&gt;Watch for Pipedrive's MCP server adoption metrics in Q3 2026 and whether competitors like Salesforce or HubSpot ship their own MCP servers. Also track the MCP ecosystem's server quality distribution — the 54% zero-adoption figure suggests a shakeout is coming.&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;Source:&lt;/strong&gt; &lt;a href="https://news.google.com/rss/articles/CBMi3AFBVV95cUxOQklYMHBvbXNCR0pfM283QXVwT18xYWppa2lmTjVXMDlFU1Rzektxa0tucWprVFpaZ0xXU0pEVWloRDFZT095ZFFyU2FnNFF0dXpXWW5jbzR3THVIWU9KV28zZnVvN0I3am1wU1ZwTXY5UzVfVDdVZ3MxUS1PS0dqako3amI1UTZuVjFhUkJaRmJJYnM1WlNyVEpBcW1tNUlTNmpETFgxTWw0RE9PQ180Z2xaY0sxQThYa3FHU1JwQWxMMnhvbUZ1X3BWRi1XS0dyeEI4WGpUVy1fWTYt?oc=5" rel="noopener noreferrer"&gt;news.google.com&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published on &lt;a href="https://gentic.news/article/pipedrive-ships-native-mcp-server" rel="noopener noreferrer"&gt;gentic.news&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>programming</category>
      <category>tech</category>
      <category>product</category>
    </item>
    <item>
      <title>3 MCP Gateway Security Gaps LiteLLM's Audit Found (And How to Fix Them in</title>
      <dc:creator>gentic news</dc:creator>
      <pubDate>Wed, 01 Jul 2026 15:38:13 +0000</pubDate>
      <link>https://dev.to/gentic_news/3-mcp-gateway-security-gaps-litellms-audit-found-and-how-to-fix-them-in-2do7</link>
      <guid>https://dev.to/gentic_news/3-mcp-gateway-security-gaps-litellms-audit-found-and-how-to-fix-them-in-2do7</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;LiteLLM's audit revealed 3 MCP gateway gaps: fail-open resolver, unpinned servers, opt-in least-privilege. Fix them in Claude Code with version pinning and allowed_tools.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  Key Takeaways
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;LiteLLM's audit revealed 3 MCP gateway gaps: fail-open resolver, unpinned servers, opt-in least-privilege.&lt;/li&gt;
&lt;li&gt;Fix them in Claude Code with version pinning and allowed_tools.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  The Audit That Found What 'Does It Work?' Misses
&lt;/h2&gt;

&lt;p&gt;Most teams wiring an LLM gateway to MCP tools ask one question: does it work? A production-readiness audit of LiteLLM—a widely deployed open-source LLM proxy—showed that question isn't enough. The real test is: when the authorization check throws an unexpected exception, does the gateway deny the call or allow it?&lt;/p&gt;

&lt;p&gt;That single line of behavior separates a mature platform from an incident waiting for a quiet Tuesday.&lt;/p&gt;

&lt;p&gt;The audit, run by Willian Pinho, scored LiteLLM across seven dimensions: tool-access governance, fail-close behavior, MCP onboarding, observability, multi-LLM routing, secrets, and production-readiness. Result: four green, three yellow, zero red. "Production-ready with caveats."&lt;/p&gt;

&lt;p&gt;Here are the three yellow flags—and exactly how to fix them in your Claude Code setup.&lt;/p&gt;

&lt;h2&gt;
  
  
  Yellow #1: The Fail-Open Authorization Resolver
&lt;/h2&gt;

&lt;p&gt;Every per-level permission resolver in LiteLLM fails closed: on an unexpected exception it logs and returns an empty set, which resolves to "no access" downstream. That's the correct posture.&lt;/p&gt;

&lt;p&gt;The exception is the top-level wrapper &lt;code&gt;get_allowed_mcp_servers()&lt;/code&gt;, which returns the allow-all server set on an unexpected error instead of an empty list. The blast radius is bounded to servers an operator already marked as public, but fail-open in an authorization resolver is the single highest-risk class in the framework.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The fix:&lt;/strong&gt; Patch that one function to return an empty set on exception. It's a one-line change plus a regression test.&lt;/p&gt;

&lt;h2&gt;
  
  
  Yellow #2: Unpinned Third-Party MCP Servers
&lt;/h2&gt;

&lt;p&gt;LiteLLM's curated catalog launches stdio servers with floating commands like &lt;code&gt;npx -y @sentry/mcp-server&lt;/code&gt;, with no version, digest, or checksum pin. That's a tampered-package away from a supply-chain incident—OWASP's LLM03.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fdn7w830vo5ywzkbxeigd.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fdn7w830vo5ywzkbxeigd.png" alt="Cover image for I ran an MCP-gateway production-readiness audit on a popular open-source LLM gateway. Here's what it found." width="800" height="336"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The fix in Claude Code:&lt;/strong&gt; When you add MCP servers to your &lt;code&gt;claude_code_mcp_config.json&lt;/code&gt;, always pin the version and digest. For example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight json"&gt;&lt;code&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"mcpServers"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"sentry"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="nl"&gt;"command"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"npx"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="nl"&gt;"args"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="s2"&gt;"@sentry/mcp-server@1.2.3"&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Use &lt;code&gt;@1.2.3&lt;/code&gt; instead of &lt;code&gt;@latest&lt;/code&gt;. Better yet, use a checksum or digest if available. Reject any server that doesn't provide a versioned release.&lt;/p&gt;

&lt;h2&gt;
  
  
  Yellow #3: Per-Tool Least-Privilege is Opt-In
&lt;/h2&gt;

&lt;p&gt;Authorization at the gateway is strong—enforced on caller identity as a strict intersection of key, team, end-user, agent, and org permissions, and the model is kept out of the authorization decision entirely. That defeats prompt-injection-to-tool-call.&lt;/p&gt;

&lt;p&gt;But absent a per-server &lt;code&gt;allowed_tools&lt;/code&gt; allowlist, any caller with access to a server can invoke any tool on it, including write or external ones—the excessive-agency exposure OWASP tracks as LLM06.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The fix in Claude Code:&lt;/strong&gt; Add an &lt;code&gt;allowed_tools&lt;/code&gt; field to each MCP server definition in your config. For example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight json"&gt;&lt;code&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"mcpServers"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"database"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="nl"&gt;"command"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"node"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="nl"&gt;"args"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="s2"&gt;"server.js"&lt;/span&gt;&lt;span class="p"&gt;],&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="nl"&gt;"allowed_tools"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="s2"&gt;"query"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"schema_list"&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This prevents a compromised agent from calling &lt;code&gt;drop_table&lt;/code&gt; or &lt;code&gt;delete_all&lt;/code&gt;. Make this allowlist required at onboarding and validated in CI.&lt;/p&gt;

&lt;h2&gt;
  
  
  What LiteLLM Got Right (So You Know What to Keep)
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Identity and secrets:&lt;/strong&gt; Green. No inline secrets. JWT/OIDC enforced on the actual gateway call path. End-user identity propagates through to MCP handling and spend logs. For MCP tokens, the gateway supports RFC 8693 OAuth token-exchange with audience and scope binding.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Observability:&lt;/strong&gt; Green. OpenTelemetry with dedicated GenAI semantic-convention mapping. Inbound W3C &lt;code&gt;traceparent&lt;/code&gt; headers extracted through the standard propagator.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Routing and cost:&lt;/strong&gt; Green. Budget caps genuinely enforced—overrun raises &lt;code&gt;BudgetExceededError&lt;/code&gt; rather than firing an alert and letting spend continue.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  The Lesson for Claude Code Users
&lt;/h2&gt;

&lt;p&gt;A structured audit isn't a search for a smoking gun. On a mature target there usually isn't one. What the seven-dimension pass surfaced were the production-readiness edges that hide in a good codebase: one resolver that errs toward exposure, a supply chain that floats instead of pinning, and a least-privilege control that waits for someone to opt in.&lt;/p&gt;

&lt;p&gt;None of these show up when you ask "does it work?" because the gateway works fine. They show up when you ask what happens at the boundaries, under error, and at the defaults most operators never change.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Action items for your Claude Code deployment:&lt;/strong&gt;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Audit your MCP server config for unpinned versions—pin them all.&lt;/li&gt;
&lt;li&gt;Add &lt;code&gt;allowed_tools&lt;/code&gt; allowlists to every server.&lt;/li&gt;
&lt;li&gt;Test what happens when your authorization resolver fails (it should deny, not allow).&lt;/li&gt;
&lt;li&gt;Enable OIDC/JWT on the call path, not just the dashboard.&lt;/li&gt;
&lt;li&gt;Verify end-to-end trace continuity with W3C trace context.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;The audit kit that produced this runs read-only in Claude Code. If you're putting an MCP gateway in front of production tools, run the same seven-dimension pass on your own deployment.&lt;/p&gt;

&lt;h2&gt;
  
  
  Sources
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://dev.to/willianpinho/i-ran-an-mcp-gateway-production-readiness-audit-on-a-popular-open-source-llm-gateway-heres-what-3b5d"&gt;LiteLLM audit article&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://github.com/BerriAI/litellm/tree/51ba6e39cd23576b9c2110361f1045782762f3e4" rel="noopener noreferrer"&gt;LiteLLM repo (pinned commit)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://genai.owasp.org/llm-top-10/" rel="noopener noreferrer"&gt;OWASP Top 10 for LLM Applications (2025)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://datatracker.ietf.org/doc/html/rfc8693" rel="noopener noreferrer"&gt;RFC 8693 — OAuth 2.0 Token Exchange&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://modelcontextprotocol.io/specification/2025-06-18" rel="noopener noreferrer"&gt;MCP specification (2025-06-18)&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;em&gt;[Updated 01 Jul via devto_mcp]&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;A separate production post from Sahajmeet Kaur describes three real-world MCP governance failures that mirror LiteLLM's audit findings: a support agent created 47 duplicate Jira tickets via a write-enabled MCP server; a contractor's Jira MCP token remained active three weeks after offboarding because MCP credentials weren't in the standard checklist; and a research agent executed injected instructions from a web-fetch tool [per Sahajmeet Kaur]. The post advocates for four control walls—tool-level RBAC, centralized credential management with single-token offboarding, per-invocation audit trails, and content guardrails—implemented via a central gateway rather than relying on the MCP protocol itself.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published on &lt;a href="https://gentic.news/article/3-mcp-gateway-security-gaps" rel="noopener noreferrer"&gt;gentic.news&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>programming</category>
      <category>machinelearning</category>
    </item>
  </channel>
</rss>
