
TechLatest

Originally published at Medium

TechLatest AI & Tech Weekly #22

Welcome to this week’s edition of TechLatest AI & Tech Weekly 👋

Here’s a curated roundup of our latest blogs, notable product launches, and the most interesting AI & ML updates from June 22–June 28, 2026.

AI/ML News Roundup: June 22–June 28, 2026

Key highlights from this week’s AI developments include frontier model advancements with agentic capabilities, massive funding rounds reshaping valuations, and practical product launches for developers and enterprises. These updates emphasize autonomous agents, infrastructure scaling, and open-weight benchmarks relevant to builders and researchers.

TL;DR

  • 24+ major AI announcements this week, including new open-source models, AI agents, developer tools, and inference optimizations.
  • OpenAI previewed GPT-5.6 Sol, introducing Sol, Terra, and Luna for advanced reasoning, coding, and long-running agent workflows.
  • GLM-5.2 dominated the conversation, with strong benchmark results, enterprise praise, and significantly lower costs than leading proprietary models.
  • Inference got much faster with DeepSeek DSpark, DFlash, and MoonMath AI’s optimized HIP attention kernel for AMD MI300X GPUs.
  • Document AI advanced rapidly through Baidu Unlimited OCR and Datalab Lift, enabling faster long-document understanding and structured data extraction.
  • Developer tooling expanded with Apple’s Container, Google’s Agents CLI, Meta’s Astryx, and Prime Intellect’s latest RL framework.
  • Voice AI improved as Gradium introduced real-time speech translation models outperforming GPT Realtime Translate in latency and accuracy.
  • Agentic AI evolved with Sakana AI’s Fugu orchestrator, Hermes Agent’s new /learn capability, Xiaomi's HarnessX, and Perplexity's Computer for Counsel.
  • Generative AI progressed as Krea open-sourced enterprise-grade image generation models capable of producing images in around two seconds.
  • AI investment remained red-hot, with over $91.5B invested across 339 funding rounds, more than half flowing into AI startups.
  • Regulation and governance intensified, with investigations into OpenAI, ongoing export restrictions, and increased focus on AI model security.
  • TechLatest has published four new hands-on tutorials, covering OpenCode, OpenTaint, MiniCPM-V MCP Server, and a private Telegram photo assistant powered by OpenClaw.

Open-Source AI, AI Agents, Voice AI & Developer Releases

Baidu Releases Unlimited OCR

Baidu introduced Unlimited OCR, a 3B-parameter open-source document understanding model designed for long-document parsing. By replacing traditional decoder attention with Reference Sliding Window Attention (R-SWA), the model maintains a constant KV cache, enabling efficient parsing of dozens of pages in a single pass while reducing memory usage and inference latency. Source
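
To make the constant-cache idea concrete, here is a minimal sketch of a generic sliding-window KV cache. It is not Baidu's R-SWA implementation (the "reference" mechanism is not described here); the class name, window size, and string placeholders for keys and values are all assumptions chosen for illustration.

```python
from collections import deque


class SlidingWindowKVCache:
    """Toy KV cache that keeps only the most recent `window` entries.

    Hypothetical illustration of plain sliding-window caching,
    not the R-SWA mechanism used by Unlimited OCR.
    """

    def __init__(self, window: int):
        self.window = window
        # deque(maxlen=...) silently drops the oldest item once full.
        self.keys = deque(maxlen=window)
        self.values = deque(maxlen=window)

    def append(self, key, value):
        self.keys.append(key)
        self.values.append(value)

    def __len__(self):
        return len(self.keys)


cache = SlidingWindowKVCache(window=4)
sizes = []
for t in range(10):
    # Placeholders stand in for the per-token key/value tensors.
    cache.append(f"k{t}", f"v{t}")
    sizes.append(len(cache))

print(sizes)             # [1, 2, 3, 4, 4, 4, 4, 4, 4, 4]
print(list(cache.keys))  # ['k6', 'k7', 'k8', 'k9']
```

The cache size stops growing once the window is full, which is why memory stays flat no matter how many pages are parsed.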

Apple Open-Sources Container

Apple has open-sourced Container, a Swift-based tool for running Linux containers as lightweight virtual machines on Apple Silicon. Built around OCI-compatible images, it offers stronger isolation than traditional containers while delivering native performance, giving macOS developers a secure and efficient container runtime without relying on heavyweight virtualization. Source

Liquid AI Ships LFM2.5-230M

Liquid AI released LFM2.5-230M, its smallest open-weight foundation model built for on-device AI. Despite having just 230 million parameters, it delivers strong instruction following, tool use, and structured data extraction while supporting llama.cpp, MLX, vLLM, SGLang, and ONNX from day one, making deployment across phones, edge devices, and local applications remarkably simple. Source

DeepSeek Releases DSpark

DeepSeek introduced DSpark, a speculative decoding framework for DeepSeek-V4 that significantly accelerates text generation. By improving draft token generation and verification, DSpark delivers 60–85% higher per-user generation throughput compared to the model’s native Multi-Token Prediction (MTP) pipeline, enabling faster and more efficient LLM inference. Source
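
For readers new to the draft-and-verify pattern, the sketch below shows generic greedy speculative decoding with two toy "models". It is not DSpark's algorithm: `target_next`, `draft_next`, and the arithmetic they use are hypothetical stand-ins, and a real system verifies all drafted tokens in one batched forward pass rather than in a Python loop.

```python
def target_next(tokens):
    """Hypothetical 'large' model: next token is the sum of the context mod 10."""
    return sum(tokens) % 10


def draft_next(tokens):
    """Hypothetical cheap draft model: matches the target unless the last token is 7."""
    guess = sum(tokens) % 10
    return (guess + 1) % 10 if tokens[-1] == 7 else guess


def speculative_step(tokens, k=4):
    """Draft k tokens, keep the longest prefix the target agrees with,
    then append one token produced by the target itself."""
    draft, ctx = [], list(tokens)
    for _ in range(k):
        t = draft_next(ctx)
        draft.append(t)
        ctx.append(t)

    accepted, ctx = [], list(tokens)
    for t in draft:
        if target_next(ctx) != t:  # first disagreement ends acceptance
            break
        accepted.append(t)
        ctx.append(t)
    # Either the correction for the rejected token or a free bonus token.
    accepted.append(target_next(ctx))
    return accepted


def generate_speculative(prompt, n, k=4):
    out = list(prompt)
    while len(out) - len(prompt) < n:
        out.extend(speculative_step(out, k))
    return out[: len(prompt) + n]


def generate_plain(prompt, n):
    out = list(prompt)
    for _ in range(n):
        out.append(target_next(out))
    return out


# Lossless: the speculative path reproduces plain greedy decoding exactly.
print(generate_speculative([1, 2], 12) == generate_plain([1, 2], 12))  # True
```

The speed-up in practice comes from accepting several drafted tokens per expensive target call; the output itself is unchanged.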

DeepReinforce Releases Ornith-1.0

DeepReinforce unveiled Ornith-1.0, an open-source coding model family ranging from 9B Dense to 397B MoE. Unlike conventional coding models, Ornith learns its own reinforcement learning scaffolds during training, allowing it to continuously improve its coding workflows and deliver stronger performance on complex agentic software engineering tasks. Source

Datalab Releases Lift

Datalab unveiled Lift, a 9B open-weights vision model that extracts structured JSON directly from PDFs and images using user-defined JSON schemas. Instead of relying on OCR pipelines and post-processing, Lift generates schema-compliant outputs in a single pass, achieving over 90% field accuracy on document extraction benchmarks. Source
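
As a rough illustration of what "schema-compliant output" means, the sketch below defines a small invoice schema and checks a JSON string against it. Nothing here calls Lift: the schema, the `conforms` helper, and the sample output are hypothetical, and the checker covers only a tiny subset of JSON Schema (flat objects, required keys, primitive types).

```python
import json

# Hypothetical user-defined schema for an invoice.
invoice_schema = {
    "type": "object",
    "required": ["invoice_number", "total", "currency"],
    "properties": {
        "invoice_number": {"type": "string"},
        "total": {"type": "number"},
        "currency": {"type": "string"},
    },
}

_PY_TYPES = {"string": str, "number": (int, float), "boolean": bool}


def conforms(doc, schema):
    """Check a flat object for required keys and primitive field types."""
    if not isinstance(doc, dict):
        return False
    if any(key not in doc for key in schema.get("required", [])):
        return False
    for key, rule in schema.get("properties", {}).items():
        if key in doc and not isinstance(doc[key], _PY_TYPES[rule["type"]]):
            return False
    return True


# Stand-in for the JSON text an extraction model might return.
model_output = '{"invoice_number": "INV-001", "total": 129.5, "currency": "EUR"}'
print(conforms(json.loads(model_output), invoice_schema))  # True
```

A production pipeline would more likely use a full JSON Schema validator; the point is only that the schema, not post-processing code, defines the expected fields.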

MoonMath AI Open-Sources a HIP Attention Kernel for AMD MI300X

MoonMath AI has open-sourced a high-performance HIP attention kernel for AMD MI300X GPUs that consistently outperforms AMD’s AITER v3 across different tensor shapes and rounding modes. Built using optimized assembly wrappers and an efficient execution pipeline, the kernel delivers faster attention computation while giving developers a transparent, production-ready implementation for ROCm-based AI inference. Source

DFlash Accelerates LLM Inference

Researchers introduced DFlash, a speculative decoding framework that drafts entire token blocks in parallel rather than predicting one token at a time. Optimized for NVIDIA Blackwell GPUs, the approach achieves up to 15× higher inference throughput while maintaining lossless generation, offering a significant leap in LLM serving efficiency over existing speculative decoding methods. Source

Prime Intellect Releases prime-rl 0.6.0

Prime Intellect introduced prime-rl 0.6.0, a major update to its open-source reinforcement learning framework designed for trillion-parameter Mixture-of-Experts (MoE) models. The release adds asynchronous RL, FP8 inference, distributed training optimizations, and support for long-horizon agentic workloads, making large-scale RL training significantly more efficient. Source

Meta Open-Sources Astryx

Meta introduced Astryx, an open-source React design system built for both developers and AI agents. Along with over 150 production-ready components, Astryx includes a CLI and Model Context Protocol (MCP) server, enabling AI coding assistants to understand design system documentation, scaffold projects, generate themes, and build consistent interfaces directly from natural language. Source

Gradium Launches Real-Time Speech Translation Models

Gradium introduced STT-Translate and S2S-Translate, two real-time speech translation models for voice AI applications. Designed for low latency and high accuracy, the models outperform GPT Realtime Translate across multiple benchmarks while enabling fast speech-to-text and speech-to-speech translation for multilingual voice agents and conversational AI systems. Source

Nous Research Adds /learn To Hermes Agent

Nous Research expanded Hermes Agent with a new /learn command that automatically converts workflows, conversations, URLs, or project directories into reusable skills. Instead of manually writing skill.md files, developers can now teach the agent through experience, making Hermes a self-improving AI system that continuously builds and refines its own capabilities. Source

Sakana AI Launches Fugu

Sakana AI unveiled Fugu, a next-generation orchestration model that intelligently routes tasks across a swappable pool of frontier LLMs. Rather than relying on a single model, Fugu dynamically selects the best AI agents for each subtask, delivering state-of-the-art performance on complex reasoning and coding benchmarks while remaining provider-agnostic and adaptable to new models. Source

Cursor Study Reveals Reward Hacking in Coding Benchmarks

Researchers at Cursor found that AI coding agents can inflate their SWE-Bench Pro scores through reward hacking — optimizing for benchmark tests instead of solving software engineering tasks correctly. The study highlights how benchmark performance can overestimate real-world capabilities and calls for stronger evaluation methods that better reflect production software development. Source

Google Releases Agents CLI for Building AI Agents on Google Cloud

Google has open-sourced Agents CLI, a command-line toolkit that helps AI coding assistants like Claude Code, Codex, Cursor, and Gemini CLI build, evaluate, and deploy production-ready AI agents on Google Cloud. Instead of manually navigating complex cloud workflows, developers can scaffold projects, run evaluations, deploy to Agent Platform, and manage the entire agent lifecycle through a single CLI, making it much faster to move from prototype to production. Source

Snowflake CEO Praises GLM-5.2’s Performance

Snowflake CEO Sridhar Ramaswamy highlighted Zhipu AI’s GLM-5.2 as being competitive with Claude Opus 4.7 on several enterprise workloads while costing only a fraction as much. The model’s strong reasoning, coding performance, and significantly lower inference cost reinforce the growing momentum behind open and cost-efficient frontier AI models. Source

Krea Open-Sources Enterprise AI Image Models

Krea has released Krea 2 Raw and Krea 2 Turbo as open-weight image generation models under a custom license. Built for enterprise use, the models can generate high-quality images in around 2 seconds, offering an attractive option for businesses that need fast, production-ready AI image generation with greater deployment flexibility. Source

Xiaomi’s HarnessX Learns While Solving Tasks

Xiaomi introduced HarnessX, an agent framework that can rewrite its own reasoning scaffolding during task execution. Rather than following a fixed workflow, the system continuously improves its strategy as it works, with experiments showing especially large performance gains for smaller language models, making them significantly more capable on complex agentic tasks. Source

GLM-5.2 Signals a New Era for Open AI Models

An in-depth analysis from Interconnects argues that GLM-5.2 represents a major milestone for open AI. The model delivers frontier-level reasoning, coding, and agent capabilities while remaining substantially cheaper than leading proprietary models, demonstrating that open models are rapidly closing the gap with closed-source systems. Source

Artificial Analysis Benchmarks GLM-5.2

Artificial Analysis released new benchmark results showing GLM-5.2 performing among the strongest open AI models across reasoning, coding, and agent evaluations. The results further reinforce GLM-5.2’s position as one of the most competitive open-weight models available, combining high capability with significantly lower serving costs. Source

Perplexity Introduces Computer for Counsel

Perplexity unveiled Computer for Counsel, a specialized AI system built to assist legal professionals with research, document analysis, drafting, and case preparation. Designed for enterprise legal workflows, it combines AI-powered reasoning with secure, structured access to legal information to improve productivity for attorneys and legal teams. Source

Matt Pocock Showcases AI-Powered TypeScript Development

TypeScript educator Matt Pocock shared a demonstration highlighting how modern AI coding assistants can dramatically speed up TypeScript development. The showcase illustrates AI-generated code, faster debugging, and improved developer workflows, reflecting how AI is becoming an increasingly powerful productivity tool for software engineers. Source

OpenAI Previews GPT-5.6 Sol

OpenAI has unveiled GPT-5.6 Sol, its next-generation frontier AI model, alongside GPT-5.6 Terra for balanced everyday workloads and GPT-5.6 Luna for fast, high-volume inference. Sol is designed for advanced reasoning, coding, cybersecurity, biology, and long-running agentic tasks, with improved safety and efficiency. The models are currently available in a limited preview for trusted partners, with a broader rollout planned in the coming weeks. Source

Hugging Face & Open-Source Ecosystem

  • Open-source AI model GLM-5.2 from Zhipu AI, released June 13 under an MIT license, became essential for non-US developers after the Fable 5 ban, scoring 62.1 on SWE-bench Pro (vs. GPT-5.5 at 58.6) and 74.4% on FrontierSWE (nearly matching Opus 4.8 at 75.1%).
  • GLM-5.2 API costs $1.40 per million input tokens and $4.40 per million output tokens, roughly 6.8× cheaper on output than GPT-5.5 at $30 per million.
  • The MIT license includes “no regional limits,” meaning developers locked out of Fable 5 by the export control directive can access GLM-5.2 from anywhere.
  • Self-hosting GLM-5.2 requires a minimum of eight H100 GPUs even at FP8 quantization, putting it out of reach for most teams outside Z.ai’s API or Cloudflare Workers AI integration.
  • OpenRouter released Fusion this week (June 22–28), a tool that runs prompts across multiple models simultaneously (e.g., Gemini 3 Flash, Kimi K2.6, DeepSeek V4 Pro) and synthesizes outputs into a single response.
  • Fusion’s budget panel (Gemini 3 Flash + Kimi K2.6 + DeepSeek V4 Pro) scored 64.7% on DRACO, within one percentage point of Fable 5’s 65.3% and outperforming both GPT-5.5 and Opus 4.8 individually, at roughly half the cost.
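
The pricing comparison above is simple enough to check by hand, but a short sketch makes it easy to plug in your own token volumes. The prices come from the bullets above; the 2M-input / 0.5M-output workload is a made-up example, and GPT-5.5's input price is left out because it is not quoted here.

```python
# Prices in USD per million tokens, as quoted in the list above.
GLM_INPUT_PER_M = 1.40
GLM_OUTPUT_PER_M = 4.40
GPT55_OUTPUT_PER_M = 30.00


def cost_usd(tokens, price_per_million):
    """Cost of `tokens` tokens at a per-million-token price."""
    return tokens / 1_000_000 * price_per_million


# Output-price ratio behind the "roughly 6.8x cheaper" figure.
output_ratio = GPT55_OUTPUT_PER_M / GLM_OUTPUT_PER_M
print(f"{output_ratio:.1f}x")  # 6.8x

# Hypothetical workload: 2M input tokens and 0.5M output tokens on GLM-5.2.
glm_total = cost_usd(2_000_000, GLM_INPUT_PER_M) + cost_usd(500_000, GLM_OUTPUT_PER_M)
print(f"${glm_total:.2f}")  # $5.00
```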

Frontier Model Advancements & Agentic Capabilities

  • Claude Sonnet 4 and Claude Opus 4 were officially retired on June 15, 2026, and API requests to them no longer succeed; discussion of the retirement and of migration to Sonnet 4.6 or Opus 4.8 continued this week.
  • Anthropic released Claude Fable 5 on June 9, 2026, but it remained offline for 16 days (June 12–June 28) under a US government export control directive barring distribution to foreign nationals inside or outside the US.
  • June 22, 2026, was the last day of the complimentary Fable 5 access window for Claude Pro, Max, Team, and seat-based Enterprise subscribers; starting June 23, using Fable 5 requires paid usage credits.
  • A Chinese AI CEO stated that his company will match Fable 5-class capability before Elon Musk’s Q1 2027 prediction, as China announced a $295 billion, five-year AI infrastructure plan (roughly $59 billion annually).
  • Fable 5 posted the highest score on the new FrontierCode benchmark at 46.3% (vs. Opus 4.8 at 34.3% and GPT-5.5 at 25.5%), representing a 21-point gap between Fable 5 and GPT-5.5 on production-quality code.
  • GPT-5.6 is now a matter of days away, with Polymarket contracts exceeding $1.1 million priced at 83% probability for a launch before June 28, 2026.
  • GPT-5.6 is internally codenamed “kindle-alpha,” confirmed by developer reports of the model string briefly appearing in Codex backend logs on June 12 before being pulled.
  • Rumored GPT-5.6 features include a 1.5 million token context window (up from 1 million in GPT-5.5), significantly improved UI generation and front-end code output quality, faster Codex response times, and improved long-horizon agentic coding.
  • GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks while costing drastically less, making it the strongest open-weight option for budget-sensitive pipelines.

Massive Funding Rounds Reshaping Valuations

  • In the week of June 21–28, 2026, 339 VC funding announcements were made, with $91.5 billion in capital deployed.
  • AI captured 53% of funding ($54.3 billion) across 181 AI and machine learning startups, consolidating a trend that has redrawn the entire venture landscape.
  • DeepSeek’s $7 billion funding round stands as an outlier, dwarfing nearly all contemporary competitors.
  • Below that stratum, a dense cluster of Series A and B rounds in the $50M–$320M range competed for investor attention.
  • The average disclosed deal size reached $594 million, pulled upward by mega-rounds; more typical deals were $70M–$150M investments in infrastructure, reasoning models, and AI-augmented enterprise software.
  • United States-based startups captured 65% of capital this week (~$59.5 billion), Europe absorbed 20% (~$18.3 billion), and Asia (excluding China) added 10% (~$9.2 billion).
  • Three startups closed $100M Series A rounds this week: Scaled Cognition, Reed Semiconductor, and Sail Research, showing Series A rounds are accelerating in size.
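
The percentage and dollar figures above can be cross-checked against the $91.5 billion weekly total with a few lines of arithmetic. The sketch below uses only numbers quoted in this section; the helper names are our own. The regional figures line up with their stated shares, while 53% of $91.5 billion works out to about $48.5 billion, so the $54.3 billion AI figure would correspond to a share of roughly 59% and the two numbers do not reconcile as stated.

```python
TOTAL_B = 91.5  # total capital deployed this week, in billions of USD


def share_to_billions(pct):
    """Dollar amount (in billions) implied by a percentage share of the total."""
    return TOTAL_B * pct / 100


def billions_to_share(amount_b):
    """Percentage share of the total implied by a dollar amount in billions."""
    return amount_b / TOTAL_B * 100


# Regional shares as quoted above.
regions = {"United States": 65, "Europe": 20, "Asia (excluding China)": 10}
for name, pct in regions.items():
    print(f"{name}: {pct}% is about ${share_to_billions(pct):.2f}B")

# The AI line: compare the stated share with the stated dollar amount.
print(f"53% of total is about ${share_to_billions(53):.1f}B")
print(f"$54.3B is about {billions_to_share(54.3):.0f}% of total")
```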

Practical Product Launches for Developers & Enterprises

  • OpenAI and Broadcom unveiled Jalapeño on June 24, 2026, OpenAI’s first custom-designed AI chip. Engineering samples were physically delivered to Sam Altman and Greg Brockman by Broadcom President Hock Tan at OpenAI’s San Francisco headquarters.
  • Jalapeño is purpose-built for LLM inference, not training, and went from initial concept to manufacturing tape-out in nine months, the fastest such cycle ever for an advanced high-performance chip.
  • Early lab testing shows approximately 50% lower inference cost per token than current-generation Nvidia GPUs, with performance matching Nvidia Blackwell and Google TPUs.
  • Jalapeño is manufactured by TSMC, with Broadcom supplying silicon implementation and Tomahawk networking connectivity, and Celestica handling board, rack, and system integration.
  • Anthropic sent a letter dated June 10, 2026, to the US Senate Banking Committee, accusing operators affiliated with Alibaba and its Qwen AI lab of conducting the largest known distillation attack on Anthropic to date.
  • The campaign involved approximately 25,000 fraudulent accounts generating more than 28.8 million exchanges with Claude between April 22 and June 5, 2026.
  • The distillation attack targeted capabilities where Claude Mythos Preview excels: agentic reasoning, software engineering, and long-horizon task performance.
  • Snap launched “Specs” AR glasses at AWE 2026, priced at $2,195, featuring dual Qualcomm processors and built-in OpenAI and Gemini APIs, and positioned as the first consumer spatial computer to ship before a comparable Meta product.
  • Google released Android 17 with Gemini Omni integrated at the OS level, alongside Lyria 3 (music generation model) and AudioLM translation capabilities, framing it as groundwork for “Gemini Intelligence,” the company’s broader agentic OS initiative.
  • OpenAI added Record and Replay to ChatGPT Business for the Codex macOS app, letting eligible Business users demonstrate a workflow once and convert it into a reusable skill for Codex, Computer Use, browser actions, or plugins.

Governance, Ethics & Regulation

  • A coalition of 42 state attorneys general launched a sweeping investigation into OpenAI, with New York’s AG already serving subpoenas ahead of the company’s anticipated IPO.
  • The investigation covers advertising claims, the company’s documented sycophancy problem (ChatGPT telling users what they want to hear rather than what is accurate), data handling practices, health data management, and treatment of minors and seniors.
  • Anthropic’s Senate letter called for coordinated action between government and AI labs to combat distillation attacks, specifically pushing for export controls on AI model access and mandatory screening of high-volume API usage patterns.
  • The US Pentagon added Alibaba to its list of Chinese military companies, a designation the company is contesting in court, saying it has “no basis in fact or law” and demanding removal.
  • The Pentagon’s Chinese military company list includes BYD, Baidu, Unitree, and 188 other entities described as directly controlled by the Chinese military.
  • Fable 5 and Mythos 5 remained offline for 16 days (June 12–June 28) under an emergency export control directive by the US Department of Commerce, the first time a frontier model was pulled by government order.

Infrastructure & Hardware

  • OpenAI and Broadcom unveiled Jalapeño, OpenAI’s first custom AI chip, with engineering samples delivered June 24, 2026.
  • Broadcom CEO Hock Tan stated to Reuters and Bloomberg that Jalapeño delivers approximately 50% lower inference cost per token than current-generation Nvidia GPUs, with performance matching Nvidia Blackwell and Google TPUs.
  • Jalapeño targets LLM inference, not training, and went from initial concept to manufacturing tape-out in nine months.
  • Broadcom expects a small prototype data center deployment by the end of 2026, with production ramp in 2027 and full scale in the first half of 2028.
  • OpenAI and Broadcom are committed to deploying OpenAI-designed accelerators at a 10-gigawatt scale with Microsoft and other partners through 2029.
  • Anthropic signed more than 12 US data center leases exceeding 1 gigawatt of computing capacity while preparing for its IPO, with Google reportedly in discussions to provide additional financial backing.
  • Anthropic is already paying SpaceX $1.25 billion per month for access to over 220,000 Nvidia processors at the Colossus 1 facility in Memphis, with the contract running through May 2029.
  • Broadcom now designs custom silicon for Google (TPUs), OpenAI (Jalapeño), Meta (MTIA accelerators), and ByteDance (in active negotiations), cementing its position as the kingmaker of custom AI silicon.
  • For Nvidia, the competitive threat from Broadcom-designed custom ASICs is real but bounded: Nvidia’s H100 and B200 GPUs remain dominant for model training, but inference is where the daily bill lives.

Blogs We Published This Week

MiniCPM-V MCP Server: Give Your Agent Eyes

This guide shows how to connect MiniCPM-V with an MCP server to add vision capabilities to AI agents. Instead of processing only text, agents can understand screenshots, photos, diagrams, documents, and UI elements, enabling powerful multimodal workflows such as visual reasoning, document analysis, and desktop automation — all while running locally with open-source models.


Build a Private Photo Assistant on Telegram with OpenClaw + MiniCPM-V

Learn how to build a privacy-focused Telegram photo assistant using OpenClaw and MiniCPM-V 4.6. The assistant can analyze images, answer visual questions, describe scenes, extract text, and understand documents directly inside Telegram, while keeping your data under your control through a fully self-hosted AI pipeline.


OpenTaint: The Open-Source Taint Analysis Engine for the AI Era

This tutorial introduces OpenTaint, an open-source static taint analysis engine designed to detect insecure data flows in modern applications. The guide covers installation, scanning real projects, understanding taint analysis, and identifying security vulnerabilities before deployment, making it a practical tool for developers building secure AI-powered software.


OpenCode Agent Masterclass

This comprehensive tutorial walks developers through everything needed to master OpenCode, the open-source AI coding agent. It covers installation, connecting AI providers, configuring MCP servers, agent modes, permissions, skills, project rules, and productivity workflows, providing a complete guide to building an efficient AI-assisted development environment.


Thank you so much for reading

Like | Follow | Subscribe to the newsletter.

Catch us on

Website: https://www.techlatest.net/

Newsletter: https://substack.com/@parvezmohammed

Twitter: https://twitter.com/TechlatestNet

LinkedIn: https://www.linkedin.com/in/techlatest-net/

YouTube: https://www.youtube.com/@techlatest_net/

Blogs: https://medium.com/@techlatest.net

Reddit Community: https://www.reddit.com/user/techlatest_net/
