DEV Community

柚子哥
柚子哥

Posted on

AI Agents News May 2026: OpenAI Codex, Siri AI Upgrade, OpenRouter Funding & the AI Infrastructure Shift

Key Takeaways
OpenAI expanded Codex with Locked Use, allowing desktop AI agents to operate on locked macOS devices.
Apple is reportedly rebuilding Siri around a large custom Google AI model optimized for hybrid on-device inference.
OpenAI is accelerating commercial expansion through self-serve ChatGPT Ads Manager tools.
DuckDuckGo is benefiting from growing backlash against forced AI search experiences.
OpenRouter’s rapid growth reflects rising enterprise demand for multi-model AI infrastructure.
Qualcomm and ByteDance are deepening AI inference infrastructure cooperation through custom ASIC chips.
The AI industry is entering a new stage where deployment, distribution, and inference infrastructure are becoming as strategically important as model quality itself.
For most of the past two years, the AI race focused heavily on model benchmarks, reasoning ability, and parameter scale. But recent developments across OpenAI, Apple, ByteDance, Google, and Spotify suggest the market is shifting toward a more operational layer involving:
AI agents
inference efficiency
operating-system integration
AI advertising
workflow orchestration
custom silicon
multi-model deployment
The next generation of AI leaders may not simply be the companies building the smartest models.
Increasingly, advantage may go to companies controlling how AI is deployed, monetized, distributed, and integrated into real software ecosystems.

1. OpenAI Codex Introduces Locked Use for macOS Automation

OpenAI officially introduced a new “Locked Use” feature for Codex, allowing the desktop AI agent to continue operating on macOS devices while the computer remains locked or asleep.
The update addresses a long-standing problem for developers running large automation workflows. Previously, many users relied on wake-lock tools or external display tricks to prevent Macs from entering sleep mode during:
overnight testing
GUI automation
long-duration software workflows
remote debugging sessions
With Locked Use enabled, Codex can reportedly continue performing restricted desktop actions remotely while the system remains locked.
Why It Matters
This feature signals a major shift in how AI agents interact with operating systems.
Until recently, most AI systems operated primarily inside browser windows or cloud chat interfaces. Codex moves AI agents closer to persistent desktop-level automation capable of interacting directly with applications and workflows.
The more important issue may be security.
According to reports, OpenAI implemented the feature through constrained Apple-authorized permissions requiring users to manually approve:
Accessibility access
Screen Recording permissions
OpenAI also added operational guardrails limiting access to:
Terminal control
unrestricted system processes
Codex self-management functions
The feature is reportedly unavailable in the EEA, UK, and Switzerland because of regulatory concerns tied to unattended AI automation and operating-system-level permissions.
Serving AI agents inside real operating systems introduces new security risks, especially on developer machines containing production credentials and internal infrastructure access.

2. Apple’s Siri Overhaul Signals the Rise of Hybrid AI Inference

According to multiple industry reports, Apple is reportedly working with a customized Google AI model estimated at roughly 1.2 trillion parameters as part of a major Siri overhaul connected to Apple Intelligence.
If accurate, this would represent one of the largest AI systems ever integrated into a mainstream consumer assistant.
However, the bigger challenge is not model size.
It is inference efficiency.
Apple appears to be pursuing a hybrid AI architecture balancing:
1.low-latency responses
2.on-device privacy
3.large-model reasoning
4.battery efficiency
This is an extremely difficult engineering problem because Siri operates inside highly constrained mobile environments where:
memory bandwidth
thermal limits
battery life
response speed
directly affect user experience.
Reports suggest Apple may process lightweight requests locally while shifting more complex reasoning tasks to cloud infrastructure.
Why It Matters
The next phase of consumer AI competition may depend less on benchmark rankings and more on operational performance.
For most users, Siri response speed and reliability will matter far more than whether the underlying model contains one trillion or ten trillion parameters.
Apple’s strategy also reflects a broader shift toward distributed inference systems rather than fully cloud-dependent AI architectures.
That trend is becoming increasingly important as AI assistants move deeper into smartphones, wearables, and always-on consumer devices.

3. AIGCPanel 2.0 Pushes AI Content Creation Toward Workflow Automation

AIGCPanel 2.0 introduced a major workflow-focused upgrade aimed at automating AI digital-human and multimedia production pipelines.
At the center of the release is a new LogicFlow-powered workflow engine allowing creators to visually chain together:
text generation
voice synthesis
subtitle creation
video editing
export automation
Many AI video creators still manually move assets across separate tools for voice generation, editing, rendering, and subtitles. That fragmentation creates major bottlenecks during batch content production.
AIGCPanel attempts to consolidate those disconnected workflows into a unified automation layer.
The platform also introduced:
breakpoint recovery
asynchronous task queues
CLI tooling
cross-platform support
CI/CD integration
Why It Matters
The AI creator economy is increasingly shifting from isolated generation tools toward production infrastructure.
Generating AI content is no longer the hardest problem.
Managing large-scale content workflows efficiently is becoming the bigger operational challenge.
This is why more AI creator platforms are adopting concepts traditionally associated with software engineering, including orchestration systems, automation pipelines, and workflow reliability.

4. SpaceX, OpenAI, and Anthropic Could Trigger a Historic AI IPO Wave

SpaceX, OpenAI, and Anthropic are all reportedly accelerating public-market preparation efforts, potentially creating one of the largest AI-driven IPO cycles in U.S. market history.
According to reports:
SpaceX may target a valuation near $1.75 trillion
OpenAI’s valuation reportedly reached roughly $852 billion
Anthropic may approach a $900 billion valuation after additional fundraising rounds
Unlike traditional software companies, frontier AI firms face enormous ongoing infrastructure costs involving:
GPU procurement
inference infrastructure
data-center expansion
AI chip development
cloud deployment
energy consumption
Reports suggest OpenAI has already warned investors that profitability could remain years away because of infrastructure spending requirements.
Why It Matters
Private AI investors have largely tolerated massive losses because they believe AI may become foundational digital infrastructure similar to cloud computing or search.
Public-market investors may prove less patient.
Public markets typically demand clearer profitability timelines than venture-backed private funding environments.
That tension could become one of the defining financial risks of the current AI boom.

5. DuckDuckGo Benefits From Growing Backlash Against AI Search

Following Google’s expanded AI-search rollout during its May 2026 I/O conference, DuckDuckGo reported strong growth in U.S. user installations and traffic.
According to company data, app installations rose significantly between May 20 and May 25, with particularly strong growth on iOS devices.
The core issue appears to be user control.
Many users support AI-assisted search but dislike mandatory AI-generated summaries replacing traditional search results and reducing direct access to web links.
DuckDuckGo benefited by emphasizing search experiences with minimal or disabled AI features.
Users are increasingly concerned about:
zero-click search
reduced publisher traffic
hallucinated summaries
loss of transparency
synthetic content overload
Why It Matters
The backlash highlights a growing divide inside the AI search market.
Users do not necessarily oppose AI itself.
Many simply want more control over how much AI appears inside their search experience.
That shift may create opportunities for smaller privacy-focused search platforms positioning themselves as alternatives to fully AI-generated search environments.

6. Spotify Defends AI Music Through Licensed Creation Systems

Spotify executives recently defended the company’s expanding AI music strategy following new licensing agreements tied to AI-generated remixes and covers.
The company argues that regulated AI ecosystems are preferable to uncontrolled AI-generated content spreading across the internet.
Under Spotify’s proposed framework:
artists can opt in
creators receive compensation
AI-generated works operate within licensed systems
Spotify is attempting to position AI music as a licensing and monetization opportunity rather than purely a copyright threat.
Why It Matters
The debate surrounding AI music increasingly reflects a larger issue affecting the entire AI creator economy:
How can platforms commercialize AI-generated content while still protecting creator rights?
That challenge now affects:
music
publishing
video
voice cloning
digital avatars
Platforms capable of combining licensing, creator compensation, AI tooling, and monetization may gain significant long-term advantages as AI-generated media becomes more mainstream.

7. OpenAI Expands ChatGPT Ads Manager to Compete With Google and Meta

OpenAI significantly expanded its advertising business by rolling out broader access to its self-serve ChatGPT Ads Manager platform.
The move signals that OpenAI is increasingly evolving beyond an AI research company into a commercial distribution and advertising platform.
The most important change involves accessibility for small businesses.
The updated system now supports:
self-serve ad management
daily budget controls
conversion tracking
geographic targeting
pixel integrations
dynamic CTA optimization
This places OpenAI into more direct competition with Google and Meta’s advertising ecosystems.
Why It Matters
Conversational AI advertising behaves differently from traditional search advertising.
Unlike standard Google Search ads, ChatGPT ads may appear during longer decision-making sessions where users ask follow-up questions before purchasing products or services.
That potentially creates higher-intent commercial interactions involving:
travel planning
software evaluation
shopping research
local business discovery
As conversational AI platforms become larger traffic ecosystems, advertisers are increasingly treating AI chat interfaces as future customer-acquisition channels rather than experimental products.

8. OpenRouter’s Funding Surge Highlights the Rise of Multi-Model AI Infrastructure

OpenRouter completed a $113 million Series B funding round led by CapitalG, reportedly reaching a valuation near $1.3 billion.
The company’s growth reflects a major shift happening across enterprise AI deployment.
Rather than relying entirely on one provider, enterprises increasingly want infrastructure capable of routing workloads dynamically across multiple models.
OpenRouter currently provides access to models from companies including:
OpenAI
Anthropic
Google
xAI
DeepSeek
This helps enterprises optimize:
inference cost
latency
redundancy
workload specialization
Different models often perform better on different tasks. For example:
coding
reasoning
long-context retrieval
multilingual workflows
may each benefit from different model architectures.
Why It Matters
The rise of AI agents is making orchestration infrastructure increasingly valuable.
As enterprises deploy larger AI systems, the market may gradually shift away from single-model dependency toward flexible multi-model ecosystems optimized for cost and performance.
That transition could reshape enterprise AI competition over the next several years.

9. Qualcomm and ByteDance Deepen AI Infrastructure Cooperation

Qualcomm and ByteDance reportedly finalized a major AI semiconductor partnership focused on custom ASIC infrastructure and inference deployment systems.
According to reports, Qualcomm may supply millions of AI ASIC chips optimized for inference workloads tied to ByteDance’s AI ecosystem and AI agent infrastructure.
The agreement reportedly extends beyond chip procurement and may also involve semiconductor manufacturing support connected to ByteDance’s internal AI chip initiatives.
Why It Matters
The global AI market is increasingly shifting from pure model competition toward deployment economics and infrastructure scalability.
Serving AI assistants at consumer scale creates continuous operational costs involving:
power consumption
cooling
latency
memory bandwidth
GPU allocation
This is one reason many AI companies are aggressively pursuing custom hardware strategies rather than depending entirely on general-purpose GPUs.
The partnership also reflects broader efforts among Chinese AI companies to diversify infrastructure and reduce dependence on a single semiconductor supplier.

Quick Industry Snapshot
Company Major Update Strategic Focus
OpenAI Codex Locked Use Desktop AI agents
Apple Siri AI overhaul Hybrid inference
AIGCPanel Workflow engine Creator automation
OpenAI Ads Manager expansion Conversational advertising
OpenRouter $113M funding Multi-model routing
Qualcomm + ByteDance AI ASIC partnership Inference infrastructure

Final Analysis: AI Is Entering Its Deployment and Distribution Era

The biggest pattern across these developments is the growing importance of operational AI infrastructure.
Across operating systems, search, creator tools, semiconductors, and advertising platforms, the industry is increasingly shifting toward:
inference efficiency
deployment scalability
workflow orchestration
monetization systems
AI distribution channels
operational reliability
This represents a major transition from the earlier generative AI cycle dominated primarily by model releases and benchmark competition.
The next generation of AI leaders may not simply be the companies building the smartest models.
Increasingly, they may be the companies controlling:
compute infrastructure
operating-system integration
AI distribution
advertising ecosystems
deployment economics
enterprise orchestration
The AI race is becoming a deployment and infrastructure competition.

FAQ
Why is AI infrastructure becoming more important?
As AI products scale to millions of users, inference cost, deployment efficiency, and compute access become major operational challenges affecting profitability and scalability.
Why are AI companies building custom AI chips?
Custom AI chips can improve inference efficiency, reduce power consumption, and lower long-term deployment costs compared with relying entirely on general-purpose GPUs.
Why are users reacting negatively to AI-powered search?
Many users feel AI-generated summaries reduce transparency, limit direct access to websites, and create overly synthetic search experiences.
What is a multi-model AI platform?
A multi-model platform allows enterprises to dynamically switch between different AI models depending on cost, latency, workload type, or performance requirements.
Why are AI agents becoming important?
AI agents are evolving beyond chatbots into systems capable of automating workflows, operating software, and interacting directly with enterprise and operating-system infrastructure.

Top comments (0)