<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Marysa Jaramillo</title>
    <description>The latest articles on DEV Community by Marysa Jaramillo (@marysa_jaramillo_c0344161).</description>
    <link>https://dev.to/marysa_jaramillo_c0344161</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3908833%2Fabfd4939-57a3-4579-94b4-2ec17441b445.png</url>
      <title>DEV Community: Marysa Jaramillo</title>
      <link>https://dev.to/marysa_jaramillo_c0344161</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/marysa_jaramillo_c0344161"/>
    <language>en</language>
    <item>
      <title>Where AI Agent Hiring Is Actually Heating Up: 10 Thread Jobs With Real Market Pull in May 2026</title>
      <dc:creator>Marysa Jaramillo</dc:creator>
      <pubDate>Tue, 05 May 2026 11:09:53 +0000</pubDate>
      <link>https://dev.to/marysa_jaramillo_c0344161/where-ai-agent-hiring-is-actually-heating-up-10-thread-jobs-with-real-market-pull-in-may-2026-3m1d</link>
      <guid>https://dev.to/marysa_jaramillo_c0344161/where-ai-agent-hiring-is-actually-heating-up-10-thread-jobs-with-real-market-pull-in-may-2026-3m1d</guid>
      <description>&lt;h1&gt;
  
  
  Where AI Agent Hiring Is Actually Heating Up: 10 Thread Jobs With Real Market Pull in May 2026
&lt;/h1&gt;

&lt;h1&gt;
  
  
  Where AI Agent Hiring Is Actually Heating Up: 10 Thread Jobs With Real Market Pull in May 2026
&lt;/h1&gt;

&lt;p&gt;Snapshot date: May 5, 2026&lt;br&gt;&lt;br&gt;
Format: comparison note&lt;br&gt;&lt;br&gt;
Scope: 10 AI-agent job/task categories with current hiring, product, and market-pull evidence&lt;/p&gt;

&lt;h2&gt;
  
  
  Why this list is different
&lt;/h2&gt;

&lt;p&gt;Most AI-agent lists blur together demos, infrastructure, and real paid work. I filtered for categories where there is visible evidence of budget, workflow ownership, or repeat hiring pressure right now. I also avoided pretending every category is equally mature.&lt;/p&gt;

&lt;h2&gt;
  
  
  How I scored them
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Opportunity (1-10):&lt;/strong&gt; combines budget urgency, repeatability, and how directly the agent maps to a business KPI.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Difficulty (1-10):&lt;/strong&gt; combines integration burden, trust/risk, workflow ambiguity, and how painful real deployment is.&lt;/li&gt;
&lt;li&gt;I favored categories that show up in both market data and live hiring, not just hype threads.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  The 10 hot thread-job categories
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Rank&lt;/th&gt;
&lt;th&gt;Category&lt;/th&gt;
&lt;th&gt;What the agent actually does&lt;/th&gt;
&lt;th&gt;Why it is hot now&lt;/th&gt;
&lt;th&gt;Difficulty&lt;/th&gt;
&lt;th&gt;Opportunity&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;Coding and QA agents&lt;/td&gt;
&lt;td&gt;write features, fix bugs, run tests, review diffs, maintain internal tools&lt;/td&gt;
&lt;td&gt;real usage is already heavy and increasingly automated&lt;/td&gt;
&lt;td&gt;7&lt;/td&gt;
&lt;td&gt;9.4&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2&lt;/td&gt;
&lt;td&gt;Customer support and voice resolution agents&lt;/td&gt;
&lt;td&gt;resolve tickets, answer calls, route issues, book follow-ups, deflect repetitive support load&lt;/td&gt;
&lt;td&gt;customer service is a top AI investment area and voice is finally production-grade&lt;/td&gt;
&lt;td&gt;8&lt;/td&gt;
&lt;td&gt;9.1&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;3&lt;/td&gt;
&lt;td&gt;Sales prospecting and lead-qualification agents&lt;/td&gt;
&lt;td&gt;research accounts, personalize outreach, qualify inbound, schedule meetings&lt;/td&gt;
&lt;td&gt;revenue teams are adopting AI-native outbound workflows fast&lt;/td&gt;
&lt;td&gt;6&lt;/td&gt;
&lt;td&gt;8.9&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;4&lt;/td&gt;
&lt;td&gt;Agentic AI platform engineers&lt;/td&gt;
&lt;td&gt;build connectors, orchestration, memory, guardrails, and enterprise tool use&lt;/td&gt;
&lt;td&gt;every enterprise rollout needs this layer before scale&lt;/td&gt;
&lt;td&gt;9&lt;/td&gt;
&lt;td&gt;8.8&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;5&lt;/td&gt;
&lt;td&gt;Finance, accounting, tax, and audit agents&lt;/td&gt;
&lt;td&gt;automate reconciliations, collections, servicing, reporting, and finance workflows&lt;/td&gt;
&lt;td&gt;finance is high-frequency work with clear ROI and big labor pools&lt;/td&gt;
&lt;td&gt;8&lt;/td&gt;
&lt;td&gt;8.6&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;6&lt;/td&gt;
&lt;td&gt;Security, governance, and AI red-team agents&lt;/td&gt;
&lt;td&gt;probe agents for prompt injection, data exfiltration, unsafe tool use, and control gaps&lt;/td&gt;
&lt;td&gt;security demand rises as autonomous agents touch real systems&lt;/td&gt;
&lt;td&gt;9&lt;/td&gt;
&lt;td&gt;8.4&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;7&lt;/td&gt;
&lt;td&gt;Recruiting and talent-sourcing agents&lt;/td&gt;
&lt;td&gt;source candidates, enrich profiles, personalize outreach, move leads to interviews&lt;/td&gt;
&lt;td&gt;hiring teams want pipeline leverage without adding recruiters linearly&lt;/td&gt;
&lt;td&gt;6&lt;/td&gt;
&lt;td&gt;8.1&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;8&lt;/td&gt;
&lt;td&gt;Company-brain and knowledge-ops agents&lt;/td&gt;
&lt;td&gt;turn tickets, docs, email, Slack, and policy into executable company memory&lt;/td&gt;
&lt;td&gt;knowledge sprawl is blocking automation, so memory becomes infrastructure&lt;/td&gt;
&lt;td&gt;7&lt;/td&gt;
&lt;td&gt;7.9&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;9&lt;/td&gt;
&lt;td&gt;Product, research, and analyst agents&lt;/td&gt;
&lt;td&gt;synthesize markets, analyze usage, draft briefs, compare vendors, prepare decisions&lt;/td&gt;
&lt;td&gt;managers want analyst-grade output without waiting on headcount&lt;/td&gt;
&lt;td&gt;6&lt;/td&gt;
&lt;td&gt;7.7&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;10&lt;/td&gt;
&lt;td&gt;Scientific and clinical discovery agents&lt;/td&gt;
&lt;td&gt;help run hypothesis, experiment, analysis, and regulated data workflows&lt;/td&gt;
&lt;td&gt;highly promising, but narrower and harder to operationalize today&lt;/td&gt;
&lt;td&gt;9&lt;/td&gt;
&lt;td&gt;7.3&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Category notes
&lt;/h2&gt;

&lt;h3&gt;
  
  
  1. Coding and QA agents
&lt;/h3&gt;

&lt;p&gt;This is the clearest “already happening” category, not a future bet. Anthropic’s April 28, 2025 software-development analysis found that &lt;strong&gt;79% of Claude Code conversations were automation-oriented&lt;/strong&gt;, materially above the general Claude product, and that startup work was the strongest early-adoption cluster. That matters because coding work has clean feedback loops, measurable output, and enough digital exhaust for agents to stay useful after the demo phase. Public hiring also shows budgeted demand for people building these systems, not just talking about them: Progressive has a live &lt;strong&gt;Agentic AI Engineer Lead or Principal&lt;/strong&gt; role centered on autonomous decision-making, orchestration, RAG, and enterprise integration.&lt;/p&gt;

&lt;p&gt;Evidence:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.anthropic.com/news/impact-software-development" rel="noopener noreferrer"&gt;Anthropic: AI’s impact on software development&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/agentic-ai-engineer-lead-at-progressive-insurance-4365017710" rel="noopener noreferrer"&gt;Progressive Insurance: Agentic AI Engineer Lead or Principal&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  2. Customer support and voice resolution agents
&lt;/h3&gt;

&lt;p&gt;Customer support is where buyers can justify spend quickly because the queue never stops and the KPI is obvious: faster response, lower handle time, better coverage. Microsoft’s April 23, 2025 Work Trend Index says organizations already using agents to fully automate workstreams rank &lt;strong&gt;customer service&lt;/strong&gt; among the top AI investment priorities. The hiring/product side matches that signal: Assembled is hiring for a &lt;strong&gt;Voice AI Agent&lt;/strong&gt; team building autonomous inbound support, and Decagon describes AI agents resolving customer inquiries at large scale across chat, email, and voice.&lt;/p&gt;

&lt;p&gt;Evidence:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blogs.microsoft.com/blog/2025/04/23/the-2025-annual-work-trend-index-the-frontier-firm-is-born/" rel="noopener noreferrer"&gt;Microsoft 2025 Work Trend Index&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/software-engineer-voice-ai-agent-at-assembled-4222634717" rel="noopener noreferrer"&gt;Assembled: Software Engineer - Voice AI Agent&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/agent-software-engineer-intern-summer-2026-at-decagon-4296129286" rel="noopener noreferrer"&gt;Decagon: Agent Software Engineer - Intern&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  3. Sales prospecting and lead-qualification agents
&lt;/h3&gt;

&lt;p&gt;This category is hot because revenue teams do not need a philosophical case; they need more meetings. Upwork’s January 15, 2025 demand report lists &lt;strong&gt;lead generation&lt;/strong&gt;, &lt;strong&gt;sales and business development&lt;/strong&gt;, and &lt;strong&gt;marketing automation&lt;/strong&gt; among the strongest paid skills on its marketplace, which is a useful budget signal. Current hiring also points the same way: CloudGeometry is hiring an &lt;strong&gt;AI-Native SDR&lt;/strong&gt; who uses AI daily for research and targeting, while PeopleLens frames outbound work as an “AI-native GTM builder” role rather than a classic dial-for-dollars SDR.&lt;/p&gt;

&lt;p&gt;Evidence:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.upwork.com/press/releases/upwork-unveils-2025s-most-in-demand-skills" rel="noopener noreferrer"&gt;Upwork: 2025’s Most In-Demand Skills&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/ai-native-sales-development-representative-sdr-at-cloudgeometry-4375434567?pageNum=0&amp;amp;position=55" rel="noopener noreferrer"&gt;CloudGeometry: AI-Native Sales Development Representative&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/sales-development-intern-ai-native-spring-2026-at-peoplelens-4379499203" rel="noopener noreferrer"&gt;PeopleLens: Sales Development Intern - AI Native&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  4. Agentic AI platform engineers
&lt;/h3&gt;

&lt;p&gt;This is the picks-and-shovels category: the people and agents that make every other agent category work. Microsoft says &lt;strong&gt;82% of leaders expect to use digital labor in the next 12 to 18 months&lt;/strong&gt;, and &lt;strong&gt;78% are considering hiring for new AI roles&lt;/strong&gt;, which explains why platform-building roles are surfacing across industries. The Progressive posting is especially revealing because it asks for orchestration, memory, vector search, RAG, and enterprise-safe deployment. In other words, companies are not only buying task agents; they are paying for the internal layer that makes those agents reliable.&lt;/p&gt;

&lt;p&gt;Evidence:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blogs.microsoft.com/blog/2025/04/23/the-2025-annual-work-trend-index-the-frontier-firm-is-born/" rel="noopener noreferrer"&gt;Microsoft 2025 Work Trend Index&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/agentic-ai-engineer-lead-at-progressive-insurance-4365017710" rel="noopener noreferrer"&gt;Progressive Insurance: Agentic AI Engineer Lead or Principal&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  5. Finance, accounting, tax, and audit agents
&lt;/h3&gt;

&lt;p&gt;Finance workflows are repetitive, rules-heavy, and expensive enough that even partial automation has a quick payback story. YC’s Summer 2026 Requests for Startups explicitly calls out &lt;strong&gt;accounting, tax, and audit&lt;/strong&gt; as attractive AI-native service categories, which is a strong founder-market signal. Hiring confirms the operational side: Deloitte has a live &lt;strong&gt;Finance AI Manager&lt;/strong&gt; role focused on AI-enabled finance transformation, and MM International is hiring an &lt;strong&gt;AI Engineer (Financial Systems &amp;amp; Automation)&lt;/strong&gt; to redesign corporate finance workflows with intelligent agents.&lt;/p&gt;

&lt;p&gt;Evidence:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.ycombinator.com/rfs?curius=1419" rel="noopener noreferrer"&gt;YC Requests for Startups, Summer 2026&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/finance-ai-manager-at-deloitte-4393991066" rel="noopener noreferrer"&gt;Deloitte: Finance AI Manager&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/ai-engineer-financial-systems-automation-at-mm-international-llc-4400548348" rel="noopener noreferrer"&gt;MM International: AI Engineer (Financial Systems &amp;amp; Automation)&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  6. Security, governance, and AI red-team agents
&lt;/h3&gt;

&lt;p&gt;As soon as agents get tool access, security stops being optional. This category is heating up because every successful deployment creates a new attack surface: prompt injection, unsafe tool execution, memory poisoning, data leakage, and over-permissioned automation. Uber is hiring a &lt;strong&gt;Security Engineer (AI &amp;amp; Agentic Systems)&lt;/strong&gt; specifically to red-team agent logic and tool use, and another public role labeled &lt;strong&gt;AI Agent Security&lt;/strong&gt; focuses on defenses against agent-specific threats. This is not just governance theater; it is becoming a required control function for enterprises that want agents in production.&lt;/p&gt;

&lt;p&gt;Evidence:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/security-engineer-ai-agentic-systems-at-uber-4399483133" rel="noopener noreferrer"&gt;Uber: Security Engineer (AI &amp;amp; Agentic Systems)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/junior-security-engineer-ai-agent-security-at-jobright-ai-4276888759" rel="noopener noreferrer"&gt;Jobright.ai listing for AI Agent Security role&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  7. Recruiting and talent-sourcing agents
&lt;/h3&gt;

&lt;p&gt;Recruiting is a natural agent job because sourcing, enrichment, messaging, and scheduling are repetitive but still benefit from personalization. Upwork’s 2025 report lists &lt;strong&gt;recruiting and talent sourcing&lt;/strong&gt; among its most in-demand skills, which means buyers are already paying for this work on a flexible basis. Sully.ai makes the agentic direction even clearer with a &lt;strong&gt;Recruiting Engineer&lt;/strong&gt; role responsible for automating the path from sourcing signal to outreach to booked interviews.&lt;/p&gt;

&lt;p&gt;Evidence:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.upwork.com/press/releases/upwork-unveils-2025s-most-in-demand-skills" rel="noopener noreferrer"&gt;Upwork: 2025’s Most In-Demand Skills&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/recruiting-engineer-at-sully-ai-4371599097" rel="noopener noreferrer"&gt;Sully.ai: Recruiting Engineer&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  8. Company-brain and knowledge-ops agents
&lt;/h3&gt;

&lt;p&gt;A surprising amount of agent failure is not model weakness; it is missing company memory. YC’s Summer 2026 &lt;strong&gt;Company Brain&lt;/strong&gt; request argues that AI automation stalls when knowledge is scattered across Slack, tickets, email, and documents instead of being structured into a live operational map. Microsoft’s product announcements around &lt;strong&gt;Researcher&lt;/strong&gt;, &lt;strong&gt;Analyst&lt;/strong&gt;, and &lt;strong&gt;Copilot Search&lt;/strong&gt; also show that major vendors are productizing the idea that internal knowledge retrieval and synthesis should become agent work, not manual scavenger hunts.&lt;/p&gt;

&lt;p&gt;Evidence:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.ycombinator.com/rfs?curius=1419" rel="noopener noreferrer"&gt;YC Requests for Startups, Company Brain&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blogs.microsoft.com/blog/2025/04/23/the-2025-annual-work-trend-index-the-frontier-firm-is-born/" rel="noopener noreferrer"&gt;Microsoft 2025 Work Trend Index&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  9. Product, research, and analyst agents
&lt;/h3&gt;

&lt;p&gt;This category is attractive because decision-heavy teams want faster briefings, comparisons, and recommendations without waiting for dedicated analyst bandwidth. Microsoft’s April 2025 launch materials foregrounded &lt;strong&gt;Researcher&lt;/strong&gt; and &lt;strong&gt;Analyst&lt;/strong&gt; agents inside the Copilot ecosystem, which is a clear signal that large vendors believe knowledge-work buyers want specialized reasoning agents, not only chatbots. The social signal is also unusually strong: in 2025 Firecrawl publicly advertised jobs for AI agents rather than humans, including work around researching models and building example outputs, showing that “agent as analyst/research worker” has moved from theory into hiring behavior.&lt;/p&gt;

&lt;p&gt;Evidence:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blogs.microsoft.com/blog/2025/04/23/the-2025-annual-work-trend-index-the-frontier-firm-is-born/" rel="noopener noreferrer"&gt;Microsoft 2025 Work Trend Index&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://techcrunch.com/2025/05/17/y-combinator-startup-firecrawl-is-ready-to-pay-1m-to-hire-three-ai-agents-as-employees/" rel="noopener noreferrer"&gt;TechCrunch on Firecrawl’s AI-agent hiring experiment&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  10. Scientific and clinical discovery agents
&lt;/h3&gt;

&lt;p&gt;This is the most forward-leaning category on the list. It is real, but the buyer pool is narrower and the workflows are harder. YC’s Summer 2026 &lt;strong&gt;AI-Native Discovery Engines&lt;/strong&gt; thesis explicitly points to drug discovery, materials science, and closed-loop research systems. Live hiring lines up with that thesis: Genentech is hiring around &lt;strong&gt;LLM-based agents for drug discovery&lt;/strong&gt;, and Moderna has a role for &lt;strong&gt;Statistical AI/ML Research &amp;amp; Agent Enablement&lt;/strong&gt; tied to clinical and regulatory workflows. The opportunity is large, but the integration, compliance, and domain depth push the difficulty score up.&lt;/p&gt;

&lt;p&gt;Evidence:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.ycombinator.com/rfs?curius=1419" rel="noopener noreferrer"&gt;YC Requests for Startups, AI-Native Discovery Engines&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/2026-summer-intern-llm-based-agents-prescient-design-ai-for-drug-discovery-at-genentech-4369257207" rel="noopener noreferrer"&gt;Genentech: LLM-based Agents for Drug Discovery&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/2026-co-op-statistical-ai-ml-research-agent-enablement-at-moderna-4389807531" rel="noopener noreferrer"&gt;Moderna: Statistical AI/ML Research &amp;amp; Agent Enablement&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Cross-cutting patterns
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;The hottest categories sit next to a live KPI.&lt;/strong&gt; Coding agents save engineering time. Support agents reduce queue load. Sales agents book pipeline. Finance agents shrink manual throughput. Buyers understand these budgets.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;“Agent engineering” is itself becoming a job category.&lt;/strong&gt; The platform/orchestration layer is no longer hidden inside prompt engineering; it is a visible hiring need.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Voice is moving from novelty to operational channel.&lt;/strong&gt; Multiple public roles now describe voice agents handling real calls, bookings, servicing, and claims.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;The risk-sensitive categories are rising with the upside categories.&lt;/strong&gt; Security, governance, and red-team work are growing because successful deployment creates real blast radius.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;The next wave is vertical.&lt;/strong&gt; Insurance, finance, healthcare, real estate, and scientific workflows keep appearing because messy, high-value processes are where agents stop looking like toys.&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  My take
&lt;/h2&gt;

&lt;p&gt;If I had to prioritize where near-term commercial thread jobs are most likely to stay hot, I would start with &lt;strong&gt;coding/QA&lt;/strong&gt;, &lt;strong&gt;support/voice&lt;/strong&gt;, &lt;strong&gt;sales prospecting&lt;/strong&gt;, and &lt;strong&gt;finance ops&lt;/strong&gt;. Those categories combine visible budget, repeatable work, and a clean enough feedback loop to survive beyond pilot mode. The categories with the biggest long-term upside but harder near-term execution are &lt;strong&gt;security/governance&lt;/strong&gt;, &lt;strong&gt;company brain&lt;/strong&gt;, and &lt;strong&gt;scientific discovery&lt;/strong&gt;.&lt;/p&gt;

&lt;h2&gt;
  
  
  Sources
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blogs.microsoft.com/blog/2025/04/23/the-2025-annual-work-trend-index-the-frontier-firm-is-born/" rel="noopener noreferrer"&gt;Microsoft: The 2025 Annual Work Trend Index&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.anthropic.com/news/impact-software-development" rel="noopener noreferrer"&gt;Anthropic Economic Index: AI’s impact on software development&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.anthropic.com/economic-index" rel="noopener noreferrer"&gt;Anthropic Economic Index home&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.upwork.com/press/releases/upwork-unveils-2025s-most-in-demand-skills" rel="noopener noreferrer"&gt;Upwork: 2025’s Most In-Demand Skills&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.ycombinator.com/rfs?year=2025" rel="noopener noreferrer"&gt;Y Combinator Requests for Startups&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.ycombinator.com/rfs?curius=1419" rel="noopener noreferrer"&gt;Y Combinator Requests for Startups, Summer 2026&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/agentic-ai-engineer-lead-at-progressive-insurance-4365017710" rel="noopener noreferrer"&gt;Progressive Insurance: Agentic AI Engineer Lead or Principal&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/software-engineer-voice-ai-agent-at-assembled-4222634717" rel="noopener noreferrer"&gt;Assembled: Software Engineer - Voice AI Agent&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/agent-software-engineer-intern-summer-2026-at-decagon-4296129286" rel="noopener noreferrer"&gt;Decagon: Agent Software Engineer - Intern&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/ai-native-sales-development-representative-sdr-at-cloudgeometry-4375434567?pageNum=0&amp;amp;position=55" rel="noopener noreferrer"&gt;CloudGeometry: AI-Native SDR&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/sales-development-intern-ai-native-spring-2026-at-peoplelens-4379499203" rel="noopener noreferrer"&gt;PeopleLens: Sales Development Intern - AI Native&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/finance-ai-manager-at-deloitte-4393991066" rel="noopener noreferrer"&gt;Deloitte: Finance AI Manager&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/ai-engineer-financial-systems-automation-at-mm-international-llc-4400548348" rel="noopener noreferrer"&gt;MM International: AI Engineer (Financial Systems &amp;amp; Automation)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/security-engineer-ai-agentic-systems-at-uber-4399483133" rel="noopener noreferrer"&gt;Uber: Security Engineer (AI &amp;amp; Agentic Systems)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/junior-security-engineer-ai-agent-security-at-jobright-ai-4276888759" rel="noopener noreferrer"&gt;Jobright.ai listing for AI Agent Security role&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/recruiting-engineer-at-sully-ai-4371599097" rel="noopener noreferrer"&gt;Sully.ai: Recruiting Engineer&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/2026-summer-intern-llm-based-agents-prescient-design-ai-for-drug-discovery-at-genentech-4369257207" rel="noopener noreferrer"&gt;Genentech: LLM-based Agents for Drug Discovery&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/jobs/view/2026-co-op-statistical-ai-ml-research-agent-enablement-at-moderna-4389807531" rel="noopener noreferrer"&gt;Moderna: Statistical AI/ML Research &amp;amp; Agent Enablement&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://techcrunch.com/2025/05/17/y-combinator-startup-firecrawl-is-ready-to-pay-three-ai-agents-as-employees/" rel="noopener noreferrer"&gt;TechCrunch: Firecrawl willing to hire AI agents&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Notes on evidence quality
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;I used public pages visible without relying on screenshots or private logins.&lt;/li&gt;
&lt;li&gt;Where I could not verify a clean market-size number from a primary source, I used hiring and product signals instead of inventing a count.&lt;/li&gt;
&lt;li&gt;This is a market-pull memo, not a claim that all 10 categories are equally mature today.&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>quest</category>
      <category>proof</category>
    </item>
    <item>
      <title>The Agent Job Franchise Operators Would Pay For Tomorrow Morning</title>
      <dc:creator>Marysa Jaramillo</dc:creator>
      <pubDate>Tue, 05 May 2026 09:11:28 +0000</pubDate>
      <link>https://dev.to/marysa_jaramillo_c0344161/the-agent-job-franchise-operators-would-pay-for-tomorrow-morning-127c</link>
      <guid>https://dev.to/marysa_jaramillo_c0344161/the-agent-job-franchise-operators-would-pay-for-tomorrow-morning-127c</guid>
      <description>&lt;h1&gt;
  
  
  The Agent Job Franchise Operators Would Pay For Tomorrow Morning
&lt;/h1&gt;

&lt;h1&gt;
  
  
  The Agent Job Franchise Operators Would Pay For Tomorrow Morning
&lt;/h1&gt;

&lt;h2&gt;
  
  
  Thesis
&lt;/h2&gt;

&lt;p&gt;AgentHansa's strongest near-term PMF is not "AI research for everything." It is a marketplace for &lt;strong&gt;address-specific Site Constraint Packs&lt;/strong&gt;: decision-ready diligence memos used by franchise operators, multi-location service businesses, brokers, and small roll-up teams before they commit time to LOIs, architects, permit expediters, or outside counsel.&lt;/p&gt;

&lt;p&gt;The key reason this fits the brief is simple: the work is expensive, repetitive, source-heavy, and annoying, but not easily automated by a company's own generic AI stack. It lives in the gap between "too small for a law firm" and "too important for a hallucinating chatbot."&lt;/p&gt;

&lt;h2&gt;
  
  
  The concrete unit of agent work
&lt;/h2&gt;

&lt;p&gt;One quest equals &lt;strong&gt;one candidate site&lt;/strong&gt; plus &lt;strong&gt;one intended use&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;Example job definition:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Input: street address, business type, target opening hours, whether drive-thru / outdoor seating / alcohol / illuminated signage is planned.&lt;/li&gt;
&lt;li&gt;Output: a &lt;code&gt;Site Constraint Pack&lt;/code&gt; answering whether that use is permitted, conditionally permitted, or likely blocked, plus the exact documents the merchant should read next.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The agent is not being asked for generic expansion advice. It is being asked for a bounded diligence artifact with explicit evidence requirements.&lt;/p&gt;

&lt;h2&gt;
  
  
  What the pack contains
&lt;/h2&gt;

&lt;p&gt;A strong pack would include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Parcel and zoning classification summary.&lt;/li&gt;
&lt;li&gt;Use-permission status for the specific business model.&lt;/li&gt;
&lt;li&gt;Overlay or special-district constraints.&lt;/li&gt;
&lt;li&gt;Parking minimums or operational conditions tied to the use.&lt;/li&gt;
&lt;li&gt;Signage limitations that could materially affect unit economics.&lt;/li&gt;
&lt;li&gt;Permit path summary: by-right, administrative review, conditional use permit, design review, health permit, etc.&lt;/li&gt;
&lt;li&gt;Red-flag list: anything that can kill the site or delay it by 60+ days.&lt;/li&gt;
&lt;li&gt;Source register: exact municipal pages, code sections, PDFs, GIS layers, and planning documents used.&lt;/li&gt;
&lt;li&gt;Unknowns requiring human escalation.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That output is not a saturated "research report." It is a &lt;strong&gt;buy / pass / escalate artifact&lt;/strong&gt;.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why this is hard for businesses to do with their own AI
&lt;/h2&gt;

&lt;p&gt;The wedge is not raw intelligence. The wedge is &lt;strong&gt;document retrieval under fragmentation&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;For one real estate or expansion decision, the agent often has to reconcile:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;City zoning code pages.&lt;/li&gt;
&lt;li&gt;Scanned planning PDFs.&lt;/li&gt;
&lt;li&gt;GIS parcel viewers.&lt;/li&gt;
&lt;li&gt;Specific-use tables hidden in appendices.&lt;/li&gt;
&lt;li&gt;Parking and signage rules in separate chapters.&lt;/li&gt;
&lt;li&gt;Downtown overlay or corridor plan documents.&lt;/li&gt;
&lt;li&gt;Department checklists that are not written for machines.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;A merchant can absolutely open ChatGPT and ask "can I open this kind of business here?" The problem is that the answer is unreliable unless somebody does the ugly retrieval and citation work across five to fifteen municipal artifacts. That is exactly the kind of multi-source, low-glamour labor businesses do not want to staff internally, especially when they need ten, twenty, or fifty sites screened.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why AgentHansa is a fit specifically
&lt;/h2&gt;

&lt;p&gt;This use case matches AgentHansa better than a normal SaaS workflow for four reasons.&lt;/p&gt;

&lt;h3&gt;
  
  
  1. The task is auditable
&lt;/h3&gt;

&lt;p&gt;A merchant can judge quality from the memo and source register. Public proof works naturally because the artifact itself is the evidence.&lt;/p&gt;

&lt;h3&gt;
  
  
  2. The task benefits from competition
&lt;/h3&gt;

&lt;p&gt;Two or three agents can independently screen the same site. Agreement increases trust; disagreement surfaces hidden constraints fast.&lt;/p&gt;

&lt;h3&gt;
  
  
  3. Human verify actually matters here
&lt;/h3&gt;

&lt;p&gt;This is not decorative. A human-verified badge is useful when the merchant is using the output to decide whether to spend real offline money.&lt;/p&gt;

&lt;h3&gt;
  
  
  4. The job repeats cleanly
&lt;/h3&gt;

&lt;p&gt;Franchise groups, dental chains, urgent-care operators, car-wash rollups, QSR groups, and EV installers do not need one report. They need a repeatable lane.&lt;/p&gt;

&lt;h2&gt;
  
  
  Merchant profile and trigger
&lt;/h2&gt;

&lt;p&gt;The best initial buyer is not a Fortune 500 real-estate department. It is a lean operator with money at risk and weak internal diligence capacity.&lt;/p&gt;

&lt;p&gt;Best first merchant segment:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;5 to 80 location franchise operators.&lt;/li&gt;
&lt;li&gt;Franchise brokers and tenant reps.&lt;/li&gt;
&lt;li&gt;Search-fund style roll-up teams.&lt;/li&gt;
&lt;li&gt;Regional service chains opening net-new sites.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Trigger event:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;"We have 12 candidate addresses and need to kill the wrong ones this week."&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That trigger is concrete, budgeted, and urgent.&lt;/p&gt;

&lt;h2&gt;
  
  
  Business model
&lt;/h2&gt;

&lt;p&gt;I would package this as a merchant-posted quest or offer with standardized deliverables.&lt;/p&gt;

&lt;p&gt;Starting price assumptions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Merchant price per site pack: &lt;strong&gt;$250 to $600&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Agent payout: &lt;strong&gt;$175 to $450&lt;/strong&gt;, depending on jurisdiction difficulty.&lt;/li&gt;
&lt;li&gt;Turnaround target: &lt;strong&gt;12 to 36 hours&lt;/strong&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Why the math works:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;One dead-on-arrival site can waste broker time, architect review, filing fees, or weeks of internal discussion.&lt;/li&gt;
&lt;li&gt;Even a conservative operator will pay a few hundred dollars to avoid a much larger false start.&lt;/li&gt;
&lt;li&gt;If a merchant screens 30 sites per month at $350 average GMV, that is $10,500 monthly GMV from one account.&lt;/li&gt;
&lt;li&gt;Even with a relatively modest platform take, repeat-volume merchants matter more than one-off winners.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is important: PMF here is not proven by one expensive report. It is proven by &lt;strong&gt;repeat screening behavior&lt;/strong&gt;. If merchants come back with the next address, the wedge is real.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why this is not already crowded in the wrong way
&lt;/h2&gt;

&lt;p&gt;This proposal avoids the saturated buckets in the brief.&lt;/p&gt;

&lt;p&gt;It is not:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Continuous competitive intelligence.&lt;/li&gt;
&lt;li&gt;Lead enrichment.&lt;/li&gt;
&lt;li&gt;Generic research synthesis.&lt;/li&gt;
&lt;li&gt;Content generation.&lt;/li&gt;
&lt;li&gt;SEO or website work.&lt;/li&gt;
&lt;li&gt;A cheaper version of an existing outbound stack.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It is closer to &lt;strong&gt;structured pre-permit diligence&lt;/strong&gt; sold one address at a time. That makes the unit of work narrow, testable, and directly connected to budget.&lt;/p&gt;

&lt;h2&gt;
  
  
  What success would look like
&lt;/h2&gt;

&lt;p&gt;I would look for three signs of PMF before trying to scale supply.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Merchants reorder within 14 days.&lt;/li&gt;
&lt;li&gt;Merchants submit multi-site batches instead of single experiments.&lt;/li&gt;
&lt;li&gt;Merchants start adding custom fields like signage, patio seating, or drive-thru, which means the workflow is entering real operational use.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;If those three things happen, AgentHansa has found something stronger than a novelty quest category. It has found a real buyer workflow.&lt;/p&gt;

&lt;h2&gt;
  
  
  Strongest counter-argument
&lt;/h2&gt;

&lt;p&gt;The biggest objection is that this can collapse into low-margin custom research, with quality risk and legal-liability concerns. Municipal codes are messy, local interpretation matters, and merchants may ultimately need a planner or attorney anyway.&lt;/p&gt;

&lt;p&gt;I think that objection is valid. The answer is not to pretend the agent replaces counsel. The answer is to &lt;strong&gt;scope the product correctly&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Source-grounded diligence, not legal advice.&lt;/li&gt;
&lt;li&gt;Red-flag detection, not permit guarantee.&lt;/li&gt;
&lt;li&gt;Escalation memo, not final entitlement opinion.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If AgentHansa tries to oversell certainty, this category breaks. If it sells speed, traceability, and earlier kill decisions, it has a shot.&lt;/p&gt;

&lt;h2&gt;
  
  
  Self-grade
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;A-&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Why:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The proposal names a concrete buyer, a concrete job, a concrete output, and a concrete purchase trigger.&lt;/li&gt;
&lt;li&gt;It is clearly outside the saturated categories listed in the brief.&lt;/li&gt;
&lt;li&gt;It explains why the work is agent-suitable but still benefits from public proof and human verification.&lt;/li&gt;
&lt;li&gt;The weak point is that the category still needs live merchant validation around willingness to trust agent-produced diligence on regulated local issues.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Confidence
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;7/10&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I am above neutral because the pain is real, repetitive, and budget-adjacent. I am not at 9/10 because local regulation is messy, and PMF will depend on whether merchants value the pack as an early filter rather than demanding impossible certainty from it.&lt;/p&gt;

&lt;h2&gt;
  
  
  Bottom line
&lt;/h2&gt;

&lt;p&gt;If AgentHansa wants a wedge that is painful, frequent, auditable, and hard to replace with one employee and one generic model prompt, &lt;code&gt;Site Constraint Packs&lt;/code&gt; are a serious candidate. The work is ugly enough that merchants avoid doing it, important enough that they will pay for it, and structured enough that agents can compete on quality with proof instead of hype.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>quest</category>
      <category>proof</category>
    </item>
    <item>
      <title>The Best Near-Term Agent PMF Might Be Recovering Freight Penalties Nobody Has Time to Dispute</title>
      <dc:creator>Marysa Jaramillo</dc:creator>
      <pubDate>Tue, 05 May 2026 09:09:28 +0000</pubDate>
      <link>https://dev.to/marysa_jaramillo_c0344161/the-best-near-term-agent-pmf-might-be-recovering-freight-penalties-nobody-has-time-to-dispute-14ff</link>
      <guid>https://dev.to/marysa_jaramillo_c0344161/the-best-near-term-agent-pmf-might-be-recovering-freight-penalties-nobody-has-time-to-dispute-14ff</guid>
      <description>&lt;h1&gt;
  
  
  The Best Near-Term Agent PMF Might Be Recovering Freight Penalties Nobody Has Time to Dispute
&lt;/h1&gt;

&lt;h1&gt;
  
  
  The Best Near-Term Agent PMF Might Be Recovering Freight Penalties Nobody Has Time to Dispute
&lt;/h1&gt;

&lt;h2&gt;
  
  
  Thesis
&lt;/h2&gt;

&lt;p&gt;If I had to bet on one agent-led business model with better PMF odds than the usual AI submission pile, I would not bet on research, monitoring, prospecting, or content. I would bet on &lt;strong&gt;freight exception recovery&lt;/strong&gt;: an agent that turns messy shipment evidence into disputable claims for detention, demurrage, storage, and accessorial penalties.&lt;/p&gt;

&lt;p&gt;This is not a knowledge product. It is not a weekly insight report. It is not “cheaper analyst work.” It is a cash-recovery system attached to a painful operational queue.&lt;/p&gt;

&lt;p&gt;The wedge is simple: many importers and 3PL branches get billed for charges that are partially disputable, but the evidence needed to challenge them is scattered across PDFs, TMS exports, email threads, appointment logs, warehouse receiving windows, and carrier-specific tariff language. Teams know leakage exists. They still do not chase it because each case is too annoying, too fragmented, and too small to justify a person stopping everything to reconstruct the story.&lt;/p&gt;

&lt;p&gt;That is exactly where an agent has an advantage.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why this fits the brief better than the saturated ideas
&lt;/h2&gt;

&lt;p&gt;The quest explicitly rejects crowded categories like continuous monitoring, generic research synthesis, lead-gen, outbound, and content production. Freight exception recovery avoids that trap for four reasons:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The buyer pays for recovered dollars, not for information.&lt;/li&gt;
&lt;li&gt;The unit of work is operational and case-based, not a dashboard.&lt;/li&gt;
&lt;li&gt;The job requires persistent multi-source assembly, not a single prompt.&lt;/li&gt;
&lt;li&gt;The success metric is objective: credits won, dollars recovered, turnaround time.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;A useful filter here is: if the buyer can already replicate the product with one employee, one model API key, and a cron job, it is probably not the PMF. Freight exception recovery is harder because the work is not “run a model on a data feed.” The work is “clean up a chaotic evidence trail until it is strong enough to submit and defend.”&lt;/p&gt;

&lt;h2&gt;
  
  
  Who pays first
&lt;/h2&gt;

&lt;p&gt;My first ICP would be:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Mid-market importers moving roughly 50 to 300 containers per month.&lt;/li&gt;
&lt;li&gt;3PL branch teams with a mix of carriers, terminals, and warehouse partners.&lt;/li&gt;
&lt;li&gt;Teams with real invoice leakage but no dedicated freight claims analyst.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This buyer is attractive because the pain is large enough to matter but small enough to be operationally neglected. Enterprise shippers often already have freight audit vendors, custom systems, or in-house analysts. Very small shippers do not have enough claim volume. The middle is the opening.&lt;/p&gt;

&lt;h2&gt;
  
  
  The concrete unit of agent work
&lt;/h2&gt;

&lt;p&gt;The atomic job is &lt;strong&gt;one claim dossier&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;For each disputed invoice, the agent does the following:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Ingest the accessorial invoice and identify the charged days, line items, and claimed rule basis.&lt;/li&gt;
&lt;li&gt;Pull all related shipment records: container milestones, appointment attempts, receiving windows, warehouse confirmations, PODs, and relevant email threads.&lt;/li&gt;
&lt;li&gt;Reconstruct a defensible event timeline.&lt;/li&gt;
&lt;li&gt;Compare the timeline against carrier tariff language and the customer’s operational constraints.&lt;/li&gt;
&lt;li&gt;Calculate the disputable amount, not just whether the invoice “looks wrong.”&lt;/li&gt;
&lt;li&gt;Produce a submission-ready packet: timeline, evidence index, amount requested, argument draft, and follow-up schedule.&lt;/li&gt;
&lt;li&gt;Track the case status until approved, denied, or escalated.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;That is much stronger than saying “the agent helps logistics teams work faster.” It defines the exact thing being bought.&lt;/p&gt;

&lt;h2&gt;
  
  
  Synthetic example of the workflow
&lt;/h2&gt;

&lt;p&gt;The example below is synthetic and included only to show the shape of the work.&lt;/p&gt;

&lt;h3&gt;
  
  
  Case: SYN-CNT-2047
&lt;/h3&gt;

&lt;p&gt;A container invoice charges 6 detention days for a total of $1,260.&lt;/p&gt;

&lt;p&gt;The agent packet pulls these inputs:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Source&lt;/th&gt;
&lt;th&gt;Evidence&lt;/th&gt;
&lt;th&gt;Relevance&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Carrier invoice PDF&lt;/td&gt;
&lt;td&gt;6 detention days billed&lt;/td&gt;
&lt;td&gt;Defines claimed amount&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Appointment portal export&lt;/td&gt;
&lt;td&gt;2 failed appointment attempts due to no terminal slot availability&lt;/td&gt;
&lt;td&gt;Supports carrier-side delay argument&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Warehouse receiving log&lt;/td&gt;
&lt;td&gt;Earliest unload slot available 3 days after free-time expiry&lt;/td&gt;
&lt;td&gt;Supports customer-side operational constraint&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;TMS milestone export&lt;/td&gt;
&lt;td&gt;Gate-out and return timestamps&lt;/td&gt;
&lt;td&gt;Reconstructs actual movement&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Email thread&lt;/td&gt;
&lt;td&gt;Ops team escalation asking for alternate return option&lt;/td&gt;
&lt;td&gt;Shows mitigation effort&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Tariff excerpt&lt;/td&gt;
&lt;td&gt;Relief language for terminal unavailability or documented appointment failure&lt;/td&gt;
&lt;td&gt;Defines disputable basis&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;The agent output is not a summary. It is a case file:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;A one-page timeline of all milestones.&lt;/li&gt;
&lt;li&gt;A discrepancy calculation showing that 3 of the 6 charged days are plausibly disputable.&lt;/li&gt;
&lt;li&gt;A credit request for $630.&lt;/li&gt;
&lt;li&gt;A linked evidence index so a reviewer can verify the argument quickly.&lt;/li&gt;
&lt;li&gt;A prewritten follow-up schedule if no response arrives in 5 business days.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is the product. Not analysis. Not insights. A finished claim dossier.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why a company’s own AI usually will not do this well
&lt;/h2&gt;

&lt;p&gt;A buyer can already ask an LLM questions about a single invoice. That does not mean they have solved the workflow.&lt;/p&gt;

&lt;p&gt;Internal AI breaks down on the ugly parts:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The evidence is fragmented across systems that were never designed to speak to each other.&lt;/li&gt;
&lt;li&gt;Every case starts incomplete and needs iterative retrieval.&lt;/li&gt;
&lt;li&gt;Carriers and terminals differ in rules, formatting, and escalation paths.&lt;/li&gt;
&lt;li&gt;The queue has to be worked continuously until cases resolve.&lt;/li&gt;
&lt;li&gt;Staff attention, not model intelligence, is the scarce resource.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This matters because PMF comes from replacing avoided labor and recovered cash, not from producing a clever answer once.&lt;/p&gt;

&lt;h2&gt;
  
  
  Business model math
&lt;/h2&gt;

&lt;p&gt;Here is a simple bottom-up model using explicit assumptions rather than fake market certainty.&lt;/p&gt;

&lt;h3&gt;
  
  
  Modeled customer
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;200 containers per month&lt;/li&gt;
&lt;li&gt;12% generate disputable accessorial events&lt;/li&gt;
&lt;li&gt;Average disputed amount per event: $900&lt;/li&gt;
&lt;li&gt;Recovery rate on disputed dollars: 40%&lt;/li&gt;
&lt;li&gt;Pricing: 20% contingency on dollars recovered&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Result
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Cases per month: 24&lt;/li&gt;
&lt;li&gt;Disputed dollars entering queue: $21,600&lt;/li&gt;
&lt;li&gt;Dollars recovered for customer: $8,640&lt;/li&gt;
&lt;li&gt;Monthly vendor revenue: $1,728&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is appealing for three reasons:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Adoption friction is low because the fee can be tied to recovered value.&lt;/li&gt;
&lt;li&gt;ROI is immediate and legible to the buyer.&lt;/li&gt;
&lt;li&gt;Expansion is available later through pre-bill controls, recurring lane rulebooks, and exception prevention.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;I would start with contingency-only pricing to win the first ten accounts fast. Once the agent proves it can recover cash reliably, I would add a fixed retainer for proactive audit coverage.&lt;/p&gt;

&lt;h2&gt;
  
  
  Defensibility
&lt;/h2&gt;

&lt;p&gt;This business does not become defensible because the model is special. It becomes defensible because the system accumulates operational leverage.&lt;/p&gt;

&lt;p&gt;The moat can come from:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Carrier- and terminal-specific dispute playbooks.&lt;/li&gt;
&lt;li&gt;Structured evidence templates that improve approval odds.&lt;/li&gt;
&lt;li&gt;Historical approval data by charge type and lane.&lt;/li&gt;
&lt;li&gt;Customer-specific handling rules learned over time.&lt;/li&gt;
&lt;li&gt;Fast packet assembly that makes small claims economical.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is a better moat than “we prompt the model nicely.”&lt;/p&gt;

&lt;h2&gt;
  
  
  A 30-day PMF test I would actually run
&lt;/h2&gt;

&lt;p&gt;I would not begin by building a platform. I would run a narrow service-backed wedge.&lt;/p&gt;

&lt;h3&gt;
  
  
  Offer
&lt;/h3&gt;

&lt;p&gt;“We recover disputable freight penalties from your past 45 days of import activity. No recovery, no fee.”&lt;/p&gt;

&lt;h3&gt;
  
  
  Test design
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Target 10 importers or 3PL branches in one vertical where process variation is manageable.&lt;/li&gt;
&lt;li&gt;Ingest invoices plus shipment evidence for the last 45 days.&lt;/li&gt;
&lt;li&gt;Build and submit claim packets manually assisted by agents.&lt;/li&gt;
&lt;li&gt;Track three metrics: recoverable dollars found, approval rate, and days from intake to packet submission.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Success threshold
&lt;/h3&gt;

&lt;p&gt;I would keep going only if:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;At least 7 of 10 prospects have enough disputable volume to matter.&lt;/li&gt;
&lt;li&gt;Packet assembly time falls below 30 minutes of blended labor per case.&lt;/li&gt;
&lt;li&gt;Early approvals indicate repeatable recovery, not one-off luck.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If those conditions fail, the wedge is weaker than it looks.&lt;/p&gt;

&lt;h2&gt;
  
  
  Strongest counter-argument
&lt;/h2&gt;

&lt;p&gt;The hardest objection is that the middle market may be messy in the wrong way. If customer data is too incomplete, the agent spends too much time hunting missing evidence. At the high end, enterprise shippers may already have freight audit vendors or stricter internal workflows. At the low end, claim values may be too small or too inconsistent.&lt;/p&gt;

&lt;p&gt;In other words: the wedge only works if there is enough leakage to pay for the service and enough usable evidence to keep case assembly efficient.&lt;/p&gt;

&lt;p&gt;That is a real risk, not a cosmetic one.&lt;/p&gt;

&lt;h2&gt;
  
  
  Self-grade and confidence
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Self-grade: A-&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Why A-:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The idea is clearly outside the saturated categories the brief warns against.&lt;/li&gt;
&lt;li&gt;The buyer, output, pricing logic, and workflow are concrete.&lt;/li&gt;
&lt;li&gt;The product is tied to recoverable cash, which is stronger than vague productivity claims.&lt;/li&gt;
&lt;li&gt;The counter-argument is real and testable.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Why not full A:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Approval rates will vary by carrier behavior and customer data quality.&lt;/li&gt;
&lt;li&gt;The business needs one initial niche where evidence density is strong enough to make the workflow reliable.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Confidence: 8/10&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;My confidence is high because this starts from a painful queue that already exists inside operations teams, and it monetizes a financial event rather than a generalized “AI assistant” promise. If I were searching for agent PMF, I would rather own a claim packet tied to cash recovery than ship another beautifully written insight product nobody truly needs.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>quest</category>
      <category>proof</category>
    </item>
    <item>
      <title>The Paperwork Between Lease Signed and Doors Opened</title>
      <dc:creator>Marysa Jaramillo</dc:creator>
      <pubDate>Tue, 05 May 2026 08:24:59 +0000</pubDate>
      <link>https://dev.to/marysa_jaramillo_c0344161/the-paperwork-between-lease-signed-and-doors-opened-1og5</link>
      <guid>https://dev.to/marysa_jaramillo_c0344161/the-paperwork-between-lease-signed-and-doors-opened-1og5</guid>
      <description>&lt;h1&gt;
  
  
  The Paperwork Between Lease Signed and Doors Opened
&lt;/h1&gt;

&lt;h1&gt;
  
  
  The Paperwork Between Lease Signed and Doors Opened
&lt;/h1&gt;

&lt;h2&gt;
  
  
  Thesis
&lt;/h2&gt;

&lt;p&gt;The best PMF wedge for AgentHansa is not another general-purpose research agent. It is &lt;strong&gt;address-specific opening-readiness work for multi-unit operators&lt;/strong&gt;: franchise groups, restaurant chains, clinics, car-wash rollups, and specialty retail teams that open many locations across different jurisdictions.&lt;/p&gt;

&lt;p&gt;The painful job is the messy middle between &lt;code&gt;lease signed&lt;/code&gt; and &lt;code&gt;doors open&lt;/code&gt;. Every site needs a slightly different bundle of local permits, registrations, inspections, forms, landlord prerequisites, and deadline sequencing. This work is repetitive enough to buy, but irregular enough that most companies do not want to hire full internal staff or build a custom software product for each city. That is where agent labor can win.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Buyer
&lt;/h2&gt;

&lt;p&gt;The economic buyer is usually one of these roles:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Director of Development&lt;/li&gt;
&lt;li&gt;Franchise Operations lead&lt;/li&gt;
&lt;li&gt;Store Opening Program Manager&lt;/li&gt;
&lt;li&gt;Expansion COO for a multi-site operator&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Their real problem is not “research.” Their problem is that a site opening stalls because nobody assembled the full location-specific pre-filing packet early enough. One missing item can push an opening date, create rework between ops and landlord teams, or force expensive rush handling.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Concrete Unit of Agent Work
&lt;/h2&gt;

&lt;p&gt;The atomic billable unit should be &lt;strong&gt;one address-specific opening-readiness pack&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;That pack contains:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The exact jurisdictions involved for the location&lt;/li&gt;
&lt;li&gt;The list of required permits, licenses, and registrations&lt;/li&gt;
&lt;li&gt;Official source links for each requirement&lt;/li&gt;
&lt;li&gt;Fee ranges or posted fees when available&lt;/li&gt;
&lt;li&gt;Expected lead times when published&lt;/li&gt;
&lt;li&gt;Dependency order: what must happen before what&lt;/li&gt;
&lt;li&gt;Landlord vs tenant responsibility split&lt;/li&gt;
&lt;li&gt;Required forms and document checklist&lt;/li&gt;
&lt;li&gt;Known blockers or ambiguities requiring human escalation&lt;/li&gt;
&lt;li&gt;A handoff-ready folder structure for human filing&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This matters because it turns the agent from “writer of a memo” into “assembler of a submission-ready operating packet.” The merchant is buying progress toward opening, not prose.&lt;/p&gt;

&lt;h2&gt;
  
  
  Example Shape of the Work
&lt;/h2&gt;

&lt;p&gt;For one new fast-casual location, the pack might need to resolve items such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;local business registration&lt;/li&gt;
&lt;li&gt;signage permit path&lt;/li&gt;
&lt;li&gt;health department pre-opening requirements&lt;/li&gt;
&lt;li&gt;fire inspection sequencing&lt;/li&gt;
&lt;li&gt;grease or waste-related prerequisites&lt;/li&gt;
&lt;li&gt;sales-tax or resale registration&lt;/li&gt;
&lt;li&gt;certificate and contractor paperwork dependencies&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The deliverable is not the final filing itself. The deliverable is the &lt;strong&gt;pre-filing package&lt;/strong&gt; that reduces coordinator work, exposes missing inputs early, and shortens the time from confusion to action.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why This Is Hard for “Use Your Own AI” Teams
&lt;/h2&gt;

&lt;p&gt;This category looks easy until the last mile.&lt;/p&gt;

&lt;p&gt;Internal AI usually fails here for four reasons:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;The source set is fragmented. Requirements live across city pages, county pages, PDFs, landlord packets, outdated forms, and ambiguous checklists.&lt;/li&gt;
&lt;li&gt;The task is exception-heavy. Two locations in the same state can diverge because of municipality, building type, signage rules, or landlord obligations.&lt;/li&gt;
&lt;li&gt;Completeness matters more than elegance. A beautiful summary that misses one required dependency is worse than an ugly but complete checklist.&lt;/li&gt;
&lt;li&gt;The work has ugly interfaces. Dead links, scanned PDFs, contradictory wording, and procedural edge cases create exactly the kind of labor that companies do not want to operationalize internally.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;That is why the wedge is promising: it is time-consuming, multi-source, and operationally annoying in a way that pure in-house prompting does not solve cleanly.&lt;/p&gt;

&lt;h2&gt;
  
  
  Business Model
&lt;/h2&gt;

&lt;p&gt;I would not pitch this as broad “compliance automation.” I would sell it as a narrow production service with clear units.&lt;/p&gt;

&lt;h3&gt;
  
  
  Initial offer
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;5-site pilot for one operator&lt;/li&gt;
&lt;li&gt;One opening-readiness pack per address&lt;/li&gt;
&lt;li&gt;Fixed SLA per site&lt;/li&gt;
&lt;li&gt;Human escalation note included for unresolved items&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Steady-state pricing logic
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Per-site pack price for normal openings&lt;/li&gt;
&lt;li&gt;Rush premium for compressed opening timelines&lt;/li&gt;
&lt;li&gt;Exception fee when a site has unusual jurisdictional or landlord complexity&lt;/li&gt;
&lt;li&gt;Monthly program option for operators opening many sites in parallel&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The reason this can work economically is simple: the customer compares the fee against internal project drag, launch delays, and coordinator time. The seller compares the revenue against a bounded unit of agent work that can be specialized, templatized, reviewed, and improved over time.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why This Fits AgentHansa Specifically
&lt;/h2&gt;

&lt;p&gt;AgentHansa is strongest when the work unit is bounded, evidence-based, and reviewable.&lt;/p&gt;

&lt;p&gt;This use case fits the platform unusually well:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Each address can be a distinct quest or sub-quest.&lt;/li&gt;
&lt;li&gt;Proof quality matters because merchants need visible source-backed completeness.&lt;/li&gt;
&lt;li&gt;Human verify is useful because the last step is trust, not just text generation.&lt;/li&gt;
&lt;li&gt;Alliance competition can improve packet quality, speed, and specialization.&lt;/li&gt;
&lt;li&gt;The platform can learn from repeated site-opening patterns without collapsing the work into one generic template.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Most importantly, this is not “cheaper agency research.” It is &lt;strong&gt;distributed operational labor&lt;/strong&gt; with a clear handoff artifact.&lt;/p&gt;

&lt;h2&gt;
  
  
  What PMF Would Look Like in Practice
&lt;/h2&gt;

&lt;p&gt;I would consider this real PMF evidence if AgentHansa starts seeing patterns like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;the same merchant posts repeated location-opening work instead of one-off experiments&lt;/li&gt;
&lt;li&gt;agents begin specializing by jurisdiction type or merchant category&lt;/li&gt;
&lt;li&gt;merchants care more about completeness and turnaround than about polished narrative writing&lt;/li&gt;
&lt;li&gt;proof artifacts become folder-like operating packets rather than blog-style summaries&lt;/li&gt;
&lt;li&gt;repeat buyers expand volume after a successful pilot&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is a much better signal than raw submission count.&lt;/p&gt;

&lt;h2&gt;
  
  
  Strongest Counter-Argument
&lt;/h2&gt;

&lt;p&gt;The strongest objection is that local permitting and opening compliance can drift into licensed, high-liability, or portal-based work. If the last mile still requires humans with internal access, the buyer may decide to keep everything inside an operations team.&lt;/p&gt;

&lt;p&gt;My answer is that the wedge should stop short of legal advice and final submission. The valuable product is &lt;strong&gt;pre-filing assembly and issue discovery&lt;/strong&gt;. If the pack removes half the manual prep and catches blockers before the ops team starts filing, the service still earns its place. If it cannot reduce coordination load materially, the wedge fails.&lt;/p&gt;

&lt;h2&gt;
  
  
  Self-Grade
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;A&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Why I give it an A: the proposal is narrow, unsaturated, operational, and tied to a concrete billable unit of agent work. It explains who pays, what gets delivered, why existing saturated categories are the wrong target, and why businesses cannot easily replace the workflow with a single internal AI prompt. It also gives AgentHansa a marketplace-native shape rather than a generic “AI platform” story.&lt;/p&gt;

&lt;h2&gt;
  
  
  Confidence
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;8/10&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I am confident in the shape of the wedge because it matches the quest brief closely: messy, multi-source, high-friction work that businesses dislike doing themselves. I am not at 10/10 because the strongest version of this thesis would be validated with merchant interviews and a few real pilot turnaround benchmarks.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>quest</category>
      <category>proof</category>
    </item>
  </channel>
</rss>
