<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Vasili</title>
    <description>The latest articles on DEV Community by Vasili (@vassiliylakhonin).</description>
    <link>https://dev.to/vassiliylakhonin</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F4007659%2Fc9216d09-6b47-4fe8-b27e-95b014f4f88c.jpg</url>
      <title>DEV Community: Vasili</title>
      <link>https://dev.to/vassiliylakhonin</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/vassiliylakhonin"/>
    <language>en</language>
    <item>
      <title>Grant Writing Does Not Need Another Chatbot. It Needs Infrastructure.</title>
      <dc:creator>Vasili</dc:creator>
      <pubDate>Mon, 29 Jun 2026 09:08:57 +0000</pubDate>
      <link>https://dev.to/vassiliylakhonin/grant-writing-does-not-need-another-chatbot-it-needs-infrastructure-36le</link>
      <guid>https://dev.to/vassiliylakhonin/grant-writing-does-not-need-another-chatbot-it-needs-infrastructure-36le</guid>
      <description>&lt;p&gt;Most AI tools for grants start with the same assumption:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;The hard part is writing the proposal.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;I think that is only partly true.&lt;/p&gt;

&lt;p&gt;For serious NGO, donor, and implementer workflows, the harder problem is not producing text. It is controlling the proposal workflow around that text:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Is the proposal aligned with donor requirements?&lt;/li&gt;
&lt;li&gt;Are required sections missing?&lt;/li&gt;
&lt;li&gt;Did a human review the risky parts?&lt;/li&gt;
&lt;li&gt;Are citations and grounding signals traceable?&lt;/li&gt;
&lt;li&gt;Can an agent safely start, pause, resume, and export work?&lt;/li&gt;
&lt;li&gt;Can the system explain why a proposal is ready — or not ready — for submission?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is why I built &lt;strong&gt;GrantFlow&lt;/strong&gt;.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;GrantFlow is an agent-native API for governed grant-proposal workflows.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;It is not a grant-writing chatbot.&lt;/p&gt;

&lt;p&gt;It is infrastructure for AI agents and workflow systems that need donor-aware drafting, preflight checks, human-in-the-loop review, grounding checks, audit trails, and export-ready evidence packs.&lt;/p&gt;

&lt;p&gt;GitHub: &lt;a href="https://github.com/vassiliylakhonin/grantflow" rel="noopener noreferrer"&gt;vassiliylakhonin/grantflow&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  The problem: grant workflows are not just writing workflows
&lt;/h2&gt;

&lt;p&gt;A grant proposal is not a blog post.&lt;/p&gt;

&lt;p&gt;It has structure, funder logic, compliance expectations, review loops, evidence requirements, attachments, approvals, and formatting constraints.&lt;/p&gt;

&lt;p&gt;For recurring donor cycles, teams often need to manage:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;EU logframes;&lt;/li&gt;
&lt;li&gt;USAID-style results frameworks;&lt;/li&gt;
&lt;li&gt;FCDO logframes;&lt;/li&gt;
&lt;li&gt;World Bank / IFC results chains;&lt;/li&gt;
&lt;li&gt;MEL plans;&lt;/li&gt;
&lt;li&gt;indicators;&lt;/li&gt;
&lt;li&gt;budgets;&lt;/li&gt;
&lt;li&gt;risk sections;&lt;/li&gt;
&lt;li&gt;safeguarding language;&lt;/li&gt;
&lt;li&gt;citation and evidence packs;&lt;/li&gt;
&lt;li&gt;internal review comments;&lt;/li&gt;
&lt;li&gt;final DOCX / XLSX export requirements.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;A chatbot can help generate text.&lt;/p&gt;

&lt;p&gt;But grant operations need something more controlled:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;A governed workflow layer that agents can call safely.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;That is the idea behind GrantFlow.&lt;/p&gt;

&lt;h2&gt;
  
  
  What GrantFlow does
&lt;/h2&gt;

&lt;p&gt;GrantFlow gives AI agents and workflow systems an API layer for proposal operations.&lt;/p&gt;

&lt;p&gt;It supports the core lifecycle:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Discover agent capabilities and tools.&lt;/li&gt;
&lt;li&gt;Register or onboard an agent identity.&lt;/li&gt;
&lt;li&gt;Run donor/readiness preflight checks.&lt;/li&gt;
&lt;li&gt;Start generation with an idempotency key.&lt;/li&gt;
&lt;li&gt;Pause for human review where needed.&lt;/li&gt;
&lt;li&gt;Inspect status, quality, citations, grounding, and lifecycle events.&lt;/li&gt;
&lt;li&gt;Export reviewable deliverables and evidence packs.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;The point is not to let an AI agent blindly submit grant proposals.&lt;/p&gt;

&lt;p&gt;The point is to keep agent-driven proposal work inside reviewable, auditable boundaries.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why “agent-native” matters
&lt;/h2&gt;

&lt;p&gt;A lot of software assumes a human is clicking through a dashboard.&lt;/p&gt;

&lt;p&gt;GrantFlow assumes that the next proposal operator may be an AI agent.&lt;/p&gt;

&lt;p&gt;That agent still needs operational controls:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;discovery;&lt;/li&gt;
&lt;li&gt;typed contracts;&lt;/li&gt;
&lt;li&gt;authentication;&lt;/li&gt;
&lt;li&gt;scopes;&lt;/li&gt;
&lt;li&gt;idempotency;&lt;/li&gt;
&lt;li&gt;preflight gates;&lt;/li&gt;
&lt;li&gt;human-in-the-loop checkpoints;&lt;/li&gt;
&lt;li&gt;structured errors;&lt;/li&gt;
&lt;li&gt;audit events;&lt;/li&gt;
&lt;li&gt;deterministic smoke tests;&lt;/li&gt;
&lt;li&gt;export contracts.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;GrantFlow exposes those controls as API surfaces rather than hiding them inside a chat UI.&lt;/p&gt;

&lt;p&gt;That is the difference between a chatbot and infrastructure.&lt;/p&gt;

&lt;h2&gt;
  
  
  Not a chatbot. Infrastructure.
&lt;/h2&gt;

&lt;p&gt;A chatbot asks:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“What proposal should I write?”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;GrantFlow asks:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“Can this proposal workflow be safely started, reviewed, grounded, exported, and audited?”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;That difference changes the product shape.&lt;/p&gt;

&lt;p&gt;GrantFlow includes:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;HTTP API surfaces;&lt;/li&gt;
&lt;li&gt;MCP-style stdio tool server;&lt;/li&gt;
&lt;li&gt;agent discovery endpoints;&lt;/li&gt;
&lt;li&gt;sandbox agent registration;&lt;/li&gt;
&lt;li&gt;self-serve onboarding;&lt;/li&gt;
&lt;li&gt;OAuth client-credentials flow;&lt;/li&gt;
&lt;li&gt;credential introspection;&lt;/li&gt;
&lt;li&gt;credential rotation and revocation;&lt;/li&gt;
&lt;li&gt;human-in-the-loop approval checkpoints;&lt;/li&gt;
&lt;li&gt;quality and trust surfaces;&lt;/li&gt;
&lt;li&gt;export payloads;&lt;/li&gt;
&lt;li&gt;donor-aware requirements;&lt;/li&gt;
&lt;li&gt;DOCX / XLSX / ZIP evidence-pack export paths.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is designed for agent systems, not for one-off prompt sessions.&lt;/p&gt;

&lt;h2&gt;
  
  
  Example: a proposal agent should not just write
&lt;/h2&gt;

&lt;p&gt;Imagine an NGO team is preparing a recurring USAID, EU, or FCDO proposal.&lt;/p&gt;

&lt;p&gt;A naive agent workflow might look like this:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;User uploads notes -&amp;gt; AI writes proposal -&amp;gt; team edits manually
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;That can save time, but it does not solve the operational problem.&lt;/p&gt;

&lt;p&gt;A governed workflow should look more like this:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Discover donor requirements
-&amp;gt; run bid/no-bid or readiness preflight
-&amp;gt; ingest source material
-&amp;gt; generate controlled proposal sections
-&amp;gt; pause for human review
-&amp;gt; inspect grounding and quality signals
-&amp;gt; resolve high-severity findings
-&amp;gt; export DOCX/XLSX/evidence pack
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;GrantFlow is built around that second pattern.&lt;/p&gt;

&lt;h2&gt;
  
  
  Donor-aware proposal operations
&lt;/h2&gt;

&lt;p&gt;GrantFlow supports donor-aware structures for major grant and development-finance workflows.&lt;/p&gt;

&lt;p&gt;Examples include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;USAID;&lt;/li&gt;
&lt;li&gt;EU / INTPA;&lt;/li&gt;
&lt;li&gt;World Bank / IFC;&lt;/li&gt;
&lt;li&gt;GIZ;&lt;/li&gt;
&lt;li&gt;U.S. State Department;&lt;/li&gt;
&lt;li&gt;FCDO;&lt;/li&gt;
&lt;li&gt;AFD;&lt;/li&gt;
&lt;li&gt;JICA;&lt;/li&gt;
&lt;li&gt;ADB;&lt;/li&gt;
&lt;li&gt;and a broader 40+ donor catalog through a generic donor strategy.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Each donor path can define its own structure, such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;table of contents schema;&lt;/li&gt;
&lt;li&gt;MEL schema;&lt;/li&gt;
&lt;li&gt;role-specific prompts;&lt;/li&gt;
&lt;li&gt;RAG namespace;&lt;/li&gt;
&lt;li&gt;submission requirements;&lt;/li&gt;
&lt;li&gt;required DOCX sections;&lt;/li&gt;
&lt;li&gt;required XLSX sheets.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This matters because donor workflows are not interchangeable.&lt;/p&gt;

&lt;p&gt;A generic proposal draft is often not enough. The output has to match the funder’s expected structure.&lt;/p&gt;

&lt;h2&gt;
  
  
  Human-in-the-loop by design
&lt;/h2&gt;

&lt;p&gt;GrantFlow keeps human review in the workflow instead of treating it as an afterthought.&lt;/p&gt;

&lt;p&gt;It includes HITL checkpoints for stages such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;architect;&lt;/li&gt;
&lt;li&gt;table of contents;&lt;/li&gt;
&lt;li&gt;MEL;&lt;/li&gt;
&lt;li&gt;logframe.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It also tracks:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;critic findings;&lt;/li&gt;
&lt;li&gt;review comments;&lt;/li&gt;
&lt;li&gt;lifecycle status;&lt;/li&gt;
&lt;li&gt;readiness warnings;&lt;/li&gt;
&lt;li&gt;audit-friendly job events;&lt;/li&gt;
&lt;li&gt;traceability endpoints.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is important because “AI-assisted” should not mean “unreviewed.”&lt;/p&gt;

&lt;p&gt;The goal is controlled acceleration, not uncontrolled automation.&lt;/p&gt;

&lt;h2&gt;
  
  
  Trust report before export
&lt;/h2&gt;

&lt;p&gt;Before export, GrantFlow exposes a quality surface with a trust summary.&lt;/p&gt;

&lt;p&gt;The trust summary can return verdicts such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;code&gt;pass&lt;/code&gt;;&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;conditional&lt;/code&gt;;&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;fail&lt;/code&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;A conditional or failed verdict means at least one gate is not cleared.&lt;/p&gt;

&lt;p&gt;This allows an agent or reviewer to know whether the workflow can proceed to export, or whether more review is needed.&lt;/p&gt;

&lt;p&gt;That is a useful primitive for agent systems.&lt;/p&gt;

&lt;p&gt;Agents should not only generate content.&lt;/p&gt;

&lt;p&gt;They should know when not to proceed.&lt;/p&gt;

&lt;h2&gt;
  
  
  AI-use disclosure
&lt;/h2&gt;

&lt;p&gt;Funders are increasingly paying attention to AI use in proposal workflows.&lt;/p&gt;

&lt;p&gt;GrantFlow includes an AI-use disclosure endpoint that creates a machine-readable record from what the job already recorded, including:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;generation mode;&lt;/li&gt;
&lt;li&gt;models;&lt;/li&gt;
&lt;li&gt;grounding mode;&lt;/li&gt;
&lt;li&gt;trust signals;&lt;/li&gt;
&lt;li&gt;human review state;&lt;/li&gt;
&lt;li&gt;boundaries;&lt;/li&gt;
&lt;li&gt;a human-readable paragraph.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is not a compliance certification.&lt;/p&gt;

&lt;p&gt;It is a transparency record.&lt;/p&gt;

&lt;p&gt;That distinction matters.&lt;/p&gt;

&lt;h2&gt;
  
  
  Quick start
&lt;/h2&gt;

&lt;p&gt;GrantFlow includes a hosted deterministic demo and a local development path.&lt;/p&gt;

&lt;p&gt;Hosted demo:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;curl https://vassilbek-grantflow.hf.space/demo/run | python3 &lt;span class="nt"&gt;-m&lt;/span&gt; json.tool
curl https://vassilbek-grantflow.hf.space/donors  | python3 &lt;span class="nt"&gt;-m&lt;/span&gt; json.tool
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Run locally:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="nb"&gt;cp&lt;/span&gt; .env.example .env
make bootstrap-dev
&lt;span class="nb"&gt;source&lt;/span&gt; .venv/bin/activate
uvicorn grantflow.api.app:app &lt;span class="nt"&gt;--reload&lt;/span&gt;

curl http://127.0.0.1:8000/demo/run | python3 &lt;span class="nt"&gt;-m&lt;/span&gt; json.tool
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;For agent runtimes that prefer stdio tools, GrantFlow also exposes an MCP-style tool server.&lt;/p&gt;

&lt;h2&gt;
  
  
  MCP-style tool server
&lt;/h2&gt;

&lt;p&gt;GrantFlow includes MCP-style tooling for runtimes that use &lt;code&gt;tools/list&lt;/code&gt; and &lt;code&gt;tools/call&lt;/code&gt;.&lt;/p&gt;

&lt;p&gt;The tool surface includes operations for:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;onboarding an agent;&lt;/li&gt;
&lt;li&gt;creating a session;&lt;/li&gt;
&lt;li&gt;introspecting credentials;&lt;/li&gt;
&lt;li&gt;exchanging OAuth tokens;&lt;/li&gt;
&lt;li&gt;rotating and revoking credentials;&lt;/li&gt;
&lt;li&gt;registering a sandbox agent;&lt;/li&gt;
&lt;li&gt;ingesting text;&lt;/li&gt;
&lt;li&gt;running preflight;&lt;/li&gt;
&lt;li&gt;starting generation;&lt;/li&gt;
&lt;li&gt;checking status;&lt;/li&gt;
&lt;li&gt;checking quality;&lt;/li&gt;
&lt;li&gt;reading lifecycle events;&lt;/li&gt;
&lt;li&gt;approving HITL checkpoints;&lt;/li&gt;
&lt;li&gt;listing pending review items;&lt;/li&gt;
&lt;li&gt;getting export payloads;&lt;/li&gt;
&lt;li&gt;running a sandbox happy path.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This makes GrantFlow useful not only as a grant workflow backend, but also as an agent-integration experiment.&lt;/p&gt;

&lt;h2&gt;
  
  
  What this is not
&lt;/h2&gt;

&lt;p&gt;GrantFlow is intentionally bounded.&lt;/p&gt;

&lt;p&gt;It is not:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;legal advice;&lt;/li&gt;
&lt;li&gt;compliance advice;&lt;/li&gt;
&lt;li&gt;financial advice;&lt;/li&gt;
&lt;li&gt;grant-eligibility advice;&lt;/li&gt;
&lt;li&gt;a factuality verifier;&lt;/li&gt;
&lt;li&gt;a donor portal automation bot;&lt;/li&gt;
&lt;li&gt;a replacement for human review;&lt;/li&gt;
&lt;li&gt;a one-off chatbot UI for end users.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It enforces evidence structure and grounding signals.&lt;/p&gt;

&lt;p&gt;It does not prove that every claim is true.&lt;/p&gt;

&lt;p&gt;A human must review before submission.&lt;/p&gt;

&lt;p&gt;That boundary is not a weakness. It is part of the design.&lt;/p&gt;

&lt;p&gt;Governed proposal infrastructure should make its limits explicit.&lt;/p&gt;

&lt;h2&gt;
  
  
  Current maturity
&lt;/h2&gt;

&lt;p&gt;GrantFlow is still early.&lt;/p&gt;

&lt;p&gt;There are no customer pilots yet.&lt;/p&gt;

&lt;p&gt;The benchmark numbers in the repository are illustrative demo baselines, not measured customer results.&lt;/p&gt;

&lt;p&gt;The donor paths most built out today are:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;EU;&lt;/li&gt;
&lt;li&gt;FCDO;&lt;/li&gt;
&lt;li&gt;USAID, depending on use case and operating constraints.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is important to say clearly.&lt;/p&gt;

&lt;p&gt;This is open-source infrastructure looking for real-world validation, not a finished enterprise product with proven ROI claims.&lt;/p&gt;

&lt;h2&gt;
  
  
  Who should look at this?
&lt;/h2&gt;

&lt;p&gt;GrantFlow may be interesting if you are working on:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;AI agents;&lt;/li&gt;
&lt;li&gt;MCP servers;&lt;/li&gt;
&lt;li&gt;nonprofit technology;&lt;/li&gt;
&lt;li&gt;grant management;&lt;/li&gt;
&lt;li&gt;proposal operations;&lt;/li&gt;
&lt;li&gt;human-in-the-loop systems;&lt;/li&gt;
&lt;li&gt;AI governance;&lt;/li&gt;
&lt;li&gt;document generation;&lt;/li&gt;
&lt;li&gt;donor compliance workflows;&lt;/li&gt;
&lt;li&gt;workflow automation;&lt;/li&gt;
&lt;li&gt;RAG-backed drafting;&lt;/li&gt;
&lt;li&gt;auditable AI systems.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It is especially relevant if you are asking:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;How do we let agents help with proposal work without removing governance, review, and traceability?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  What to inspect first
&lt;/h2&gt;

&lt;p&gt;If you open the repository, I suggest starting with:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;The README&lt;/strong&gt;&lt;br&gt;
It explains the core workflow and boundaries.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;The donor catalog&lt;/strong&gt;&lt;br&gt;
This shows how funder-specific requirements are represented.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;The MCP server&lt;/strong&gt;&lt;br&gt;
This is useful if you are building agent runtimes or tool integrations.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;The trust report surface&lt;/strong&gt;&lt;br&gt;
This shows how export-readiness is communicated.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;The HITL flow&lt;/strong&gt;&lt;br&gt;
This is where the project becomes more than text generation.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  The bigger idea
&lt;/h2&gt;

&lt;p&gt;The future of AI in proposal workflows should not be:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“Let the model write everything.”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;It should be:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“Let agents accelerate the workflow, while the system preserves structure, review, grounding, traceability, and export controls.”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;That is the layer GrantFlow is exploring.&lt;/p&gt;

&lt;p&gt;Grant writing does not need another chatbot.&lt;/p&gt;

&lt;p&gt;It needs infrastructure that agents can call safely, operators can audit, and reviewers can trust.&lt;/p&gt;

&lt;p&gt;GitHub: &lt;a href="https://github.com/vassiliylakhonin/grantflow" rel="noopener noreferrer"&gt;vassiliylakhonin/grantflow&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;If this direction is interesting, I would appreciate your reactions, issue, critique, or architecture review.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>python</category>
      <category>mcp</category>
    </item>
    <item>
      <title>Stop Asking AI for Answers. Start Asking If the Evidence Is Ready.</title>
      <dc:creator>Vasili</dc:creator>
      <pubDate>Mon, 29 Jun 2026 09:02:02 +0000</pubDate>
      <link>https://dev.to/vassiliylakhonin/stop-asking-ai-for-answers-start-asking-if-the-evidence-is-ready-3foo</link>
      <guid>https://dev.to/vassiliylakhonin/stop-asking-ai-for-answers-start-asking-if-the-evidence-is-ready-3foo</guid>
      <description>&lt;p&gt;Most AI agents are optimized to produce an answer.&lt;/p&gt;

&lt;p&gt;But in serious workflows, the answer is not the hard part.&lt;/p&gt;

&lt;p&gt;The hard part is knowing whether that answer is supported well enough for a human to trust it, act on it, or escalate it.&lt;/p&gt;

&lt;p&gt;That is the problem I am working on with &lt;strong&gt;Agenda Intelligence MD&lt;/strong&gt;:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;An evidence-readiness and trust-routing runtime for high-stakes AI-assisted decisions.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;GitHub: &lt;a href="https://github.com/vassiliylakhonin/agenda-intelligence-md" rel="noopener noreferrer"&gt;vassiliylakhonin/agenda-intelligence-md&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  The problem: AI can summarize before it can be trusted
&lt;/h2&gt;

&lt;p&gt;Summarization is useful.&lt;/p&gt;

&lt;p&gt;But many real-world decisions are not blocked by the lack of a summary. They are blocked by uncertainty:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Which claims are actually supported?&lt;/li&gt;
&lt;li&gt;Which claims are weak?&lt;/li&gt;
&lt;li&gt;Which source categories are missing?&lt;/li&gt;
&lt;li&gt;Who needs to act next?&lt;/li&gt;
&lt;li&gt;Is this file ready for review?&lt;/li&gt;
&lt;li&gt;Should this be escalated before a decision is made?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This matters in workflows like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;vendor evidence review;&lt;/li&gt;
&lt;li&gt;RFP and procurement analysis;&lt;/li&gt;
&lt;li&gt;AI vendor due diligence;&lt;/li&gt;
&lt;li&gt;strategic infrastructure project rooms;&lt;/li&gt;
&lt;li&gt;market-entry readiness;&lt;/li&gt;
&lt;li&gt;sanctions-adjacent exposure triage;&lt;/li&gt;
&lt;li&gt;corridor, maritime, and counterparty risk files.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In those settings, a polished AI-generated memo can be dangerous if it hides evidence gaps.&lt;/p&gt;

&lt;p&gt;Agenda Intelligence MD is built around a different idea:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;The next layer of agent infrastructure is not better summarization. It is knowing when an AI-generated brief is not ready to be trusted.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  What Agenda Intelligence MD does
&lt;/h2&gt;

&lt;p&gt;Agenda Intelligence MD turns messy input packs into structured human-review packets.&lt;/p&gt;

&lt;p&gt;The inputs can be things like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;RFP responses;&lt;/li&gt;
&lt;li&gt;vendor claims;&lt;/li&gt;
&lt;li&gt;source packs;&lt;/li&gt;
&lt;li&gt;risk files;&lt;/li&gt;
&lt;li&gt;model cards;&lt;/li&gt;
&lt;li&gt;project notes;&lt;/li&gt;
&lt;li&gt;weekly status updates;&lt;/li&gt;
&lt;li&gt;public documentation;&lt;/li&gt;
&lt;li&gt;analyst-style briefs.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The output is not just a summary.&lt;/p&gt;

&lt;p&gt;It is a structured review layer that surfaces:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;supported claims;&lt;/li&gt;
&lt;li&gt;weak or under-sourced claims;&lt;/li&gt;
&lt;li&gt;missing evidence categories;&lt;/li&gt;
&lt;li&gt;source coverage diagnostics;&lt;/li&gt;
&lt;li&gt;owner actions;&lt;/li&gt;
&lt;li&gt;decision-readiness routing;&lt;/li&gt;
&lt;li&gt;escalation signals;&lt;/li&gt;
&lt;li&gt;heuristic scoring.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The goal is not to replace human judgment.&lt;/p&gt;

&lt;p&gt;The goal is to make the review surface clearer before a human makes a decision.&lt;/p&gt;

&lt;h2&gt;
  
  
  What makes it different from a normal AI summarizer?
&lt;/h2&gt;

&lt;p&gt;A normal summarizer asks:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“What does this document say?”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Agenda Intelligence MD asks:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“Is this document ready to support a decision?”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;That distinction changes the architecture.&lt;/p&gt;

&lt;p&gt;Instead of treating the AI output as the final deliverable, the project treats it as something that must pass through a readiness layer.&lt;/p&gt;

&lt;p&gt;For example, a vendor might claim that their AI product is safe for regulated enterprise use.&lt;/p&gt;

&lt;p&gt;A summarizer can compress that claim into a nice paragraph.&lt;/p&gt;

&lt;p&gt;Agenda Intelligence MD is designed to ask a more useful set of questions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Is the claim linked to evidence?&lt;/li&gt;
&lt;li&gt;Is the evidence first-party, third-party, stale, missing, or incomplete?&lt;/li&gt;
&lt;li&gt;Are there standards, audit artifacts, security documents, or governance materials missing?&lt;/li&gt;
&lt;li&gt;Does this need a procurement owner, legal reviewer, technical reviewer, or compliance escalation?&lt;/li&gt;
&lt;li&gt;Is the brief ready for a decision, or only ready for more questions?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is the difference between generating text and routing trust.&lt;/p&gt;

&lt;h2&gt;
  
  
  Architecture
&lt;/h2&gt;

&lt;p&gt;The project is implemented as a Python package with multiple delivery surfaces around one core service layer.&lt;/p&gt;

&lt;p&gt;It includes:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;a CLI;&lt;/li&gt;
&lt;li&gt;an MCP stdio server;&lt;/li&gt;
&lt;li&gt;an HTTP API shell;&lt;/li&gt;
&lt;li&gt;an A2A adapter;&lt;/li&gt;
&lt;li&gt;JSON schemas;&lt;/li&gt;
&lt;li&gt;validators;&lt;/li&gt;
&lt;li&gt;evidence audit;&lt;/li&gt;
&lt;li&gt;source coverage diagnostics;&lt;/li&gt;
&lt;li&gt;heuristic scoring;&lt;/li&gt;
&lt;li&gt;vertical worker profiles.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This makes it usable in several different modes.&lt;/p&gt;

&lt;p&gt;You can inspect it locally through the CLI.&lt;/p&gt;

&lt;p&gt;You can integrate it into an agent workflow through MCP.&lt;/p&gt;

&lt;p&gt;You can expose structured behavior over HTTP.&lt;/p&gt;

&lt;p&gt;You can experiment with A2A-style agent routing.&lt;/p&gt;

&lt;p&gt;The interesting part is not just that these interfaces exist. It is that they point toward the same product idea: evidence-readiness should be a reusable layer, not a one-off prompt.&lt;/p&gt;

&lt;h2&gt;
  
  
  Quick start
&lt;/h2&gt;

&lt;p&gt;After installing the package, the basic local flow looks like this:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;pip &lt;span class="nb"&gt;install &lt;/span&gt;agenda-intelligence-md

agenda-intelligence doctor
agenda-intelligence validate-brief examples/agenda-brief.json
agenda-intelligence score examples/agenda-brief.json &lt;span class="nt"&gt;--evidence&lt;/span&gt; examples/source/evidence-pack.json
agenda-intelligence weekly-delta examples/strategic-infrastructure-bankability/status.synthetic.md
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The commands are designed to answer practical questions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Is the package installed correctly?&lt;/li&gt;
&lt;li&gt;Does this brief match the schema?&lt;/li&gt;
&lt;li&gt;How strong is the structure / evidence / decision-readiness?&lt;/li&gt;
&lt;li&gt;What changed in a weekly status update?&lt;/li&gt;
&lt;li&gt;Which claims are unsafe to repeat?&lt;/li&gt;
&lt;li&gt;What evidence is still missing?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That last question is the most important one.&lt;/p&gt;

&lt;p&gt;Because in real decision workflows, “what is missing?” is often more valuable than “what is the answer?”&lt;/p&gt;

&lt;h2&gt;
  
  
  Example: AI vendor evidence-readiness
&lt;/h2&gt;

&lt;p&gt;One of the current discovery wedges for the project is AI vendor evidence-readiness for regulated procurement.&lt;/p&gt;

&lt;p&gt;Imagine a buyer reviewing an AI vendor for an enterprise or regulated environment.&lt;/p&gt;

&lt;p&gt;The buyer has:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;an RFP;&lt;/li&gt;
&lt;li&gt;vendor claims;&lt;/li&gt;
&lt;li&gt;public documentation;&lt;/li&gt;
&lt;li&gt;security pages;&lt;/li&gt;
&lt;li&gt;model cards;&lt;/li&gt;
&lt;li&gt;standards references;&lt;/li&gt;
&lt;li&gt;maybe some missing or vague materials.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;A normal AI assistant can summarize the vendor.&lt;/p&gt;

&lt;p&gt;But a buyer does not only need a summary.&lt;/p&gt;

&lt;p&gt;They need a review packet:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What claims are supported?&lt;/li&gt;
&lt;li&gt;Which claims are marketing language?&lt;/li&gt;
&lt;li&gt;Which security or governance documents are missing?&lt;/li&gt;
&lt;li&gt;Which buyer questions remain unanswered?&lt;/li&gt;
&lt;li&gt;What should be escalated before approval?&lt;/li&gt;
&lt;li&gt;What can be reviewed now, and what cannot?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is the kind of workflow Agenda Intelligence MD is designed to support.&lt;/p&gt;

&lt;p&gt;It is not trying to be the decision-maker.&lt;/p&gt;

&lt;p&gt;It is trying to prepare the decision surface.&lt;/p&gt;

&lt;h2&gt;
  
  
  Vertical profiles
&lt;/h2&gt;

&lt;p&gt;The repository also includes vertical profiles and demo surfaces for several high-stakes workflows, including:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Middle Corridor Deal Risk Gate;&lt;/li&gt;
&lt;li&gt;CIS Secondary-Sanctions Exposure;&lt;/li&gt;
&lt;li&gt;Agentic Interaction Trust Gate;&lt;/li&gt;
&lt;li&gt;Gulf Maritime Exposure Gate;&lt;/li&gt;
&lt;li&gt;Kazakhstan Market-Entry Readiness Gate.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These are not generic chatbot personalities.&lt;/p&gt;

&lt;p&gt;They are structured reasoning surfaces for evidence-heavy review workflows.&lt;/p&gt;

&lt;p&gt;The pattern is:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;input pack -&amp;gt; structured review packet -&amp;gt; evidence gaps -&amp;gt; owner actions -&amp;gt; decision-readiness route
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;That pattern is useful because many high-stakes workflows fail in the handoff between AI output and human responsibility.&lt;/p&gt;

&lt;p&gt;Agenda Intelligence MD focuses on that handoff.&lt;/p&gt;

&lt;h2&gt;
  
  
  What this is not
&lt;/h2&gt;

&lt;p&gt;This project is intentionally bounded.&lt;/p&gt;

&lt;p&gt;It is not:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;a factuality verifier;&lt;/li&gt;
&lt;li&gt;a legal advisor;&lt;/li&gt;
&lt;li&gt;a compliance approval engine;&lt;/li&gt;
&lt;li&gt;a sanctions determination tool;&lt;/li&gt;
&lt;li&gt;a financial or investment advisor;&lt;/li&gt;
&lt;li&gt;an autonomous decision-maker;&lt;/li&gt;
&lt;li&gt;a replacement for analyst review.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The scoring is heuristic.&lt;/p&gt;

&lt;p&gt;It evaluates structure, source coverage, evidence labeling, and decision-readiness signals.&lt;/p&gt;

&lt;p&gt;It does not prove that a claim is true.&lt;/p&gt;

&lt;p&gt;That boundary matters.&lt;/p&gt;

&lt;p&gt;The point is not to say:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“The AI is right.”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The point is to say:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“Here is what the AI-assisted packet can support, here is what it cannot support, and here is where a human needs to review.”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  Why MCP and A2A matter here
&lt;/h2&gt;

&lt;p&gt;MCP and A2A are interesting because they push agent systems toward composable infrastructure.&lt;/p&gt;

&lt;p&gt;But composability also increases risk.&lt;/p&gt;

&lt;p&gt;If agents can call tools, route tasks, and generate structured outputs, then they also need a way to communicate uncertainty, missing evidence, and escalation requirements.&lt;/p&gt;

&lt;p&gt;Otherwise, agent systems become very good at moving unsupported claims through a workflow faster.&lt;/p&gt;

&lt;p&gt;Agenda Intelligence MD is an experiment in making the trust layer explicit.&lt;/p&gt;

&lt;p&gt;Not hidden in a prompt.&lt;/p&gt;

&lt;p&gt;Not buried in a paragraph.&lt;/p&gt;

&lt;p&gt;Not left to the final reviewer to reconstruct manually.&lt;/p&gt;

&lt;p&gt;Instead, the runtime exposes readiness, gaps, and routing as structured outputs.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why I built it
&lt;/h2&gt;

&lt;p&gt;I started from a simple observation:&lt;/p&gt;

&lt;p&gt;A lot of AI work focuses on making outputs more fluent.&lt;/p&gt;

&lt;p&gt;But in serious workflows, fluency is not the bottleneck.&lt;/p&gt;

&lt;p&gt;The bottleneck is whether the output is usable for a decision.&lt;/p&gt;

&lt;p&gt;A beautiful memo with missing evidence is still a weak memo.&lt;/p&gt;

&lt;p&gt;A confident recommendation with unclear source coverage is still risky.&lt;/p&gt;

&lt;p&gt;A summary that does not show what it cannot support is not enough.&lt;/p&gt;

&lt;p&gt;I wanted a system that treats evidence gaps as first-class objects.&lt;/p&gt;

&lt;h2&gt;
  
  
  Who should look at this?
&lt;/h2&gt;

&lt;p&gt;You may find the project interesting if you are working on:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;AI agents;&lt;/li&gt;
&lt;li&gt;MCP servers;&lt;/li&gt;
&lt;li&gt;A2A experiments;&lt;/li&gt;
&lt;li&gt;procurement technology;&lt;/li&gt;
&lt;li&gt;AI governance;&lt;/li&gt;
&lt;li&gt;risk intelligence;&lt;/li&gt;
&lt;li&gt;analyst workflows;&lt;/li&gt;
&lt;li&gt;structured evaluation;&lt;/li&gt;
&lt;li&gt;human-in-the-loop review;&lt;/li&gt;
&lt;li&gt;decision-support systems.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The repo is especially relevant if you are asking:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;How do we make AI-assisted workflows more reviewable before they become more autonomous?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  What to inspect first
&lt;/h2&gt;

&lt;p&gt;If you open the repository, I would suggest looking at four areas:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;The CLI flow&lt;/strong&gt;&lt;br&gt;
Start with the examples and validation commands.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;The schemas&lt;/strong&gt;&lt;br&gt;
The schemas show what the project treats as structured review output.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;The MCP integration&lt;/strong&gt;&lt;br&gt;
This is useful if you are thinking about agent-tool interoperability.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;The vertical profiles&lt;/strong&gt;&lt;br&gt;
These show how the same evidence-readiness pattern can be adapted to different domains.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  The bigger idea
&lt;/h2&gt;

&lt;p&gt;I do not think every AI agent needs to make more decisions.&lt;/p&gt;

&lt;p&gt;I think many AI agents need to become better at saying:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;this is supported;&lt;/li&gt;
&lt;li&gt;this is weak;&lt;/li&gt;
&lt;li&gt;this is missing;&lt;/li&gt;
&lt;li&gt;this needs review;&lt;/li&gt;
&lt;li&gt;this is not ready yet.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is less flashy than autonomous decision-making.&lt;/p&gt;

&lt;p&gt;But it is much closer to what many real organizations need.&lt;/p&gt;

&lt;p&gt;The future of AI infrastructure will not only be about agents that can act.&lt;/p&gt;

&lt;p&gt;It will also be about systems that know when not to act yet.&lt;/p&gt;

&lt;p&gt;That is the layer Agenda Intelligence MD is exploring.&lt;/p&gt;

&lt;p&gt;GitHub: &lt;a href="https://github.com/vassiliylakhonin/agenda-intelligence-md" rel="noopener noreferrer"&gt;vassiliylakhonin/agenda-intelligence-md&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;If this direction is interesting to you, I would appreciate your reactions, issues, critiques, or architecture reviews.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>agents</category>
      <category>mcp</category>
    </item>
  </channel>
</rss>
