<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Laura Hannah</title>
    <description>The latest articles on DEV Community by Laura Hannah (@lchannah).</description>
    <link>https://dev.to/lchannah</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3955349%2F5cfab077-c66f-4ead-9000-20c07d7a37ca.jpeg</url>
      <title>DEV Community: Laura Hannah</title>
      <link>https://dev.to/lchannah</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/lchannah"/>
    <language>en</language>
    <item>
      <title>The Role of QA in the New AI SDLC</title>
      <dc:creator>Laura Hannah</dc:creator>
      <pubDate>Thu, 28 May 2026 00:07:00 +0000</pubDate>
      <link>https://dev.to/lchannah/the-role-of-qa-in-the-new-ai-sdlc-13je</link>
      <guid>https://dev.to/lchannah/the-role-of-qa-in-the-new-ai-sdlc-13je</guid>
      <description>&lt;p&gt;QA’s role in the new AI SDLC is no longer just &lt;strong&gt;“test the finished application.”&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;It is becoming &lt;strong&gt;quality engineering across the entire lifecycle&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Requirements&lt;/li&gt;
&lt;li&gt;Prompts&lt;/li&gt;
&lt;li&gt;Data&lt;/li&gt;
&lt;li&gt;Models&lt;/li&gt;
&lt;li&gt;Generated code&lt;/li&gt;
&lt;li&gt;Automation&lt;/li&gt;
&lt;li&gt;Deployment&lt;/li&gt;
&lt;li&gt;Monitoring&lt;/li&gt;
&lt;li&gt;Governance&lt;/li&gt;
&lt;li&gt;Production feedback&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The big shift is this:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Old SDLC QA:&lt;/strong&gt; Does the software meet the requirements?&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;New AI SDLC QA:&lt;/strong&gt; Can we trust the system, the AI-generated work, the data, the model behavior, and the delivery process — repeatedly, safely, and measurably?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;AI does not eliminate QA.&lt;/p&gt;

&lt;p&gt;It makes strong QA leadership more important.&lt;/p&gt;




&lt;h2&gt;
  
  
  The AI SDLC Quality Loop
&lt;/h2&gt;

&lt;p&gt;For a first pass on dev.to, I would use a simple text diagram rather than Mermaid. It is safer for copy/paste into the dev.to/new editor and avoids renderer surprises.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Business Need / Product Idea
        ↓
Requirements + Risk Definition
        ↓
Spec-Driven Development
        ↓
Prompt / Agent / Workflow Design
        ↓
AI-Assisted Code + Test Generation
        ↓
Human Review + Automated Testing
        ↓
CI/CD Quality Gates
        ↓
Deployment
        ↓
Production Monitoring
        ↓
Feedback, Drift, Incidents, Metrics
        ↺ loops back into Requirements + Risk Definition
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;QA is not sitting at the end of this flow.&lt;/p&gt;

&lt;p&gt;QA influences the entire loop:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;QA / Quality Engineering
        ↳ Requirements
        ↳ Specs
        ↳ Prompts and agents
        ↳ Generated code
        ↳ Automated tests
        ↳ CI/CD quality gates
        ↳ Production monitoring
        ↳ Feedback and improvement
        ↳ Governance and audit evidence
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;






&lt;h2&gt;
  
  
  1. Requirements and Risk Definition
&lt;/h2&gt;

&lt;p&gt;QA should be involved &lt;strong&gt;before code exists&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;For AI-enabled systems, requirements need to include not just functional behavior, but also risk, trust, and guardrails.&lt;/p&gt;

&lt;p&gt;QA helps define:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What &lt;strong&gt;“good”&lt;/strong&gt; output looks like&lt;/li&gt;
&lt;li&gt;What &lt;strong&gt;“bad”&lt;/strong&gt; output looks like&lt;/li&gt;
&lt;li&gt;What the AI must &lt;strong&gt;never&lt;/strong&gt; do&lt;/li&gt;
&lt;li&gt;What needs human approval&lt;/li&gt;
&lt;li&gt;What needs automated validation&lt;/li&gt;
&lt;li&gt;What risks need mitigation&lt;/li&gt;
&lt;li&gt;What security, privacy, compliance, bias, hallucination, and explainability concerns need to be addressed&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is one of the most important changes in the AI SDLC.&lt;/p&gt;

&lt;p&gt;QA cannot wait until the end of the process and then try to test quality into the system. The quality strategy has to start at the beginning.&lt;/p&gt;




&lt;h2&gt;
  
  
  2. Spec-Driven Development and Testable Intent
&lt;/h2&gt;

&lt;p&gt;In an AI SDLC, the specification becomes &lt;strong&gt;more important, not less&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;If AI agents or copilots are generating code, tests, documentation, or workflows, then QA needs to help make the specification precise enough that AI can generate useful output.&lt;/p&gt;

&lt;p&gt;QA should push for:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Clear business rules&lt;/li&gt;
&lt;li&gt;Examples&lt;/li&gt;
&lt;li&gt;Counterexamples&lt;/li&gt;
&lt;li&gt;Edge cases&lt;/li&gt;
&lt;li&gt;Negative scenarios&lt;/li&gt;
&lt;li&gt;Test data assumptions&lt;/li&gt;
&lt;li&gt;Explicit quality gates&lt;/li&gt;
&lt;li&gt;Traceability from requirement to evidence&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;A useful traceability chain looks like this:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Requirement → Prompt/Spec → Generated Code → Tests → Evidence
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This is where QA becomes a &lt;strong&gt;system designer of correctness&lt;/strong&gt;, not just a defect finder.&lt;/p&gt;




&lt;h2&gt;
  
  
  3. Prompt, Agent, and Workflow Validation
&lt;/h2&gt;

&lt;p&gt;Many engineering teams are now using tools like Claude Code, GitHub Copilot, Cursor, ChatGPT, and internal AI agents to generate or modify software artifacts.&lt;/p&gt;

&lt;p&gt;That means QA also needs to help test the prompts, skills, conventions, and workflows themselves.&lt;/p&gt;

&lt;p&gt;QA should validate whether AI workflows:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Produce consistent results&lt;/li&gt;
&lt;li&gt;Follow architecture standards&lt;/li&gt;
&lt;li&gt;Generate useful and maintainable tests&lt;/li&gt;
&lt;li&gt;Avoid hallucinated APIs or false assumptions&lt;/li&gt;
&lt;li&gt;Respect security and data-handling rules&lt;/li&gt;
&lt;li&gt;Handle edge cases&lt;/li&gt;
&lt;li&gt;Fit repository conventions&lt;/li&gt;
&lt;li&gt;Produce code that compiles, runs, and behaves correctly&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;For AI QE Architects, this is a major opportunity.&lt;/p&gt;

&lt;p&gt;A strong QA function can create reusable prompts, skills, conventions, documentation, and evaluation checks so teams generate better software and better tests consistently.&lt;/p&gt;




&lt;h2&gt;
  
  
  4. AI-Assisted Test Generation, But With Review
&lt;/h2&gt;

&lt;p&gt;AI can generate a lot of tests quickly.&lt;/p&gt;

&lt;p&gt;That is useful.&lt;/p&gt;

&lt;p&gt;It is also risky if nobody checks whether those tests are meaningful.&lt;/p&gt;

&lt;p&gt;QA’s role is to make sure AI-generated tests are:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Relevant&lt;/li&gt;
&lt;li&gt;Deterministic where possible&lt;/li&gt;
&lt;li&gt;Maintainable&lt;/li&gt;
&lt;li&gt;Properly scoped&lt;/li&gt;
&lt;li&gt;Not just happy-path coverage&lt;/li&gt;
&lt;li&gt;Connected to real business risk&lt;/li&gt;
&lt;li&gt;Running reliably in CI/CD&lt;/li&gt;
&lt;li&gt;Producing evidence that humans can trust&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The trap is believing this:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;More tests automatically means better quality.&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;It does not.&lt;/p&gt;

&lt;p&gt;QA needs to guard against shallow, duplicated, brittle, or misleading AI-generated tests.&lt;/p&gt;

&lt;p&gt;The goal is not just volume. The goal is useful coverage, meaningful validation, and trustworthy release evidence.&lt;/p&gt;




&lt;h2&gt;
  
  
  5. Data Quality and Model Behavior
&lt;/h2&gt;

&lt;p&gt;For systems using machine learning, large language models, recommendations, classification, scoring, summarization, or prediction, QA now has to care about data and model behavior too.&lt;/p&gt;

&lt;p&gt;That includes:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Test data quality&lt;/li&gt;
&lt;li&gt;Training and evaluation data assumptions&lt;/li&gt;
&lt;li&gt;Bias and representativeness&lt;/li&gt;
&lt;li&gt;Regression sets for model behavior&lt;/li&gt;
&lt;li&gt;Prompt-response evaluation&lt;/li&gt;
&lt;li&gt;Golden datasets&lt;/li&gt;
&lt;li&gt;Drift detection&lt;/li&gt;
&lt;li&gt;Accuracy&lt;/li&gt;
&lt;li&gt;Precision&lt;/li&gt;
&lt;li&gt;Recall&lt;/li&gt;
&lt;li&gt;False positives&lt;/li&gt;
&lt;li&gt;False negatives&lt;/li&gt;
&lt;li&gt;Task-specific scoring&lt;/li&gt;
&lt;li&gt;Human review workflows&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Traditional software tests usually ask whether the code follows deterministic rules.&lt;/p&gt;

&lt;p&gt;AI systems often require a broader question:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Is the behavior acceptable, safe, and reliable across the kinds of real-world inputs the system will receive?&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;That requires evaluation strategy, monitoring, and human judgment.&lt;/p&gt;




&lt;h2&gt;
  
  
  6. CI/CD Quality Gates
&lt;/h2&gt;

&lt;p&gt;QA should help define automated gates that prevent bad AI-generated or AI-enabled changes from reaching production.&lt;/p&gt;

&lt;p&gt;Examples include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Unit tests&lt;/li&gt;
&lt;li&gt;API tests&lt;/li&gt;
&lt;li&gt;UI tests&lt;/li&gt;
&lt;li&gt;Integration tests&lt;/li&gt;
&lt;li&gt;Contract tests&lt;/li&gt;
&lt;li&gt;End-to-end tests&lt;/li&gt;
&lt;li&gt;Static analysis&lt;/li&gt;
&lt;li&gt;Dependency scans&lt;/li&gt;
&lt;li&gt;Security scans&lt;/li&gt;
&lt;li&gt;Prompt evaluation suites&lt;/li&gt;
&lt;li&gt;LLM response regression checks&lt;/li&gt;
&lt;li&gt;Accessibility checks&lt;/li&gt;
&lt;li&gt;Performance checks&lt;/li&gt;
&lt;li&gt;Synthetic production checks&lt;/li&gt;
&lt;li&gt;Test coverage thresholds&lt;/li&gt;
&lt;li&gt;Code review rules for AI-generated code&lt;/li&gt;
&lt;li&gt;Required release evidence before deployment&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The goal is not to slow everyone down.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;The goal is to make fast delivery safe.&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;This is especially important when AI increases the speed at which teams can produce code.&lt;/p&gt;

&lt;p&gt;Faster generation without stronger quality gates simply accelerates risk.&lt;/p&gt;




&lt;h2&gt;
  
  
  7. Production Monitoring and Feedback Loops
&lt;/h2&gt;

&lt;p&gt;AI systems can degrade after release because the world around them changes.&lt;/p&gt;

&lt;p&gt;Things that can change include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Data&lt;/li&gt;
&lt;li&gt;User behavior&lt;/li&gt;
&lt;li&gt;Prompts&lt;/li&gt;
&lt;li&gt;Models&lt;/li&gt;
&lt;li&gt;Third-party APIs&lt;/li&gt;
&lt;li&gt;Business expectations&lt;/li&gt;
&lt;li&gt;Security threats&lt;/li&gt;
&lt;li&gt;Regulatory expectations&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;QA therefore needs to stay involved after release through:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Observability&lt;/li&gt;
&lt;li&gt;Defect trend analysis&lt;/li&gt;
&lt;li&gt;Model and prompt performance monitoring&lt;/li&gt;
&lt;li&gt;Data drift checks&lt;/li&gt;
&lt;li&gt;Behavior drift checks&lt;/li&gt;
&lt;li&gt;User feedback review&lt;/li&gt;
&lt;li&gt;Incident analysis&lt;/li&gt;
&lt;li&gt;Continuous improvement of test suites&lt;/li&gt;
&lt;li&gt;Release quality metrics&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is one of the biggest mindset shifts:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Production becomes part of the test strategy.&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;In the AI SDLC, testing does not stop at deployment.&lt;/p&gt;

&lt;p&gt;Production behavior becomes a source of quality information that feeds back into requirements, specs, tests, prompts, and governance.&lt;/p&gt;




&lt;h2&gt;
  
  
  8. Governance and Auditability
&lt;/h2&gt;

&lt;p&gt;AI creates a new need for evidence.&lt;/p&gt;

&lt;p&gt;QA can own or strongly influence the evidence trail.&lt;/p&gt;

&lt;p&gt;That means documenting:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What was tested&lt;/li&gt;
&lt;li&gt;What model, prompt, or version was used&lt;/li&gt;
&lt;li&gt;What data was used&lt;/li&gt;
&lt;li&gt;What risks were considered&lt;/li&gt;
&lt;li&gt;What human approvals occurred&lt;/li&gt;
&lt;li&gt;What known limitations remain&lt;/li&gt;
&lt;li&gt;What monitoring is in place&lt;/li&gt;
&lt;li&gt;Why the release was considered acceptable&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This matters in regulated environments, but it also matters for any company trying to use AI responsibly.&lt;/p&gt;

&lt;p&gt;Governance is not just paperwork.&lt;/p&gt;

&lt;p&gt;Good governance helps teams prove that they understood the risks, tested the right things, and made informed release decisions.&lt;/p&gt;




&lt;h2&gt;
  
  
  The New QA Title Is Closer to “Quality Architect”
&lt;/h2&gt;

&lt;p&gt;In the AI SDLC, QA becomes less about manual validation at the end and more about designing a trustworthy delivery system.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Area&lt;/th&gt;
&lt;th&gt;QA / QE Responsibility&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Product idea&lt;/td&gt;
&lt;td&gt;Identify quality risks early&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Requirements&lt;/td&gt;
&lt;td&gt;Make requirements testable, measurable, and risk-aware&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Specs&lt;/td&gt;
&lt;td&gt;Add examples, counterexamples, edge cases, and acceptance criteria&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Prompts / agents&lt;/td&gt;
&lt;td&gt;Validate consistency, correctness, guardrails, and failure modes&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Generated code&lt;/td&gt;
&lt;td&gt;Review AI-generated code for correctness, maintainability, and standards&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Test automation&lt;/td&gt;
&lt;td&gt;Generate, review, scale, and govern automated tests&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Data / model quality&lt;/td&gt;
&lt;td&gt;Validate datasets, model behavior, drift, and evaluation metrics&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;CI/CD&lt;/td&gt;
&lt;td&gt;Build quality gates into pipelines&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Deployment&lt;/td&gt;
&lt;td&gt;Require release evidence before production&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Production&lt;/td&gt;
&lt;td&gt;Monitor quality after release&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Governance&lt;/td&gt;
&lt;td&gt;Preserve traceability, audit evidence, approvals, and known limitations&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;




&lt;h2&gt;
  
  
  Traditional QA vs. AI SDLC QA
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Traditional QA
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Requirements → Code → Test → Release
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Traditional QA often enters late and asks:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Does the software meet the requirements?&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h3&gt;
  
  
  AI SDLC QA
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Risk → Spec → Prompt → Generated Code → Test → Gate → Monitor → Improve
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;AI SDLC QA enters early and keeps asking:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;How do we know this is correct, safe, maintainable, observable, and fit for purpose?&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  The Plainspoken Version
&lt;/h2&gt;

&lt;p&gt;QA is becoming the group that answers:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;How do we know this AI-assisted system is correct, safe, maintainable, observable, and fit for purpose?&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;That is a much bigger role than traditional testing.&lt;/p&gt;

&lt;p&gt;It is also a huge opportunity for experienced QA architects, because AI makes weak engineering processes worse and strong engineering processes faster.&lt;/p&gt;

&lt;p&gt;QA’s job is to make sure the organization gets the second outcome, not the first.&lt;/p&gt;




&lt;h2&gt;
  
  
  Bottom Line
&lt;/h2&gt;

&lt;p&gt;In the new AI SDLC, QA is not just testing software.&lt;/p&gt;

&lt;p&gt;QA is helping the organization build systems that are:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Correct&lt;/li&gt;
&lt;li&gt;Safe&lt;/li&gt;
&lt;li&gt;Trustworthy&lt;/li&gt;
&lt;li&gt;Maintainable&lt;/li&gt;
&lt;li&gt;Observable&lt;/li&gt;
&lt;li&gt;Governed&lt;/li&gt;
&lt;li&gt;Measurable&lt;/li&gt;
&lt;li&gt;Ready for production&lt;/li&gt;
&lt;li&gt;Continuously improving&lt;/li&gt;
&lt;/ul&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;AI does not replace QA. AI makes strong QA leadership more important.&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  References
&lt;/h2&gt;

&lt;p&gt;These references are useful for grounding this model of QA in the AI SDLC.&lt;/p&gt;

&lt;h3&gt;
  
  
  NIST AI Risk Management Framework 1.0
&lt;/h3&gt;

&lt;p&gt;NIST provides a practical framework for thinking about AI risk through governance, mapping, measurement, and management.&lt;/p&gt;

&lt;p&gt;Useful for supporting the role of QA in risk definition, measurement, monitoring, governance, and lifecycle accountability.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://nvlpubs.nist.gov/nistpubs/ai/nist.ai.100-1.pdf" rel="noopener noreferrer"&gt;https://nvlpubs.nist.gov/nistpubs/ai/nist.ai.100-1.pdf&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Google Cloud: MLOps Continuous Delivery and Automation Pipelines in Machine Learning
&lt;/h3&gt;

&lt;p&gt;Google Cloud’s MLOps guidance explains why machine learning systems require CI/CD, continuous training, automation, monitoring, and production feedback loops.&lt;/p&gt;

&lt;p&gt;Useful for supporting the idea that AI quality is not a one-time testing event.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://docs.cloud.google.com/architecture/mlops-continuous-delivery-and-automation-pipelines-in-machine-learning" rel="noopener noreferrer"&gt;https://docs.cloud.google.com/architecture/mlops-continuous-delivery-and-automation-pipelines-in-machine-learning&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Google Cloud: Practitioners Guide to MLOps
&lt;/h3&gt;

&lt;p&gt;This guide provides a broader view of operationalizing ML systems, including lifecycle practices, automation, monitoring, and production readiness.&lt;/p&gt;

&lt;p&gt;Useful for grounding QA’s role in end-to-end ML system quality.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://services.google.com/fh/files/misc/practitioners_guide_to_mlops_whitepaper.pdf" rel="noopener noreferrer"&gt;https://services.google.com/fh/files/misc/practitioners_guide_to_mlops_whitepaper.pdf&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Microsoft Responsible AI Standard
&lt;/h3&gt;

&lt;p&gt;Microsoft’s Responsible AI Standard provides concrete requirements for building AI systems responsibly.&lt;/p&gt;

&lt;p&gt;Useful for supporting governance, accountability, transparency, reliability, safety, fairness, privacy, and inclusive design considerations.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://cdn-dynmedia-1.microsoft.com/is/content/microsoftcorp/microsoft/final/en-us/microsoft-brand/documents/Microsoft-Responsible-AI-Standard-General-Requirements.pdf" rel="noopener noreferrer"&gt;https://cdn-dynmedia-1.microsoft.com/is/content/microsoftcorp/microsoft/final/en-us/microsoft-brand/documents/Microsoft-Responsible-AI-Standard-General-Requirements.pdf&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  OWASP Top 10 for LLM Applications
&lt;/h3&gt;

&lt;p&gt;OWASP identifies major security risks for LLM applications, including prompt injection, insecure output handling, training data poisoning, sensitive information disclosure, and supply-chain vulnerabilities.&lt;/p&gt;

&lt;p&gt;Useful for supporting QA involvement in LLM-specific security and quality risks.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://genai.owasp.org/llm-top-10/" rel="noopener noreferrer"&gt;https://genai.owasp.org/llm-top-10/&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  ISO/IEC 42001:2023 AI Management System
&lt;/h3&gt;

&lt;p&gt;ISO/IEC 42001 defines an AI management system standard for organizations that develop, provide, or use AI systems.&lt;/p&gt;

&lt;p&gt;Useful for supporting auditability, governance, accountability, lifecycle management, and continuous improvement.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.iso.org/standard/42001" rel="noopener noreferrer"&gt;https://www.iso.org/standard/42001&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>testing</category>
      <category>codequality</category>
      <category>sdlc</category>
    </item>
  </channel>
</rss>
