Abhishek Jaiswal

Posted on Jun 29

Structured Output in LangChain

#ai #python #rag #programming

When I started building LLM applications, one thing became obvious very quickly:

Getting a response from an LLM is easy.

Getting a reliable, predictable, machine-readable response from an LLM is the real challenge.

A chatbot returning:

"Here is your answer..."

is fine for a demo.

But in an enterprise AI system, we usually need something much more strict:

Extract customer information from documents
Generate JSON responses for APIs
Classify support tickets
Extract financial data
Validate AI-generated decisions
Trigger workflows based on AI output

A traditional LLM response is just text. Your application cannot safely depend on random text.

This is where LangChain Structured Output becomes extremely useful.

In this article, we will understand:

What structured output means in LangChain
Why normal LLM responses fail in production
How LangChain structured output works internally
Using Pydantic models
JSON schema based outputs
Structured output with agents
Enterprise-level examples
Production best practices

What is Structured Output in LangChain?

Structured output means forcing an LLM to return data in a predefined format instead of plain text.

For example, instead of:

The customer Babu Rao has an account with premium subscription and his payment failed yesterday.

we want:

{
  "customer_name": "Babu Rao",
  "subscription": "premium",
  "issue": "payment_failed",
  "priority": "high"
}

Now your backend can directly consume this response.

A structured response can be:

JSON
Pydantic object
Typed dictionary
Custom schema

Why Do We Need Structured Output?

Imagine building an AI customer support automation system.

Without structured output:

User message
      |
      v
     LLM
      |
      v
Random text response

Your backend has no guarantee.

The model might return:

The issue seems related to payment. Please contact support.

or:

{
 "category":"billing"
}

or:

Here is the information:
{
 "category":"billing"
}

Every format is different.

Your code breaks.

With structured output:

User message
      |
      v
     LLM
      |
      v
Validated Schema
      |
      v
Backend Workflow

Now your application knows exactly what to expect.

LangChain Structured Output with Pydantic

The most common approach in production is using Pydantic models.

Pydantic gives us:

Type validation
Required fields
Data consistency
Error handling

Install dependencies:

pip install langchain langchain-openai pydantic

Example:

from pydantic import BaseModel, Field
from langchain_openai import ChatOpenAI


class CustomerIssue(BaseModel):

    customer_name: str = Field(
        description="Name of the customer"
    )

    issue_type: str = Field(
        description="Category of customer problem"
    )

    priority: str = Field(
        description="Priority level: low, medium, high"
    )


llm = ChatOpenAI(
    model="gpt-4.1",
    temperature=0
)


structured_llm = llm.with_structured_output(
    CustomerIssue
)


response = structured_llm.invoke(
    """
    Customer Babu Rao reported that his credit card payment
    failed multiple times and he cannot complete checkout.
    """
)


print(response)

Output:

CustomerIssue(
    customer_name="Babu Rao",
    issue_type="payment_failure",
    priority="high"
)

Now instead of handling strings, we work with Python objects.

Understanding with_structured_output()

This line:

structured_llm = llm.with_structured_output(CustomerIssue)

changes the behavior of the model.

Internally LangChain does something like:

Reads your schema
Converts it into a format the model understands
Sends structured output instructions
Receives model response
Validates the response
Returns the parsed object

Basically:

Pydantic Model
        |
        v
JSON Schema
        |
        v
LLM Instructions
        |
        v
Validated Response

Enterprise Example: AI Document Extraction System

A common enterprise use case:

Extract invoice information automatically.

Input document:

Invoice Number: INV-10291

Customer:
ABC Technologies

Amount:
$25,000

Payment Status:
Pending

We want:

{
 "invoice_id":"INV-10291",
 "customer":"ABC Technologies",
 "amount":25000,
 "payment_status":"pending"
}

Implementation:

from pydantic import BaseModel


class Invoice(BaseModel):

    invoice_id: str

    customer_name: str

    amount: float

    payment_status: str



invoice_llm = llm.with_structured_output(
    Invoice
)


result = invoice_llm.invoke(
"""
Extract invoice details:

Invoice Number: INV-10291

Customer:
ABC Technologies

Amount:
25000

Payment Status:
Pending
"""
)


print(result)

Output:

Invoice(
 invoice_id="INV-10291",
 customer_name="ABC Technologies",
 amount=25000,
 payment_status="Pending"
)

Now this output can directly go into:

Database
ERP system
Payment workflow
Analytics pipeline

Structured Output in RAG Applications

RAG systems are one of the biggest enterprise use cases.

Normally:

User Query

      |
      v

Retriever

      |
      v

Documents

      |
      v

LLM

      |
      v

Answer

But enterprise systems often need:

Answer
+
Sources
+
Confidence Score
+
Action

Example:

from pydantic import BaseModel


class RAGResponse(BaseModel):

    answer: str

    confidence: float

    sources: list[str]

    action: str



rag_llm = llm.with_structured_output(
    RAGResponse
)


response = rag_llm.invoke(
"""
Based on company policy documents,
answer:

Can employees work remotely?
"""
)

Output:

{
 "answer":"Employees can work remotely 3 days per week",
 "confidence":0.94,
 "sources":[
   "remote_policy.pdf"
 ],
 "action":"inform_user"
}

This is much easier to integrate into an enterprise application.

Structured Output with LangChain Agents

Agents are powerful but unpredictable.

An agent may:

Call tools
Reason internally
Decide next actions

Structured output helps control agent behavior.

Example:

from pydantic import BaseModel


class AgentDecision(BaseModel):

    next_action: str

    tool_required: bool

    reason: str



agent_llm = llm.with_structured_output(
    AgentDecision
)


decision = agent_llm.invoke(
"""
A customer wants to cancel subscription.
Decide the next action.
"""
)


print(decision)

Output:

{
 "next_action":"billing_agent",
 "tool_required":true,
 "reason":"Cancellation requires account verification"
}

Now your orchestration layer can route requests safely.

Structured Output vs JSON Mode

Many developers confuse these two.

JSON Mode

Example:

llm.invoke(
"Return JSON only"
)

Problem:

The model can still return invalid JSON.

Example:

{
"name":"Babu Rao",
}

Invalid.

Structured Output

With LangChain:

llm.with_structured_output(MySchema)

You get:

Schema validation
Type checking
Better reliability

For production applications, structured output is usually the better choice.

Handling Validation Errors

Production systems need error handling.

Example:

try:

    response = structured_llm.invoke(
        user_input
    )

except Exception as e:

    print(
        "LLM output validation failed",
        e
    )

In real systems, you can:

Retry generation
Ask model to correct output
Log failures
Send to human review

Advanced Pattern: Multiple Output Types

Sometimes AI responses depend on the situation.

Example:

Customer support:

from typing import Union


class RefundRequest(BaseModel):

    order_id:str

    refund_reason:str



class Complaint(BaseModel):

    category:str

    description:str

Your application can route based on the returned schema.

Production Best Practices

1. Keep schemas simple

Bad:

class Response:

    everything_possible:str

Good:

class Response:

    category:str

    confidence:float

Clear schemas produce better outputs.

2. Use descriptions

Instead of:

priority:str

Use:

priority:str = Field(
description="Urgency level: low, medium, high"
)

Descriptions improve model understanding.

3. Use temperature 0 for extraction tasks

For structured extraction:

ChatOpenAI(
temperature=0
)

You want consistency, not creativity.

4. Always validate AI output

Never blindly trust an LLM response.

AI output should go through:

LLM
 |
 v
Validation
 |
 v
Business Rules
 |
 v
Database / API

Real Enterprise Architecture

A production AI application usually looks like:

                 User

                  |

              API Gateway

                  |

             AI Service

                  |

        LangChain Orchestration

                  |

       Structured Output Layer

                  |

        Validation + Business Logic

                  |

        Database / External APIs

Structured output becomes the contract between AI and your application.

Final Thoughts

LLMs are amazing at generating human-like responses, but enterprise software needs reliability.

Structured output is one of the techniques that helps bridge this gap.

With LangChain structured output, you can build AI systems that are:

More predictable
Easier to maintain
Safer for production
Easier to integrate with APIs and databases

The future of enterprise AI is not just generating text.

It is generating structured intelligence that software can trust.

DEV Community

Structured Output in LangChain

What is Structured Output in LangChain?

Why Do We Need Structured Output?

LangChain Structured Output with Pydantic

Understanding with_structured_output()

Enterprise Example: AI Document Extraction System

Structured Output in RAG Applications

Structured Output with LangChain Agents

Structured Output vs JSON Mode

JSON Mode

Structured Output

Handling Validation Errors

Advanced Pattern: Multiple Output Types

Production Best Practices

1. Keep schemas simple

2. Use descriptions

3. Use temperature 0 for extraction tasks

4. Always validate AI output

Real Enterprise Architecture

Final Thoughts

Top comments (0)