Using LLMs for Chemistry Research and Applications

#aiinfrastructure #oxlo #ai

Chemistry research produces a flood of unstructured data, from synthesis procedures buried in PDF supplements to spectral metadata locked in patent filings. Large language models can extract entities, propose novel molecular structures, and even orchestrate multi-step lab protocols, but these workloads place unusual demands on inference infrastructure. Long-context ingestion of full articles, agentic loops that call external property databases, and deterministic JSON outputs for automated pipelining are not optional features. They are baseline requirements. An inference backend that bills by the token punishes exactly the kind of long-document analysis and iterative reasoning that chemistry AI requires. Oxlo.ai offers a developer-first alternative: flat per-request pricing, OpenAI SDK compatibility, and a model catalog that includes deep-reasoning and coding specialists.

Literature and Knowledge Extraction

Full-text articles and patent claims easily exceed tens of thousands of tokens. With token-based billing, a single batch of 50 papers becomes expensive quickly. Oxlo.ai charges one flat cost per API request regardless of prompt length, so feeding an entire article or a concatenated set of experimental sections is economically predictable. Models such as DeepSeek V4 Flash support up to 1 million tokens of context, letting you drop in a full PDF conversion plus supplementary tables without truncation.

The following example passes a long experimental section to the model and requests structured JSON output.

import openai

client = openai.OpenAI(
    base_url="https://api.oxlo.ai/v1",
    api_key="YOUR_OXLO_API_KEY"
)

paper_text = """
... paste a full experimental section or patent claim here ...
"""

response = client.chat.completions.create(
    model="deepseek-v4-flash",  # 1M context window
    messages=[
        {
            "role": "system",
            "content": (
                "You extract structured reaction data from academic text. "
                "Return valid JSON only."
            )
        },
        {
            "role": "user",
            "content": (
                f"Extract reactants, reagents, solvents, temperature, and yield "
                f"from the following paper:\n\n{paper_text}"
            )
        }
    ],
    response_format={"type": "json_object"},
    stream=False
)

print(response.choices[0].message.content)

Molecular Design and Generative Chemistry

LLMs can be prompted with SMILES, SELFIES, or InChI strings to generate novel molecules with desired properties. The challenge is enforcing chemical validity and domain constraints. Reasoning models like DeepSeek R1 671B MoE on Oxlo.ai can work through validity checks step by step, while code-specialized models such as Qwen 3 Coder 30B can draft RDKit validation scripts on the fly.

prompt = (
    "Generate 5 novel drug-like molecules targeting COX-2 inhibition. "
    "Return each as a SMILES string with a predicted IC50 rationale. "
    "Output strictly as JSON with keys: smiles, rationale."
)

response = client.chat.completions.create(
    model="deepseek-r1-671b",  # reasoning specialist
    messages=[
        {
            "role": "system",
            "content": "You are a computational chemist. Think step by step."
        },
        {
            "role": "user",
            "content": prompt
        }
    ],
    response_format={"type": "json_object"}
)

print(response.choices[0].message.content)

Reaction Prediction and Retrosynthesis

Multi-step synthesis planning is inherently agentic. A model must propose a route, check reagent availability, and possibly revise based on stereochemical constraints. Oxlo.ai supports function calling and tool use, so you can connect an LLM to a search tool or an internal inventory database. Because Oxlo.ai bills per request, an agent that iterates five times to refine a synthesis does not incur five times the token cost of a long context window. It incurs five predictable per-request charges.

tools = [
    {
        "type": "function",
        "function": {
            "name": "search_reagents",
            "description": "Search internal inventory for reagent availability",
            "parameters": {
                "type": "object",
                "properties": {
                    "reagent_name": {"type": "string"}
                },
                "required": ["reagent_name"]
            }
        }
    }
]

response = client.chat.completions.create(
    model="qwen3-32b",  # strong agent workflow performance
    messages=[
        {
            "role": "user",
            "content": (
                "Propose a synthesis for paracetamol and check reagent availability "
                "after each step."
            )
        }
    ],
    tools=tools,
    tool_choice="auto"
)

Lab Automation and Closed-Loop Systems

Modern labs integrate LLMs with robotic platforms and IoT sensors. These systems require low-latency, always-available endpoints. Oxlo.ai has no cold starts on popular models, so a synthesis robot waiting for a go/no-go decision does not hit a warmup delay. Streaming responses let you parse partial outputs and abort expensive physical actions early if the model flags a safety issue.

You can enable streaming on any chat completion request:

stream = client.chat.completions.create(
    model="llama-3.3-70b",
    messages=[
        {
            "role": "user",
            "content": (
                "Evaluate the safety of mixing sodium borohydride with methanol "
                "under argon at 0 C. Respond with a concise risk assessment."
            )
        }
    ],
    stream=True
)

for chunk in stream:
    if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="")

Cheminformatics Scripting

Much of computational chemistry runs on Python libraries like RDKit, Open Babel, and Pymatgen. Instead of hand-writing scripts, researchers can prompt a code model to generate analysis pipelines. Oxlo.ai offers code-optimized models including Oxlo.ai Coder Fast and Qwen 3 Coder 30B.

code_prompt = (
    "Write a Python function using RDKit that takes a list of SMILES, "
    "filters for molecular weight between 200 and 500, and returns a pandas DataFrame."
)

response = client.chat.completions.create(
    model="oxlo.ai-coder-fast",
    messages=[{"role": "user", "content": code_prompt}]
)

script = response.choices[0].message.content
print(script)

Why Inference Economics Matter for Chemistry

Chemistry workloads are long-context by nature. A single patent can be 50,000 tokens. An agentic loop might pass that same context back five times. On token-based platforms, costs scale linearly with input length. Oxlo.ai uses request-based pricing: one flat cost per API request regardless of prompt length. For teams processing full-text literature or running autonomous lab agents, this can reduce costs by an order of magnitude or more compared to token-based billing. See the exact rates at https://oxlo.ai/pricing.

Oxlo.ai is fully OpenAI SDK compatible, so switching existing chemistry pipelines is a base_url change. The platform hosts 45+ models across seven categories, including reasoning, code, vision (for scanning lab notebook photos with Kimi VL A3B or Gemma 3 27B), and embeddings (for vectorizing molecular text representations). No cold starts, JSON mode, and function calling are available on all supported chat models, making Oxlo.ai a pragmatic foundation for chemistry AI.