Exploring Google Gemini: New Frontiers for AI Integration in C# Applications
When I started working with AI in my C# projects, I faced a simple problem: too many choices, too little clarity. Google's Gemini models brought something new to the table—native support for images, audio, and text in one conversation. But the real question was: how do you actually use this in .NET?
This article walks through what makes Gemini different and shows you how to integrate it into C# applications. We'll cover setup, compare options, and work through real examples.
What Makes Gemini Different?
Google Gemini stands out for two reasons: multimodal capabilities and massive context windows. While coding assistants like GitHub Copilot excel at writing code in your IDE, Gemini shines when you need to process documents, images, and text together.
The models handle up to 1 million tokens of context. That's enough for entire codebases, research papers, or long documents. According to comparisons of AI developer tools, this makes Gemini particularly valuable for document processing and complex reasoning tasks.
Getting Started with C# and Gemini
You have three main paths:
- Provider-agnostic SDKs (like LlmTornado) - Switch between models easily
- Google's official .NET SDK - Direct Google integration
- Custom HTTP client - Full control, more work
I'll focus on the first approach because it offers flexibility without complexity.
Installation
Install the SDK with NuGet:
dotnet add package LlmTornado
Basic Setup
Here's a simple chat with Gemini:
using System;
using System.Threading.Tasks;
using LlmTornado;
using LlmTornado.Chat;
using LlmTornado.Chat.Models;
using LlmTornado.Code;
// Initialize with your Google API key
TornadoApi api = new TornadoApi(
    new ProviderAuthentication(LLmProviders.Google, "YOUR_GOOGLE_API_KEY")
);
// Create a conversation
Conversation chat = api.Chat.CreateConversation(new ChatRequest
{
Model = ChatModel.Google.Gemini.Gemini25Pro,
Messages =
[
new ChatMessage(ChatMessageRoles.User,
"Explain quantum entanglement in simple terms.")
]
});
string response = await chat.GetResponse();
Console.WriteLine(response);
This pattern works across providers. Need to test against OpenAI or Claude? Change the model and API key. Your code stays the same.
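To make that concrete, here's a minimal sketch that registers keys for two providers and runs the same prompt against two Gemini models. The OpenAI entry and the sample prompt are placeholders of my own, included only to show the multi-provider setup; the point is that only the Model value changes between runs:
using System;
using System.Collections.Generic;
using System.Threading.Tasks;
using LlmTornado;
using LlmTornado.Chat;
using LlmTornado.Chat.Models;
using LlmTornado.Code;
// Register one key per provider you want to test against (the OpenAI entry is a placeholder)
TornadoApi api = new TornadoApi(new List<ProviderAuthentication>
{
    new ProviderAuthentication(LLmProviders.Google, "YOUR_GOOGLE_API_KEY"),
    new ProviderAuthentication(LLmProviders.OpenAi, "YOUR_OPENAI_API_KEY")
});
// Same prompt, same calling code; only the model constant changes
foreach (ChatModel model in new[] { ChatModel.Google.Gemini.Gemini25Flash, ChatModel.Google.Gemini.Gemini25Pro })
{
    Conversation chat = api.Chat.CreateConversation(new ChatRequest
    {
        Model = model,
        Messages =
        [
            new ChatMessage(ChatMessageRoles.User, "Summarize dependency injection in one sentence.")
        ]
    });
    Console.WriteLine($"--- {model} ---");
    Console.WriteLine(await chat.GetResponse());
}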
Working with Documents: A Real Example
Gemini's multimodal support becomes useful when processing documents. Here's how to extract data from a PDF with charts:
using System;
using System.Threading.Tasks;
using LlmTornado;
using LlmTornado.Chat;
using LlmTornado.Chat.Models;
using LlmTornado.Code;
using LlmTornado.Files;
TornadoApi api = new TornadoApi(
    new ProviderAuthentication(LLmProviders.Google, "YOUR_API_KEY")
);
// Upload a PDF
HttpCallResult<TornadoFile> uploadedFile = await api.Files.Upload(
"research_paper.pdf",
mimeType: "application/pdf",
provider: LLmProviders.Google
);
// Create conversation
Conversation chat = api.Chat.CreateConversation(new ChatRequest
{
Model = ChatModel.Google.Gemini.Gemini25Flash
});
// Ask about the document
chat.AppendUserInput([
new ChatMessagePart(
new ChatMessagePartFileLinkData(uploadedFile.Data.Uri)
),
new ChatMessagePart(
"Extract key findings from the charts. Return JSON."
)
]);
// Stream the response for better UX
await chat.StreamResponseRich(new ChatStreamEventHandler
{
MessageTokenHandler = (token) =>
{
Console.Write(token);
return ValueTask.CompletedTask;
},
OnFinished = (data) =>
{
Console.WriteLine($"\nTokens used: {data.Usage}");
return ValueTask.CompletedTask;
}
});
Streaming matters for user experience. A response that arrives token by token feels far more responsive than making users stare at a spinner for 30 seconds while the complete output is generated.
Reasoning Models: When to Use Them
Gemini 2.5 Pro includes configurable reasoning budgets: a budget of tokens the model can spend "thinking" before it responds. For complex problems, this makes a real difference:
using System;
using System.Threading.Tasks;
using LlmTornado;
using LlmTornado.Chat;
using LlmTornado.Chat.Models;
using LlmTornado.Code;
TornadoApi api = new TornadoApi(
    new ProviderAuthentication(LLmProviders.Google, "YOUR_API_KEY")
);
Conversation chat = api.Chat.CreateConversation(new ChatRequest
{
Model = ChatModel.Google.Gemini.Gemini25Pro,
ReasoningBudget = 8192 // More thinking time
});
chat.AddUserMessage(
"Design a distributed caching system for e-commerce. " +
"Consider consistency, latency, and failure scenarios."
);
ChatRichResponse response = await chat.GetResponseRich();
Console.WriteLine(response.Text);
The reasoning budget ranges from 128 to 32,768 tokens. Use -1 for automatic allocation. Simple questions don't need high budgets. Complex architectural problems benefit from them.
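One practical pattern is to pick the budget per request. The length threshold below is purely my own heuristic for illustration, not an SDK feature; the idea is that simple prompts fall back to automatic allocation while hard ones get an explicit budget:
using System;
using System.Threading.Tasks;
using LlmTornado;
using LlmTornado.Chat;
using LlmTornado.Chat.Models;
using LlmTornado.Code;
TornadoApi api = new TornadoApi(
    new ProviderAuthentication(LLmProviders.Google, "YOUR_API_KEY")
);
string prompt = "What is the capital of France?";
// Hypothetical heuristic: short factual prompts use automatic allocation (-1),
// long open-ended prompts get an explicit budget. Tune the threshold for your workload.
int budget = prompt.Length < 200 ? -1 : 8192;
Conversation chat = api.Chat.CreateConversation(new ChatRequest
{
    Model = ChatModel.Google.Gemini.Gemini25Pro,
    ReasoningBudget = budget
});
chat.AddUserMessage(prompt);
Console.WriteLine(await chat.GetResponse());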
Code Execution: Eliminating Math Errors
One feature that surprised me was native code execution. The model writes and runs Python code for calculations:
using System;
using System.Threading.Tasks;
using LlmTornado;
using LlmTornado.Chat;
using LlmTornado.Chat.Models;
using LlmTornado.Chat.Vendors.Google;
using LlmTornado.Code;
TornadoApi api = new TornadoApi(
    new ProviderAuthentication(LLmProviders.Google, "YOUR_API_KEY")
);
Conversation chat = api.Chat.CreateConversation(new ChatRequest
{
Model = ChatModel.Google.Gemini.Gemini25Pro,
VendorExtensions = new ChatRequestVendorExtensions(
new ChatRequestVendorGoogleExtensions
{
CodeExecution = ChatRequestVendorGoogleCodeExecution.Inst
}
)
});
chat.AddUserMessage(
"Calculate compound interest on $10,000 at 7% annually " +
"for 30 years, compounded quarterly."
);
ChatRichResponse response = await chat.GetResponseRich();
Console.WriteLine(response.Text);
This removes a class of errors where models incorrectly calculate formulas. For financial or scientific applications, accurate math is critical.
Image Generation: A Bonus Feature
Gemini 2.5 Flash Image generates and edits images conversationally:
using System;
using System.IO;
using System.Linq;
using System.Net.Http;
using System.Threading.Tasks;
using LlmTornado;
using LlmTornado.Chat;
using LlmTornado.Chat.Models;
using LlmTornado.Code;
TornadoApi api = new TornadoApi(
    new ProviderAuthentication(LLmProviders.Google, "YOUR_API_KEY")
);
Conversation chat = api.Chat.CreateConversation(new ChatRequest
{
Model = ChatModel.Google.GeminiPreview.Gemini25FlashImagePreview,
Modalities = [ChatModelModalities.Text, ChatModelModalities.Image]
});
chat.AddUserMessage("Generate a cyberpunk cityscape at sunset");
ChatRichResponse response = await chat.GetResponseRich();
ChatRichResponseBlock? imageBlock = response.Blocks
.FirstOrDefault(x => x.ChatImage != null);
if (imageBlock?.ChatImage != null)
{
using HttpClient client = new HttpClient();
byte[] imageData = await client.GetByteArrayAsync(
imageBlock.ChatImage.Url
);
await File.WriteAllBytesAsync("cityscape.png", imageData);
}
The conversational aspect is useful. You can iterate: "Now add neon signs" or "Make it darker." This feels more natural than traditional text-to-image APIs.
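Continuing the conversation from the snippet above, a follow-up turn might look like the sketch below. It assumes the edited image comes back as another image block, just like the first response:
// Refine the image in the same conversation
chat.AddUserMessage("Now add neon signs to the buildings and make the sky darker.");
ChatRichResponse editResponse = await chat.GetResponseRich();
ChatRichResponseBlock? editedImage = editResponse.Blocks
    .FirstOrDefault(x => x.ChatImage != null);
if (editedImage?.ChatImage != null)
{
    using HttpClient client = new HttpClient();
    byte[] editedData = await client.GetByteArrayAsync(editedImage.ChatImage.Url);
    await File.WriteAllBytesAsync("cityscape_neon.png", editedData);
}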
Choosing Between Options
After using several AI tools in production, here's my decision tree:
Use Gemini when:
- You need multimodal understanding (text, images, documents together)
- Long context is critical (1M+ token windows)
- You want configurable reasoning for complex problems
- You're on Google Cloud infrastructure
Use GitHub Copilot when:
- You need tight IDE integration
- Real-time code completion is your priority
- You work primarily in VS Code or JetBrains IDEs
Use smaller models when:
- Cost per token is your main concern
- Tasks are simple and well-defined
- Response speed matters more than capability
According to comprehensive AI coding tool comparisons, the best choice depends on your workflow. Copilot integrates into your editor. Gemini excels at document processing and reasoning.
Comparing SDK Approaches
I've used three approaches to integrate Gemini:
| Approach | Best For | Tradeoff |
|---|---|---|
| LlmTornado SDK | Multi-provider apps, unified API | Extra dependency |
| Google .NET SDK | Google-only apps | Vendor lock-in |
| Custom HTTP | Minimal deps, full control | Maintenance work |
The LlmTornado approach lets me A/B test Gemini against Claude or GPT-4 without changing application logic. Google's official .NET SDK works well if you're committed to Google services.
Production Tips
After running Gemini in production for months, here's what helped:
- Implement retry logic - Google's API occasionally returns 503 errors under load (see the sketch after this list)
- Monitor reasoning tokens - They're billed separately and can surprise you
- Use Flash models for volume - Pro is powerful but expensive at scale
- Enable prompt caching - Can reduce costs by 90% for repeated context
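For the retry tip, a minimal exponential-backoff wrapper is usually enough. This is a sketch, assuming transient failures surface as exceptions; adjust the catch filter to match how your SDK version actually reports them:
using System;
using System.Threading.Tasks;
using LlmTornado.Chat;
public static class ChatRetry
{
    // Retry a chat call with exponential backoff when a transient error (e.g. a 503) occurs.
    public static async Task<string?> GetResponseWithRetry(Conversation chat, int maxAttempts = 3)
    {
        for (int attempt = 1; ; attempt++)
        {
            try
            {
                return await chat.GetResponse();
            }
            catch (Exception ex) when (attempt < maxAttempts)
            {
                TimeSpan delay = TimeSpan.FromSeconds(Math.Pow(2, attempt)); // 2s, 4s, 8s...
                Console.WriteLine($"Attempt {attempt} failed ({ex.Message}); retrying in {delay.TotalSeconds}s");
                await Task.Delay(delay);
            }
        }
    }
}
// Usage: string? reply = await ChatRetry.GetResponseWithRetry(chat);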
For more patterns and examples, check the LlmTornado repository on GitHub. It includes error handling, retry patterns, and production configurations.
What I'm Exploring Next
I'm currently testing Gemini's computer use capabilities for automated testing. The idea is to have the model interact with web UIs, find bugs, and suggest fixes. It's early, but the potential for autonomous testing workflows is interesting.
For teams evaluating AI in 2025, I recommend starting with flexibility. Build abstractions that let you switch models easily. Instrument everything for observability. Don't optimize too early. The AI landscape changes monthly—flexibility beats picking the "perfect" model today.
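As one example of that kind of abstraction, the application can depend on a small interface and keep the SDK behind a single implementation. The interface below is hypothetical, not an LlmTornado type; it simply shows where the seam goes:
using System.Threading.Tasks;
using LlmTornado;
using LlmTornado.Chat;
using LlmTornado.Chat.Models;
// Hypothetical abstraction: the rest of the app depends only on this interface,
// so swapping models or SDKs touches a single class.
public interface ICompletionService
{
    Task<string?> CompleteAsync(string prompt);
}
public class TornadoCompletionService : ICompletionService
{
    private readonly TornadoApi _api;
    private readonly ChatModel _model;
    public TornadoCompletionService(TornadoApi api, ChatModel model)
    {
        _api = api;
        _model = model;
    }
    public async Task<string?> CompleteAsync(string prompt)
    {
        Conversation chat = _api.Chat.CreateConversation(new ChatRequest { Model = _model });
        chat.AddUserMessage(prompt);
        return await chat.GetResponse();
    }
}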
Gemini's combination of multimodal capabilities, large context windows, and code execution makes it valuable for C# applications beyond simple chatbots. Whether you use LlmTornado, Google's SDK, or build your own client, understanding these capabilities helps you make better architectural decisions.