Sam Chen

Posted on Jun 21

Advanced LLM Prompting Techniques: From Zero-Shot to Chain-of-Thought

#programming

A practical guide to dramatically improve your AI model outputs

Introduction

You've probably noticed that the difference between a mediocre AI response and an excellent one often comes down to how you ask the question. This isn't magic—it's technique.

After spending months working with various LLMs (GPT-4, Claude, Mistral, etc.), I've discovered that mastering prompt engineering is like learning to cook. You can follow a recipe, or you can understand why ingredients work together.

Let's explore the techniques that actually move the needle.

1. The Problem with Generic Prompts

Before we dive into solutions, let's understand the baseline:

// ❌ Weak prompt
"Explain machine learning"

// Result: Generic, surface-level explanation

The issue? You're asking the model to guess your intent, audience level, and use case. It makes default assumptions—and defaults are rarely what you need.

2. The Role-Based Prompt Technique

What it does: Anchors the model in a specific perspective, triggering relevant knowledge patterns.

// ✅ Better prompt
You are an expert systems architect with 15 years of experience.
Explain machine learning to a junior developer who just finished 
their first full-stack project.

Keep explanations practical. Use examples from web development.
Avoid academic jargon.

Why it works: LLMs perform better when given a role. It's like the difference between asking a doctor for health advice versus asking a random person. The model internally activates relevant knowledge patterns.

Pro tip: Be specific about the role. "Expert developer" is weaker than "Senior Backend Engineer specializing in Python microservices."

3. Chain-of-Thought Prompting

This is where the magic happens.

The principle: Instead of asking for a final answer, ask the model to show its work.

// ❌ Direct approach
"Will adding caching improve our API response time?"

// ✅ Chain-of-thought
"Let's think through this step-by-step:
1. First, explain what our current bottleneck likely is
2. Then describe how caching would address it
3. Walk through the tradeoffs (consistency, storage, complexity)
4. Finally, recommend a decision"

The research: Papers like "Chain-of-Thought Prompting Elicits Reasoning in Large Language Models" show that CoT improves performance on complex tasks by 10-40%.

Example in code review:

// Request: "Review this function for bugs"

// Better request with CoT:
"Review this authentication function. Walk through it step-by-step:
1. What does each line do?
2. What could go wrong in each section?
3. Are there race conditions, injection points, or state issues?
4. List any vulnerabilities found"

4. Few-Shot Prompting

Humans learn by example. So do LLMs.

// ❌ Single instruction
"Convert these function names to snake_case"

// ✅ Few-shot (show examples first)
"Convert function names to snake_case. Here are examples:

getUserData → get_user_data
fetchAPIResponse → fetch_api_response
calculateTotalPrice → calculate_total_price

Now convert these:
loadConfigFile → ?
validateUserInput → ?
processPaymentData → ?"

When to use it:

Establishing style preferences (code tone, format, structure)
Demonstrating edge case handling
Teaching the model your specific conventions

5. The Structured Output Prompt

LLMs are great at producing messy, natural text. They're even better when you tell them exactly what structure you want.

// Request with structure
"Analyze this code snippet for security issues.

Format your response as JSON:
{
  "severity": "high|medium|low",
  "vulnerabilities": ["vulnerability1", "vulnerability2"],
  "explanation": "clear explanation",
  "fix": "code snippet showing the fix"
}"

Benefits:

Easier to parse in code
Fewer parsing errors
Model tends to be more concise and accurate

6. Negative Prompting

Tell the model what not to do.

// ❌ Vague
"Explain async/await in JavaScript"

// ✅ With constraints
"Explain async/await in JavaScript.

DO NOT:
- Use callback examples (assume the user already understands them)
- Reference Promises in detail (separate topic)
- Use ES5 syntax examples
- Exceed 150 words

DO:
- Use modern async/await syntax
- Include at least one practical example
- Mention common pitfalls"

This dramatically reduces unnecessary context and keeps responses focused.

7. Temperature + Prompt Combination

Temperature controls randomness (0 = deterministic, 1+ = creative).

Pairing strategy:

// For precise, factual responses (docs, bug analysis)
const precisePrompt = {
  temperature: 0.1,
  prompt: "List every security best practice for REST APIs. Be exhaustive."
};

// For creative solutions (brainstorming, naming)
const creativePrompt = {
  temperature: 0.8,
  prompt: "Generate 10 creative variable names for a user preference object"
};

// For balanced responses (explanations, debugging)
const balancedPrompt = {
  temperature: 0.5,
  prompt: "What's the most likely cause of this error?"
};

8. The Meta Prompt: Asking the Model to Improve Itself

This is advanced, but surprisingly effective:

// Initial question:
"How do I optimize a React component?"

// Add a meta-layer:
"Before answering, tell me:
1. What assumptions am I making about your experience level?
2. What additional information would make this answer more useful?
3. Any edge cases or caveats I should mention?"

The model becomes self-aware about its own reasoning. Responses become more thoughtful and comprehensive.

9. Putting It All Together: A Real-World Example

Let's say you want the model to help debug a performance issue:

// ❌ Weak version
"Why is my app slow?"

// ✅ Strong version
You are a senior performance engineer debugging a React web application.

A user reports that their dashboard takes 8+ seconds to load after login.
Here's the relevant code: [code snippet]
The network tab shows 23 requests, some in parallel.

Let's analyze this step-by-step:
1. First, identify which requests are critical path vs. optional
2. Estimate the bottleneck (network, rendering, computation?)
3. Suggest 3 specific optimizations, ranked by impact
4. For each, explain the tradeoff and implementation approach

Format your response as:
{
  "bottleneck": "...",
  "recommendations": [
    {
      "optimization": "...",
      "impact": "high|medium|low",
      "effort": "hours needed",
      "explanation": "...",
      "code_example": "..."
    }
  ]
}

Assume:
- The app uses React 18
- Database queries are already optimized
- We have a 2-week timeline
- Code clarity is important (don't recommend premature optimizations)

Compare this to the weak version. The strong version gets 10x better results.

10. Common Mistakes to Avoid

❌ Mistake	✅ Fix
Being too polite	Be direct: "Do X, then explain why" not "Could you possibly...?"
Changing multiple variables	Test one technique at a time
Using vague terms	Specific > descriptive > vague
Ignoring the model's limitations	LLMs struggle with: math, very large context, real-time data
Not iterating	Refine based on results, don't expect perfection on v1

Testing Your Prompts

Here's a simple framework:


javascript
const testPrompt = async (prompt, testCases) => {
  const results = [];

  for (const testCase of testCases) {
    const response = await callLLM(prompt + testCase.input);
    const passed = evaluateResponse(response, testCase.expected);
    results.push({ test

DEV Community