<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Faizan Reza</title>
    <description>The latest articles on DEV Community by Faizan Reza (@faizan_reza_e0b8108e3f8ee).</description>
    <link>https://dev.to/faizan_reza_e0b8108e3f8ee</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3995770%2Fd6706bd9-7dd3-4a12-b7da-4ee9d3ad6d2a.png</url>
      <title>DEV Community: Faizan Reza</title>
      <link>https://dev.to/faizan_reza_e0b8108e3f8ee</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/faizan_reza_e0b8108e3f8ee"/>
    <language>en</language>
    <item>
      <title>How I Use AI as My "Junior Analyst" 5 Prompt Templates That Actually Work</title>
      <dc:creator>Faizan Reza</dc:creator>
      <pubDate>Sun, 21 Jun 2026 21:42:41 +0000</pubDate>
      <link>https://dev.to/faizan_reza_e0b8108e3f8ee/how-i-use-ai-as-my-junior-analyst-5-prompt-templates-that-actually-work-38hn</link>
      <guid>https://dev.to/faizan_reza_e0b8108e3f8ee/how-i-use-ai-as-my-junior-analyst-5-prompt-templates-that-actually-work-38hn</guid>
      <description>&lt;p&gt;Most data analysts I know are using AI wrong.&lt;/p&gt;

&lt;p&gt;They type things like "analyze this data" or "write me a SQL query" — and get back generic, surface-level responses. That's like telling a junior analyst "do analysis" with no brief, no context, no expectations.&lt;/p&gt;

&lt;p&gt;The difference between a useless AI response and one that saves you 3 hours? &lt;strong&gt;The prompt.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I've spent months engineering prompts specifically for data workflows. Here are 5 that consistently deliver production-ready output — and the principles behind why they work.&lt;/p&gt;




&lt;h2&gt;
  
  
  1. The Universal Data Cleaner
&lt;/h2&gt;

&lt;p&gt;Data cleaning is 60-80% of an analyst's job, but most prompts skip edge cases entirely. This one doesn't:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;I&lt;/span&gt; &lt;span class="n"&gt;have&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;dataset&lt;/span&gt; &lt;span class="k"&gt;with&lt;/span&gt; &lt;span class="n"&gt;mixed&lt;/span&gt; &lt;span class="n"&gt;data&lt;/span&gt; &lt;span class="n"&gt;quality&lt;/span&gt; &lt;span class="n"&gt;issues&lt;/span&gt; &lt;span class="err"&gt;—&lt;/span&gt; &lt;span class="n"&gt;missing&lt;/span&gt; &lt;span class="n"&gt;values&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; 
&lt;span class="n"&gt;inconsistent&lt;/span&gt; &lt;span class="n"&gt;formats&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;duplicates&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="ow"&gt;and&lt;/span&gt; &lt;span class="n"&gt;outliers&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt; &lt;span class="n"&gt;Here&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;s a sample:

[PASTE 5-10 ROWS]

Give me a complete cleaning pipeline in Python (pandas) that:
- Detects and reports all data quality issues
- Handles missing values intelligently (not just dropna)
- Standardizes date/text/number formats
- Flags outliers with justification
- Outputs a clean dataset + a cleaning report
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Why this works:&lt;/strong&gt; Instead of "clean my data" (which gets you &lt;code&gt;df.dropna()&lt;/code&gt; at best), you're telling the AI exactly what quality dimensions to check and what kind of output you expect. "Not just dropna" is the key phrase — it forces the model to think about imputation strategies.&lt;/p&gt;




&lt;h2&gt;
  
  
  2. The Query Optimizer
&lt;/h2&gt;

&lt;p&gt;Writing SQL is easy. Writing SQL that runs fast on 10M+ rows is an art. This prompt turns AI into your DBA:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="n"&gt;This&lt;/span&gt; &lt;span class="k"&gt;SQL&lt;/span&gt; &lt;span class="n"&gt;query&lt;/span&gt; &lt;span class="k"&gt;is&lt;/span&gt; &lt;span class="n"&gt;running&lt;/span&gt; &lt;span class="n"&gt;too&lt;/span&gt; &lt;span class="n"&gt;slow&lt;/span&gt; &lt;span class="n"&gt;against&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="k"&gt;large&lt;/span&gt; &lt;span class="k"&gt;table&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt; &lt;span class="n"&gt;Optimize&lt;/span&gt; &lt;span class="n"&gt;it&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;

&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="n"&gt;PASTE&lt;/span&gt; &lt;span class="n"&gt;YOUR&lt;/span&gt; &lt;span class="n"&gt;SLOW&lt;/span&gt; &lt;span class="n"&gt;QUERY&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;

&lt;span class="k"&gt;Explain&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;
&lt;span class="o"&gt;-&lt;/span&gt; &lt;span class="n"&gt;What&lt;/span&gt;&lt;span class="s1"&gt;'s causing the slowness (scan type, joins, subqueries)
- 3-5 specific optimizations in order of impact
- Which indexes I should create
- The rewritten optimized query
- Estimated improvement for each change
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Why this works:&lt;/strong&gt; You're not just asking for a rewritten query — you're demanding a diagnosis. The structured output format forces the model to think in terms of query execution plans, not just syntax. "Estimated improvement" makes it quantify its claims.&lt;/p&gt;




&lt;h2&gt;
  
  
  3. The Root Cause Analysis Framework
&lt;/h2&gt;

&lt;p&gt;Something broke. A KPI tanked. Your boss wants answers by EOD. This prompt is your investigation partner:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;[KPI] dropped by [X%] starting on [DATE]. Here's the available data:
[DESCRIBE DATA SOURCES]

Walk me through a structured root cause analysis:
- What to check first, second, third (prioritized)
- Segmentation cuts that could reveal the driver
- Common pitfalls in this type of investigation
- How to present findings to stakeholders
- Python/SQL code templates for each diagnostic step
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Why this works:&lt;/strong&gt; The prioritization is the secret sauce. Most analysts go down rabbit holes. This prompt forces the AI to give you a triage framework, not just "look at the data."&lt;/p&gt;




&lt;h2&gt;
  
  
  4. The Executive Dashboard Designer
&lt;/h2&gt;

&lt;p&gt;Dashboards that impress engineers are useless. Dashboards that impress executives get you promoted. Same data, different framing:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;Design&lt;/span&gt; &lt;span class="n"&gt;an&lt;/span&gt; &lt;span class="n"&gt;executive&lt;/span&gt; &lt;span class="n"&gt;dashboard&lt;/span&gt; &lt;span class="k"&gt;for&lt;/span&gt; &lt;span class="n"&gt;monitoring&lt;/span&gt; &lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="n"&gt;BUSINESS&lt;/span&gt; &lt;span class="n"&gt;AREA&lt;/span&gt;&lt;span class="p"&gt;].&lt;/span&gt; 
&lt;span class="n"&gt;Key&lt;/span&gt; &lt;span class="n"&gt;stakeholders&lt;/span&gt; &lt;span class="n"&gt;care&lt;/span&gt; &lt;span class="n"&gt;about&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;
&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="n"&gt;LIST&lt;/span&gt; &lt;span class="mi"&gt;5&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="mi"&gt;7&lt;/span&gt; &lt;span class="n"&gt;KEY&lt;/span&gt; &lt;span class="n"&gt;METRICS&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;

&lt;span class="n"&gt;Provide&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;
&lt;span class="o"&gt;-&lt;/span&gt; &lt;span class="n"&gt;Dashboard&lt;/span&gt; &lt;span class="n"&gt;layout&lt;/span&gt; &lt;span class="nf"&gt;wireframe &lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;top&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;level&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;detail&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;drill&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;down&lt;/span&gt; &lt;span class="n"&gt;views&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="o"&gt;-&lt;/span&gt; &lt;span class="n"&gt;Which&lt;/span&gt; &lt;span class="n"&gt;charts&lt;/span&gt; &lt;span class="k"&gt;for&lt;/span&gt; &lt;span class="n"&gt;which&lt;/span&gt; &lt;span class="nf"&gt;metrics &lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="ow"&gt;and&lt;/span&gt; &lt;span class="n"&gt;why&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="o"&gt;-&lt;/span&gt; &lt;span class="n"&gt;Color&lt;/span&gt; &lt;span class="n"&gt;scheme&lt;/span&gt; &lt;span class="ow"&gt;and&lt;/span&gt; &lt;span class="n"&gt;visual&lt;/span&gt; &lt;span class="n"&gt;hierarchy&lt;/span&gt;
&lt;span class="o"&gt;-&lt;/span&gt; &lt;span class="n"&gt;What&lt;/span&gt; &lt;span class="n"&gt;should&lt;/span&gt; &lt;span class="n"&gt;be&lt;/span&gt; &lt;span class="n"&gt;real&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;time&lt;/span&gt; &lt;span class="n"&gt;vs&lt;/span&gt; &lt;span class="n"&gt;daily&lt;/span&gt; &lt;span class="n"&gt;vs&lt;/span&gt; &lt;span class="n"&gt;weekly&lt;/span&gt; &lt;span class="n"&gt;refresh&lt;/span&gt;
&lt;span class="o"&gt;-&lt;/span&gt; &lt;span class="n"&gt;Alert&lt;/span&gt; &lt;span class="n"&gt;thresholds&lt;/span&gt; &lt;span class="k"&gt;for&lt;/span&gt; &lt;span class="n"&gt;each&lt;/span&gt; &lt;span class="n"&gt;metric&lt;/span&gt;
&lt;span class="o"&gt;-&lt;/span&gt; &lt;span class="n"&gt;Plotly&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;Dash&lt;/span&gt; &lt;span class="ow"&gt;or&lt;/span&gt; &lt;span class="n"&gt;Streamlit&lt;/span&gt; &lt;span class="n"&gt;starter&lt;/span&gt; &lt;span class="n"&gt;code&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Why this works:&lt;/strong&gt; Most dashboard prompts produce a list of charts. This one produces a design system — visual hierarchy, refresh cadence, alert thresholds. That's what separates a monitoring tool from a decision-making tool.&lt;/p&gt;




&lt;h2&gt;
  
  
  5. The Stakeholder Communication Translator
&lt;/h2&gt;

&lt;p&gt;Your analysis is solid. Your presentation is… technical. This prompt bridges the gap:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;I need to present [TECHNICAL FINDING] to [EXECUTIVE/MARKETING/ENGINEERING].

Translate my technical findings into their language:
- The 30-second version (elevator pitch)
- The 3-minute version (key insights + recommendation)
- What they'll care about (tailored to their priorities)
- What they'll push back on (and how to handle it)
- One visual that makes the point better than words
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Why this works:&lt;/strong&gt; Different audiences have different "so what"s. This prompt forces the AI to reframe the same finding for different mental models. The "what they'll push back on" section is pure gold — it prepares you for the actual conversation, not just the presentation.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Formula Behind These Prompts
&lt;/h2&gt;

&lt;p&gt;All of these follow the same pattern. I call it the &lt;strong&gt;Genie Formula:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;[CONTEXT] I am a [ROLE] working on [PROBLEM].
[INPUT] Here is my data/code/situation: [DETAILS]
[TASK] I need you to: [SPECIFIC REQUEST]
[FORMAT] Please provide the output as: [STRUCTURE]
[CONSTRAINTS] Important constraints: [TECH STACK, PRIVACY, FORMAT]
[QUALITY] Double-check for: [EDGE CASES, PERFORMANCE, ACCURACY]
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The more structure you give, the better the output. AI isn't a magic wand — it's a junior analyst who needs clear instructions.&lt;/p&gt;




&lt;h2&gt;
  
  
  Want All 55?
&lt;/h2&gt;

&lt;p&gt;I've compiled my full prompt library — 55 prompts across 8 categories (data cleaning, SQL, Python, visualization, business intelligence, automation, career acceleration) — into a PDF.&lt;/p&gt;

&lt;p&gt;Each prompt follows the Genie Formula and is engineered for production use, not demo purposes.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;a href="https://geniehq.gumroad.com/l/data-analyst-ai-command-center" rel="noopener noreferrer"&gt;Get the full pack → $9.99 on Gumroad&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What's inside:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;📊 10 Data Cleaning &amp;amp; Preparation prompts&lt;/li&gt;
&lt;li&gt;🗄️ 8 SQL Generation &amp;amp; Optimization prompts&lt;/li&gt;
&lt;li&gt;📈 7 Statistical Analysis prompts (A/B tests, segmentation, forecasting)&lt;/li&gt;
&lt;li&gt;📉 7 Visualization &amp;amp; Reporting prompts&lt;/li&gt;
&lt;li&gt;🐍 7 Python Code Generation prompts&lt;/li&gt;
&lt;li&gt;💼 7 Business Intelligence prompts&lt;/li&gt;
&lt;li&gt;⚡ 5 Automation prompts&lt;/li&gt;
&lt;li&gt;🚀 4 Career Accelerator prompts&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  What's Your Go-To Prompt?
&lt;/h2&gt;

&lt;p&gt;I'm curious — what's the one prompt structure you've found that consistently produces great results for data work? Drop it in the comments.&lt;/p&gt;

&lt;p&gt;And if you've never structured your prompts beyond "analyze this" — try the Genie Formula above. The difference will surprise you.&lt;/p&gt;




</description>
      <category>ai</category>
      <category>python</category>
      <category>datascience</category>
      <category>productivity</category>
    </item>
  </channel>
</rss>
