DEV Community

ClawGear
ClawGear

Posted on

35 ChatGPT Prompts for Epidemiologists: From Data to Public Health Impact

Epidemiologists investigate disease distribution, identify risk factors, design surveillance systems, and translate complex data into actionable public health guidance. These 35 ChatGPT prompts help you move from data to insight faster, communicate findings more effectively, and design stronger studies. Copy, paste, and adapt them to your work.

1. Study Design & Methods

I am designing an epidemiological study to investigate the association between [exposure, e.g., particulate matter air pollution] and [outcome, e.g., incident asthma] in [population, e.g., children ages 5-12 in urban areas]. Compare the strengths and limitations of a cohort study, case-control study, and cross-sectional design for this specific research question. Recommend the optimal design and justify your choice.
Enter fullscreen mode Exit fullscreen mode
Help me calculate the required sample size for a [study design] investigating [exposure-outcome association]. The following parameters apply: expected prevalence/incidence [X], odds/risk ratio of interest [Y], desired power [80/90]%, alpha [0.05], and estimated loss to follow-up [Z]%. Show the formula and walk me through the calculation step by step.
Enter fullscreen mode Exit fullscreen mode
I am designing a case-control study on [topic]. Help me develop: the case definition (with inclusion and exclusion criteria), the control selection strategy and matching variables, the exposure assessment method, potential sources of selection bias and how to minimize them, and a recall bias mitigation strategy for self-reported exposures.
Enter fullscreen mode Exit fullscreen mode
Evaluate the validity of the following exposure assessment approach for [exposure type] in a prospective cohort study: [describe method, e.g., single baseline questionnaire / biomarker measurement / administrative data linkage]. Assess: measurement error type (random vs. systematic), likely direction of bias in effect estimates, and suggest a more valid approach if available.
Enter fullscreen mode Exit fullscreen mode
I need to design a cluster-randomized trial evaluating [intervention, e.g., a community health worker program] in [setting]. Address: rationale for cluster vs. individual randomization, unit of randomization, intraclass correlation coefficient (ICC) implications for sample size, randomization procedure, and how to account for clustering in the analysis.
Enter fullscreen mode Exit fullscreen mode

2. Epidemiological Analysis

I have a dataset from a cohort study with the following variables: [list key variables including outcome, exposure, covariates, follow-up time]. Walk me through the analysis plan: how to calculate crude and adjusted incidence rates, which confounders to include in multivariable regression, whether to use Cox regression or logistic regression and why, how to handle time-varying exposures, and what sensitivity analyses to run.
Enter fullscreen mode Exit fullscreen mode
Explain how to identify and handle confounding in my analysis of [exposure] and [outcome]. My dataset includes: [list potential confounders]. Describe: how to assess confounding using the 10% change-in-estimate rule, propensity score methods as an alternative, residual confounding risks, and how to present adjusted vs. unadjusted estimates in a table.
Enter fullscreen mode Exit fullscreen mode
I am analyzing data from a cross-sectional survey of [N] participants. The outcome is [binary/continuous] and my key exposure is [variable]. I have the following covariates: [list]. Guide me through: appropriate regression model selection, how to handle missing data (complete case vs. multiple imputation), how to calculate prevalence ratios vs. odds ratios for a common outcome, and how to interpret the results.
Enter fullscreen mode Exit fullscreen mode
Interpret the following epidemiological results for me: [paste results — OR/RR/HR with 95% CI and p-value]. Explain: the magnitude and direction of the association, statistical vs. clinical/public health significance, what the confidence interval tells us, and what I should say and NOT say about causality based on these results.
Enter fullscreen mode Exit fullscreen mode
I want to conduct a meta-analysis on [topic]. Guide me through: defining inclusion/exclusion criteria for studies, the PRISMA process, how to extract and harmonize data across studies with different designs, how to select a fixed vs. random effects model, how to assess and quantify heterogeneity (I² statistic), and how to assess publication bias (funnel plot, Egger's test).
Enter fullscreen mode Exit fullscreen mode

3. Disease Surveillance & Outbreak Investigation

Design a sentinel surveillance system for [disease/condition] in [setting, e.g., a national healthcare system / a low-resource country / a specific industry sector]. Include: objectives, case definition, data sources and collection points, reporting frequency, data flow architecture, analysis plan, and how surveillance findings will trigger public health action.
Enter fullscreen mode Exit fullscreen mode
Walk me through the steps of a field outbreak investigation for a suspected foodborne illness cluster affecting [N] people at [setting, e.g., a wedding reception / a school / a correctional facility]. Follow the classic 10 steps of outbreak investigation, specifying what I should do in the first 24 hours, first week, and in the final report.
Enter fullscreen mode Exit fullscreen mode
I am analyzing syndromic surveillance data and have detected an anomaly: [describe the signal, e.g., a 2.5 SD increase in ED visits for gastrointestinal illness in [city] over the past 3 days]. How do I: assess whether this is a true signal or noise, generate hypotheses about the cause, decide when to escalate to a full investigation, and communicate the preliminary signal to public health leadership?
Enter fullscreen mode Exit fullscreen mode
Develop a line list template and attack rate table for an outbreak investigation of [disease/syndrome] at [setting]. Include: all variables to collect (demographic, clinical, exposure, laboratory), how to calculate attack rates by exposure category, and how to use the attack rate table to identify the most likely vehicle or source.
Enter fullscreen mode Exit fullscreen mode
Write the Methods and Results sections of a preliminary outbreak report for [describe outbreak scenario: disease, setting, case count, dates]. Include: case definition, descriptive epidemiology (person/place/time), epi curve description and interpretation, attack rates, and preliminary hypothesis. Format appropriate for a health department situation report.
Enter fullscreen mode Exit fullscreen mode

4. Public Health Communication & Translation

Translate the following epidemiological finding into a press release for a general public audience: [paste finding, e.g., adjusted OR 1.47 (95% CI 1.21-1.79) for [outcome] among [exposure group]]. The press release should: accurately represent the finding without overstating causality, contextualize the risk, provide actionable guidance, include a quote from a public health official, and avoid fear-inducing language.
Enter fullscreen mode Exit fullscreen mode
Write a health advisory for clinicians about [emerging disease / outbreak / vaccine update]. Include: epidemiological summary (case counts, geographic spread, at-risk populations), clinical presentation and case definition, diagnostic and testing guidance, treatment/prophylaxis recommendations, reporting requirements, and key resources. Tone: authoritative, actionable, concise.
Enter fullscreen mode Exit fullscreen mode
I need to present complex risk data to a [city council / hospital board / community group] that has no epidemiology background. The data shows: [describe finding, e.g., a 1 in 500 lifetime risk of [disease] associated with [exposure]]. Help me explain this risk in three different formats: absolute risk, relative risk framing, and a visual metaphor or natural frequency format that is most intuitive for lay audiences.
Enter fullscreen mode Exit fullscreen mode
Draft a risk communication message for a [disease outbreak / environmental health threat / vaccine hesitancy situation] following the CERC (Crisis and Emergency Risk Communication) principles. Address: acknowledging uncertainty, what we know and don't know, what people should do right now, where to get trusted information, and how we will provide updates.
Enter fullscreen mode Exit fullscreen mode
Write a policy brief (2 pages maximum) summarizing the evidence on [public health topic, e.g., the health effects of [exposure] / the effectiveness of [intervention]] for a state health department audience. Include: key findings from the literature, quality of evidence, public health significance, policy options and trade-offs, and a specific recommendation. Use clear subheadings and avoid statistical jargon.
Enter fullscreen mode Exit fullscreen mode

5. Infectious Disease Epidemiology

Explain the key epidemiological parameters I need to characterize a novel infectious disease outbreak: R0 vs. Rt, serial interval, incubation period, generation time, case fatality rate vs. infection fatality rate, and ascertainment fraction. For each parameter, explain: what it measures, how to estimate it from early outbreak data, its limitations, and its implications for outbreak control.
Enter fullscreen mode Exit fullscreen mode
Design a seroprevalence study to estimate the true burden of [infection] in [population/region] following an outbreak in which confirmed case counts are thought to underrepresent true infections. Cover: study design, target population and sampling strategy, sample size considerations, antibody assay selection and performance characteristics, analysis to adjust for test sensitivity/specificity, and ethical considerations.
Enter fullscreen mode Exit fullscreen mode
I am modeling the impact of a vaccination campaign on [disease] transmission in a population with [vaccination coverage %] and vaccine efficacy of [X]% against infection and [Y]% against transmission. Help me calculate herd immunity threshold, effective reproduction number post-vaccination, and describe the key assumptions and limitations of this simple model.
Enter fullscreen mode Exit fullscreen mode
Develop a contact tracing protocol for [disease] in [setting, e.g., a school / a workplace / a low-resource community setting]. Include: case interview procedure and exposure window, how to define a close contact for this pathogen, notification approach, quarantine vs. isolation guidance, monitoring of contacts, and data management and privacy considerations.
Enter fullscreen mode Exit fullscreen mode
Summarize the current evidence on the effectiveness of [non-pharmaceutical intervention, e.g., masking / ventilation / school closures] in reducing transmission of [pathogen]. Describe: study designs used, effect size range, heterogeneity in evidence, key methodological limitations, and what the evidence does and does not support in terms of policy recommendations.
Enter fullscreen mode Exit fullscreen mode

6. Chronic Disease & Environmental Epidemiology

I am investigating a potential disease cluster of [disease, e.g., childhood leukemia] near [exposure source, e.g., an industrial facility]. What is the appropriate epidemiological approach? Discuss: statistical methods for cluster detection and evaluation (SaTScan, standardized incidence ratios), the limitations of cluster investigations, how to communicate findings to a concerned community, and when a cluster warrants a full analytic study.
Enter fullscreen mode Exit fullscreen mode
Design a study to evaluate the long-term health effects of occupational exposure to [chemical/agent] in [industry sector]. Choose the most appropriate study design, describe exposure assessment using job-exposure matrices or biomonitoring, identify the key health outcomes to track, address the healthy worker effect as a potential bias, and describe ethical considerations for worker research.
Enter fullscreen mode Exit fullscreen mode
Summarize the Bradford Hill criteria for evaluating whether an observed association between [exposure] and [outcome] is likely causal. For each criterion (strength, consistency, specificity, temporality, biological gradient, plausibility, coherence, experiment, analogy), apply it to the evidence for [specific exposure-outcome pair] and render a judgment on causality.
Enter fullscreen mode Exit fullscreen mode
I have geocoded health outcome data and environmental exposure data for [geographic area]. Walk me through a spatial epidemiology analysis plan: how to map outcome rates, how to assess spatial autocorrelation (Moran's I), how to conduct a spatial regression to assess the exposure-outcome association while accounting for spatial autocorrelation, and how to visualize and present results.
Enter fullscreen mode Exit fullscreen mode
Develop a life course epidemiology framework for studying the developmental origins of [chronic disease, e.g., cardiovascular disease / obesity / mental health disorders]. Include: key sensitive periods to study (prenatal, early childhood, adolescence), study design implications (birth cohort), relevant exposures at each life stage, and how epigenetic mechanisms might mediate these associations.
Enter fullscreen mode Exit fullscreen mode

7. Grant Writing, Publishing & Professional Development

Help me write a Specific Aims page for an NIH R01 grant application studying [research question]. Include: an attention-grabbing opening paragraph on the public health significance, a clear statement of the knowledge gap, a brief overview of preliminary data, three specific aims with hypotheses, and a closing paragraph on innovation and impact. Length: 1 page.
Enter fullscreen mode Exit fullscreen mode
Write an abstract for a manuscript reporting the results of a [study design] study examining [exposure-outcome association] in [population]. Include: background (2 sentences), methods (study design, population, exposure and outcome measures, analysis), results (key quantitative findings with effect estimates and confidence intervals), and conclusions (1-2 sentences, no overclaiming). Target journal: [journal name].
Enter fullscreen mode Exit fullscreen mode
Review this Methods section of my epidemiology manuscript and provide feedback: [paste Methods]. Evaluate: completeness against STROBE/CONSORT/PRISMA reporting guidelines, clarity of the study design description, adequacy of the statistical methods section, whether I have described confounding control adequately, and any methodological concerns a reviewer is likely to raise.
Enter fullscreen mode Exit fullscreen mode
I received the following peer review comments on my manuscript: [paste reviewer comments]. Help me draft a point-by-point response letter. For each comment: acknowledge the reviewer's concern, describe how I addressed it (or respectfully explain why I did not), and indicate where changes appear in the revised manuscript. Tone: collegial, thorough, and professionally assertive.
Enter fullscreen mode Exit fullscreen mode
Design a 12-month professional development plan for an early-career epidemiologist wanting to establish a research program in [area, e.g., infectious disease / environmental health / health disparities]. Include: skills to develop (methods, communication, grant writing, leadership), conferences to attend, collaborations to build, a publication goal, and a target grant mechanism to apply for by year's end.
Enter fullscreen mode Exit fullscreen mode

Get the Complete Epidemiologist AI Toolkit

Get the complete AI Prompt Toolkit for Epidemiologists →

Works with Claude, ChatGPT, and DeepSeek. Copy-paste ready.

Top comments (0)