Nicholus Gathirwa

Posted on Nov 7

Bayesians vs Frequentists: A Beginner's Guide

#beginners #datascience #tutorial

At their core, both approaches want to help us understand uncertainty and make decisions based on data. But they go about it in fundamentally different ways.

Both want to answer the question, "What does probability mean, and how do we use data to make inferences?"

Keep in mind these central questions:

"What does probability represent?" Bayesians and Frequentists answer this differently.
"How do we update our knowledge with new data?" This is where the approaches diverge most clearly.

Frequentist Statistics: The Long-Run Frequency Approach

Frequentist statistics is the traditional approach you'll encounter in most introductory statistics courses.

What Probability Means to Frequentists

To a Frequentist, probability is about long-run frequency. If you flip a coin infinitely many times, the proportion of heads approaches 0.5. That's probability ie what happens in the long run if you repeat an experiment over and over.

Core Principle

Frequentists treat parameters as fixed but unknown. For example, a population mean exists as one true value. We just don't know it. The job is to use sample data to estimate it.

How They Work with Data

No prior beliefs: Frequentists don't incorporate prior knowledge or beliefs into their analysis. The data speaks for itself.
P-values and confidence intervals: These are core in Frequentist inference. A 95% confidence interval means "if we repeated this sampling process infinitely, 95% of such intervals would contain the true parameter."
Hypothesis testing: You set up a null hypothesis and calculate the probability of seeing your data (or more extreme data) if the null is true. That's your p-value.

What Frequentists CANNOT Say

Here's the tricky part: Frequentists cannot make probability statements about parameters. They can't say "there's a 95% probability the true mean is between 10 and 20." Why? Because to them, the parameter is fixed ie it either is or isn't in that interval. The randomness is in the sampling process, not the parameter.

Bayesian Statistics: The Belief Update Approach

What Probability Means to Bayesians

To a Bayesian, probability represents degree of belief or uncertainty. It's subjective and personal. Probability quantifies what we believe about something, given all available information.

Core Principle

Bayesians treat parameters as random variables with probability distributions. There's no single "true" value we're trying to estimate instead, we describe our uncertainty about the parameter with a distribution.

How bayesians Work with Data

Prior beliefs: Bayesians start with a prior distribution that represents what they believe before seeing the data. This could be based on previous studies, expert opinion, or even complete ignorance.
Likelihood: This is the probability of observing the data given different parameter values (both approaches use this).
Posterior distribution: Using Bayes' Theorem, they combine the prior and likelihood to get a posterior distribution—what they believe after seeing the data.

The fundamental equation:
Posterior is directly proportional to Prior x Likelihood

What Bayesians CAN Say

Bayesians can make probability statements about parameters! They can say "there's a 95% probability the true mean is between 10 and 20" because they treat the parameter as having a probability distribution.

refer back to what frequentists cannot say

The Key Differences at a Glance

1. Interpretation of Probability

Frequentist: Long-run frequency (objective)
Bayesian: Degree of belief (subjective)

2. Treatment of Parameters

Frequentist: Fixed but unknown values
Bayesian: Random variables with distributions

3. Use of Prior Information

Frequentist: Not incorporated (only data matters)
Bayesian: Explicitly incorporated through prior distributions

4. Type of Results

Frequentist: Point estimates, confidence intervals, p-values
Bayesian: Probability distributions (posterior distributions), credible intervals

5. Interpretation of Intervals

Frequentist: "95% of such intervals would contain the true parameter if we repeated sampling infinitely"
Bayesian: "There's a 95% probability the parameter lies in this interval"

6. Flexibility

Frequentist: Simpler for standard problems, well-established methods
Bayesian: More flexible for complex models, but computationally intensive

A Practical Example: Coin Flipping

Imagine you flip a coin 10 times and get 7 heads. You want to know if the coin is fair.

Frequentist Approach:

Null hypothesis: The coin is fair (p = 0.5)
Calculate: What's the probability of getting 7 or more heads if the coin is actually fair?
P-value = 0.34 (not unusual)
Conclusion: "We fail to reject the null hypothesis. The data is consistent with a fair coin."
Note: They cannot say "there's a 34% chance the coin is fair"

Bayesian Approach:

Prior: Maybe you believe the coin is probably fair, but you're open to it being biased
Likelihood: The probability of 7 heads in 10 flips for different values of p
Posterior: Combine prior and likelihood using Bayes' Theorem
Conclusion: "Given the data and my prior beliefs, I now believe there's an 85% probability that p is between 0.45 and 0.85"
They can make probability statements about the parameter

Note that most of "basic" statistics we do, we use frequentists approaches. By basic I mean this is the one mostly used in introductory statistics courses and right to do so as this is the traditional approach and has been the dominant paradigm.

Strengths and weaknesses.

Frequentist Strengths

Objective doesn't depend on subjective priors
Well-established with clear procedures
Computationally simpler for many problems
Widely accepted in scientific publications

Frequentist weaknesses

Confidence intervals are often misinterpreted as Bayesian credible intervals
P-values are frequently misused and misunderstood
Ignores prior knowledge that might be valuable
Long-run frequency interpretation can be unintuitive

Bayesian Strengths

Intuitive interpretation (directly answers "what do we believe?")
Naturally incorporates prior information
Excellent for complex models and hierarchical structures
Provides full probability distributions, not just point estimates
Updates naturally as new data arrives

Bayesian weaknesses

Results depend on choice of prior (though this can also be a strength)
Computationally intensive (requires specialized software and techniques)
Less standardized ie many choices to make
"Subjective" nature makes some scientists uncomfortable

When to Use Which?

Use Frequentist methods when:

You want objective, widely-accepted standard procedures
You have no reliable prior information
You're working with simple, standard statistical tests
You need computational efficiency
You're publishing in fields that require Frequentist approaches

Use Bayesian methods when:

You have valuable prior information to incorporate
You want intuitive probability statements about parameters
You're working with complex hierarchical models
You want to update beliefs as new data arrives
You need to make decisions under uncertainty

The Bottom Line

The difference between Bayesian and Frequentist statistics boils down to fundamentally different answers to "What is probability?"

Frequentists say: "Probability is long-run frequency. Parameters are fixed. Data is random."
Bayesians say: "Probability is degree of belief. Parameters are uncertain. Everything has a distribution."

Neither is inherently "better" as they are different tools for different situations

The important thing is understanding what each approach assumes and what it can (and cannot) tell you. That way, you can interpret results correctly and choose the right tool for your problem.

Hope you've understood dear reader and leave a comment if you have any concern.

DEV Community

Bayesians vs Frequentists: A Beginner's Guide

Frequentist Statistics: The Long-Run Frequency Approach

What Probability Means to Frequentists

Core Principle

How They Work with Data

What Frequentists CANNOT Say

Bayesian Statistics: The Belief Update Approach

What Probability Means to Bayesians

Core Principle

How bayesians Work with Data

What Bayesians CAN Say

The Key Differences at a Glance

1. Interpretation of Probability

2. Treatment of Parameters

3. Use of Prior Information

4. Type of Results

5. Interpretation of Intervals

6. Flexibility

A Practical Example: Coin Flipping

Strengths and weaknesses.

Frequentist Strengths

Frequentist weaknesses

Bayesian Strengths

Bayesian weaknesses

When to Use Which?

The Bottom Line

Top comments (0)