DEV Community

Cover image for Day 29 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -8)
Nitin-bhatt46
Nitin-bhatt46

Posted on

1

Day 29 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -8)

STATISTICS FOR DATA ANALYTICS - 8

Inferential Statistics :-

Very large amount of data for analysis is needed, which may need too much time and resources.

We take small samples of data from big data and create models to get output.

Probability :-

Introduction to Probability :-

What is the likelihood of the event Is called probability.

Type of probability

Mutually exclusive.
Eg:- Dice ( everything is independent )
Non mutually exclusive.
Eg :- Dependent on each other

Rules in Probability

Additive Rule of Probability

Multiplicative Rule of Probability.

Dependent

Independent

BASIC OF PROBABILITY

Steps :-
Find all the combination
Find probability of each
Frequency distribution.
Probability distribution
Use and get the result with Expected value.

Random Variables :-

It is a process of mapping the output of the random process or experiment to a number.
Eg ➖ Tossing a coin.

Sets :-

a={1,2,3,4,5,6,7,8}
b={1,2,3,4}

Intersection - getting common from the sets.
Union - getting everything from the sets.
Difference - (a-b) getting only which is not in b.
Subset - b is a subset of a.
Superset - a is a superset of b.

Probability distribution
It is a distribution form of representation that tells us the probability for all the possible values of X.

PROBABILITY DISTRIBUTION FUNCTION :-

It has two type : -

OVERVIEW :-

PROBABILITY DENSITY FUNCTION ( pdf )
It is for the continuous random variable.
EG :- Height
PROBABILITY MASS FUNCTION ( pmf )
It is for the discrete random variable.
EG :- Rolling of dice
Cumulative Distribution FUNCTION ( cdf ) :-
Cumulative means sum of previous and current value. It can be created for both pdf and pmf. To show the sum up value.

Follow me on this where every day will be added if i learn something new about it :- https://dev.to/nitinbhatt46

Thank you for your time.

Image of Timescale

🚀 pgai Vectorizer: SQLAlchemy and LiteLLM Make Vector Search Simple

We built pgai Vectorizer to simplify embedding management for AI applications—without needing a separate database or complex infrastructure. Since launch, developers have created over 3,000 vectorizers on Timescale Cloud, with many more self-hosted.

Read full post →

Top comments (0)

Billboard image

The Next Generation Developer Platform

Coherence is the first Platform-as-a-Service you can control. Unlike "black-box" platforms that are opinionated about the infra you can deploy, Coherence is powered by CNC, the open-source IaC framework, which offers limitless customization.

Learn more

AWS GenAI Live!

GenAI LIVE! is a dynamic live-streamed show exploring how AWS and our partners are helping organizations unlock real value with generative AI.

Tune in to the full event

DEV is partnering to bring live events to the community. Join us or dismiss this billboard if you're not interested. ❤️