STATISTICS FOR DATA ANALYTICS - 4
In this part we study measures of central tendency, for both a population and a sample.
Measure of Central Tendency : -
What is central tendency?
It is a single value that describes the centre of a group of numbers.
The three common measures are the mean, the median and the mode.
Mean : -
Arithmetic mean
It is the ratio of the sum of all the observations in the data to the total number of observations.
It is used for continuous and discrete numeric data, but it cannot be used for categorical data.
It is sensitive to skewness: extreme values pull the mean toward the long tail of the distribution.
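Population mean: μ = (x₁ + x₂ + ... + x_N) / N, where N is the number of values in the whole population.
Sample mean: x̄ = (x₁ + x₂ + ... + x_n) / n, where n is the number of values in the sample.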
If you are in college, the formulas themselves matter. If you are a professional, what matters more is understanding why and when each one is used, so that you get the correct insight.
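To make the definition concrete, here is a minimal Python sketch of the population mean, the sample mean, and how one extreme value shifts the mean. The numbers are made up for illustration, and treating the first four values as "the sample" is just an assumption to keep the example short.

```python
from statistics import mean

# Hypothetical data, invented for this example.
population = [12, 15, 11, 18, 14, 16, 13, 17, 15, 19]
sample = population[:4]  # assumption: the first four values act as our sample

# Mean = sum of observations / number of observations.
population_mean = sum(population) / len(population)
sample_mean = sum(sample) / len(sample)

# One extreme value drags the mean far away from the typical values.
with_outlier = population + [150]
mean_with_outlier = mean(with_outlier)

print(population_mean, sample_mean, round(mean_with_outlier, 2))
```

The calculation is the same for a population and a sample. Only the set of values you feed in changes.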
Median : -
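It is the exact middle value. The data must be sorted first.
Let n = count(sample) and compute the position (n + 1) / 2, counting from 1.
Odd n: the position is a whole number.
It gives the position of the median; find the value at that position in the sorted table.
Even n: the position is a fractional value such as 5.5.
A fractional position means there is no single middle value.
Take the two positions on either side of it and calculate the mean of both values.
Example: n = 10 gives 5.5.
Take the 5th value and the 6th value and calculate their mean.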
The median is unaffected by extreme values.
It does not use all the information in the numbers: only their order and the middle one or two values matter.
The median is robust to the shape of the distribution, so it is used when the data are skewed or contain outliers.
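The odd and even rules above can be sketched in a few lines of Python. The helper name `median_by_position` and the sample lists are hypothetical, and the result is cross-checked against `statistics.median` from the standard library.

```python
from statistics import median

def median_by_position(values):
    """Median via the sort-then-pick-the-middle rule (illustrative helper)."""
    if not values:
        raise ValueError("median of empty data is undefined")
    ordered = sorted(values)          # the data must be sorted first
    n = len(ordered)
    mid = n // 2                      # 0-based index of the upper middle value
    if n % 2 == 1:
        return ordered[mid]           # odd count: single middle value
    # even count: mean of the two middle values
    return (ordered[mid - 1] + ordered[mid]) / 2

odd_data = [7, 3, 9, 1, 5]                       # made-up values, n = 5
even_data = [4, 8, 1, 9, 3, 7, 2, 6, 5, 10]      # made-up values, n = 10
with_outlier = [1, 3, 5, 7, 900]                 # one extreme value

print(median_by_position(odd_data))      # middle (3rd) value of the sorted list
print(median_by_position(even_data))     # mean of the 5th and 6th sorted values
print(median_by_position(with_outlier))  # the outlier does not move the median
```

Note that Python lists are indexed from 0, so the "5th and 6th values" live at indexes 4 and 5.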
Mode: -
It is the most frequently occurring value in a distribution.
A dataset can have one mode or more than one mode.
It is mostly used with categorical data, for example to fill in missing values.
Types :-
Unimodal - 1 mode in a dataset.
Bimodal - 2 modes in a dataset.
Trimodal - 3 modes in a dataset.
Multimodal - more than one mode in a dataset (bimodal and trimodal are special cases).
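As a rough illustration of these types, the sketch below finds every mode with `statistics.multimode` (available in Python 3.8 and later) and labels the dataset by how many modes it has. The helper `describe_modes` and the example lists are assumptions made for this example.

```python
from statistics import multimode

def describe_modes(values):
    """Return (list of modes, label) for a dataset (illustrative helper)."""
    modes = multimode(values)  # all values tied for the highest frequency
    labels = {1: "unimodal", 2: "bimodal", 3: "trimodal"}
    return modes, labels.get(len(modes), "multimodal")

colors = ["red", "blue", "red", "green"]   # made-up categorical data
scores = [1, 1, 2, 2, 3]                   # made-up numeric data

print(describe_modes(colors))
print(describe_modes(scores))
```

One caveat: if every value occurs exactly once, `multimode` returns all of them, so in practice you may want to treat that case as "no meaningful mode".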
PRO TIPS
Filling empty cells in a dataset :-
Categorical data - use the mode only
Numerical data - use the mean ( only once the data follows an approximately normal distribution, because skew and outliers distort the mean )
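Here is a minimal sketch of these tips in plain Python, assuming missing cells are represented as `None`. The function `fill_missing`, the strategy names and the data are all hypothetical. The median option is included because, as noted in the median section, it copes better with outliers than the mean.

```python
from statistics import mean, median, mode

def fill_missing(values, strategy):
    """Replace None entries using the chosen statistic (illustrative helper)."""
    fillers = {"mean": mean, "median": median, "mode": mode}
    observed = [v for v in values if v is not None]  # ignore the empty cells
    fill_value = fillers[strategy](observed)
    return [fill_value if v is None else v for v in values]

colors = ["red", None, "blue", "red", None]   # categorical: use the mode
heights = [160, 170, None, 180]               # roughly symmetric: mean is fine
incomes = [20, 22, 25, None, 400]             # skewed by an outlier: try the median

filled_colors = fill_missing(colors, "mode")
filled_heights = fill_missing(heights, "mean")
filled_incomes = fill_missing(incomes, "median")

print(filled_colors)
print(filled_heights)
print(filled_incomes)
```

In a real project you would more likely do this with a dataframe library, but the choice of statistic stays the same.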
Follow me here, where I add a new post whenever I learn something new about this topic :- https://dev.to/nitinbhatt46
Thank you for your time.