Edmilson Silva

Posted on Aug 17, 2024

Statistics with R - Measures of Central Tendency and Measures of Dispersion

#r #tutorial #statistics #math

mtcars

data(mtcars)
head(mtcars)

Loads and displays the first few rows of the mtcars dataset.

str(mtcars)

Displays the structure of the mtcars dataset, showing the type of each column.

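summary(mtcars)

Displays summary statistics (minimum, quartiles, median, mean, and maximum) for each column of the mtcars dataset.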

Measures of Central Tendency

Mean

\mu = \frac{1}{N} \sum_{i=1}^{N} x_i

Calculates the mean of a sequence of numbers.

n = c(1,2,4,5,6)

print(n)

mean_ = sum(n) / length(n)

print(mean_)

mean_cyl = sum(mtcars$cyl) / length(mtcars$cyl) 

print(mean_cyl)
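As a quick check, the manual calculation can be compared against R's built-in mean function. This is a minimal sketch; the names manual_mean and builtin_mean are chosen here for illustration only.

```r
# Compare the manual mean with R's built-in mean() on the cyl column.
data(mtcars)

manual_mean <- sum(mtcars$cyl) / length(mtcars$cyl)
builtin_mean <- mean(mtcars$cyl)

print(manual_mean)
print(builtin_mean)

# all.equal() compares numbers with a small tolerance for floating-point error.
print(all.equal(manual_mean, builtin_mean))
```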

Median

If N is odd:

\text{Med} = x_{\left(\frac{N+1}{2}\right)}

If N is even:

\text{Med} = \frac{x_{\left(\frac{N}{2}\right)} + x_{\left(\frac{N}{2} + 1\right)}}{2}

Calculates the median of a sequence of numbers with an odd size. The index formula assumes the values are already sorted.

data_odd <- c(7, 13, 19, 33, 67)

median_ <- median(data_odd)
print(median_)

data_odd <- c(7, 13, 19, 33, 67)
n = length(data_odd)
median_ <- data_odd[(n + 1) / 2]
print(median_)

Calculates the median of a sequence of numbers with an even size. With an even number of sorted values, the median is the average of the two middle values.

data_even <- c(2, 34, 76, 92)

median_ <- median(data_even)
print(median_)

data_even <- c(2, 34, 76, 92)
n = length(data_even)

median_ <- (data_even[n / 2] + data_even[n / 2 + 1]) / 2

print(median_)

median(mtcars$cyl)

median(mtcars$qsec)
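The two position formulas can be combined into a single function. The sketch below is one possible way to do it, not the implementation behind R's median; manual_median, odd_values, and even_values are hypothetical names introduced for this example. It sorts the input first, because the formulas index into the ordered values.

```r
# Hypothetical helper: median by the position formulas, after sorting.
manual_median <- function(x) {
  x <- sort(x)   # the formulas index into the ordered values
  n <- length(x)
  if (n %% 2 == 1) {
    x[(n + 1) / 2]                  # odd N: the single middle value
  } else {
    (x[n / 2] + x[n / 2 + 1]) / 2   # even N: average of the two middle values
  }
}

odd_values <- c(33, 7, 67, 13, 19)   # unsorted on purpose
even_values <- c(92, 2, 76, 34)

print(manual_median(odd_values))    # 19
print(manual_median(even_values))   # 55

# The helper should agree with the built-in median() on real data.
print(all.equal(manual_median(mtcars$qsec), median(mtcars$qsec)))
```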

Mode

\text{Mode} = \underset{x_i}{\operatorname{argmax}} \ f(x_i)

Creates a frequency table for a sequence of numbers.

numbers <- c(1, 233, 233, 10101, 342, 1, 2, 1111, 1, 55)

tnumbers <- table(numbers)
print(numbers)
print(tnumbers)

mode_ <- as.numeric(names(tnumbers)[tnumbers == max(tnumbers)])
print(mode_)

Identifies the most frequent value(s) in the sequence using the Mode function from the DescTools package. Mode takes the raw vector, not the frequency table.

library(DescTools)

mode_ <- Mode(numbers)
print(mode_)
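If installing a package is not an option, the frequency-table approach can be wrapped in a small base R function. This is a sketch under the assumption that the input is a numeric vector; find_modes is a hypothetical name. It returns every value tied for the highest frequency, so a multimodal sequence yields more than one result.

```r
# Hypothetical helper: return every value tied for the highest frequency.
find_modes <- function(x) {
  counts <- table(x)   # frequency of each distinct value
  as.numeric(names(counts)[counts == max(counts)])
}

print(find_modes(c(1, 233, 233, 342, 1, 2, 1)))   # 1
print(find_modes(c(1, 1, 2, 2, 3)))               # 1 2
```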

Measures of Dispersion

Defines a sequence of numbers.

n_arr = c(1,2,4,5,6)
print(n_arr)

Variance

\sigma^2 = \frac{1}{N} \sum_{i=1}^{N} (x_i - \mu)^2

Calculates the variance of a sequence of numbers.

mean_ <- mean(n_arr)

print('Mean')
print(mean_)

print('Variance')
var_ <- sum((n_arr - mean_)^2) / length(n_arr)

print((n_arr - mean_))
print((n_arr - mean_)^2)
print(sum((n_arr - mean_)^2))
print(length(n_arr))
print(var_)
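The formula above is the population variance, which divides by N. R's built-in var function computes the sample variance, which divides by N - 1, so the two results differ. The sketch below shows both on the same data; pop_var and sample_var are names chosen for this example.

```r
n_arr <- c(1, 2, 4, 5, 6)
n <- length(n_arr)

pop_var <- sum((n_arr - mean(n_arr))^2) / n   # divides by N
sample_var <- var(n_arr)                      # divides by N - 1

print(pop_var)      # 3.44
print(sample_var)   # 4.3

# Rescaling the sample variance recovers the population variance.
print(all.equal(sample_var * (n - 1) / n, pop_var))
```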

Standard Deviation

\sigma = \sqrt{\sigma^2}

Calculates the standard deviation, which is the square root of the variance.

print('Variance')
var_ <- sum((n_arr - mean_)^2) / length(n_arr)
print((n_arr - mean_))
print((n_arr - mean_)^2)
print(sum((n_arr - mean_)^2))
print(length(n_arr))
print(var_)

print('Standard Deviation')
std_ <- sqrt(var_)
print(std_)

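Calculates the standard deviation using the sd function in R. Note that sd computes the sample standard deviation (dividing by N - 1 instead of N), so its result is larger than the population value calculated manually.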

std_ <- sd(n_arr)
print(std_)
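To get the population standard deviation from a reusable function, one option is to rescale the result of sd. The sketch below assumes a numeric vector without missing values; pop_sd is a hypothetical helper, not a base R function.

```r
# Hypothetical helper: population standard deviation (divides by N).
pop_sd <- function(x) {
  n <- length(x)
  sd(x) * sqrt((n - 1) / n)   # convert from the N - 1 to the N denominator
}

data(mtcars)

print(sd(mtcars$mpg))       # sample standard deviation
print(pop_sd(mtcars$mpg))   # population standard deviation, slightly smaller
```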

Range

\text{Range} = x_{\text{max}} - x_{\text{min}}

Calculates the range, which is the difference between the maximum and minimum values.

range_ <- max(n_arr) - min(n_arr)
print('Range')
print(max(n_arr))
print(min(n_arr))
print(range_)

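Calculates the range using the range and diff functions: range returns the minimum and maximum values, and diff returns the difference between them.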

range_ <- diff(range(n_arr))
print(range_)
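The same two steps work on a dataset column. This is a small sketch applied to the mpg column of mtcars; mpg_limits and mpg_range are names chosen for illustration.

```r
data(mtcars)

mpg_limits <- range(mtcars$mpg)   # vector of length 2: minimum, then maximum
mpg_range <- diff(mpg_limits)     # maximum minus minimum

print(mpg_limits)
print(mpg_range)
```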

Coefficient of Variation

\text{CV} = \frac{\sigma}{\mu}

Calculates the coefficient of variation, which is the ratio of the standard deviation to the mean.

mean_ <- mean(n_arr)
print('Mean')
print(mean_)

print('Variance')
var_ <- sum((n_arr - mean_)^2) / length(n_arr)
print((n_arr - mean_))
print((n_arr - mean_)^2)
print(sum((n_arr - mean_)^2))
print(length(n_arr))
print(var_)

print('Standard Deviation')
std_ <- sqrt(var_)
print(std_)

print('Coefficient of Variation')
cv <- std_ / mean_
print(cv)
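Because the coefficient of variation has no unit, it can be used to compare the spread of variables measured on different scales. The sketch below wraps the calculation in a function; coef_var and its population argument are hypothetical names introduced here, and the function assumes a numeric vector with a positive mean.

```r
# Hypothetical helper: coefficient of variation.
# population = TRUE divides by N (as in the formula above);
# population = FALSE uses the sample standard deviation from sd().
coef_var <- function(x, population = TRUE) {
  n <- length(x)
  s <- sd(x)
  if (population) {
    s <- s * sqrt((n - 1) / n)
  }
  s / mean(x)
}

n_arr <- c(1, 2, 4, 5, 6)
print(coef_var(n_arr))
print(coef_var(n_arr, population = FALSE))

# Compare the relative spread of two mtcars columns with different units.
data(mtcars)
print(coef_var(mtcars$mpg))
print(coef_var(mtcars$hp))
```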
