DEV Community

Cover image for Self Study: Data Science - Machine Learning journey : Day 2 (Statistics | R | Python | Anaconda | Jupyter)
Vignesh C
Vignesh C

Posted on • Edited on

Self Study: Data Science - Machine Learning journey : Day 2 (Statistics | R | Python | Anaconda | Jupyter)

Prerequisites:

Statistics is generally considered as one of the prerequisites to study machine learning. We need statistics to help transform observations into information and to answer questions about samples of observations.

Statistics is needed in Machine Learning for..

Alt Text

Another prerequisite to data science - machine learning is a programming language - R or Python. R is used for statistical analysis to build models while Python is used beyond statistics with wide range of libraries and having better integration with other programming languages.

Applied Statistics:

Two broad categories in the field of statistics:

  1. Descriptive statistics
  2. Inferential statistics

Descriptive statistics is the process of categorizing and describing the information.

Inferential statistics includes the process of analyzing a sample of data and using it to draw inferences about the population from which it was drawn.

We need to get familiarized with all these concepts to continue our machine learning journey effectively. Most of these concepts would have been covered as part of our graduate degree.

Alt Text

Install R Studio

Install R and R Studio Desktop for your version of OS from here..

Sample R code to illustrate AUC and ROC from Day 1:

https://github.com/IamVigneshC/Machine-Learning-Data-Science/blob/master/R/ROC_AUC.R

Install Python

You can install and use python through command line or through Anaconda which come along with a tutorial, reference for various libraries.

Once installed, you shall open JupyterLab or Jupyter notebook and work on Python.

Some of my samples to get started:

https://anaconda.org/iamvigneshc

https://github.com/IamVigneshC/Machine-Learning-Data-Science/tree/master/Python

Image of Datadog

How to Diagram Your Cloud Architecture

Cloud architecture diagrams provide critical visibility into the resources in your environment and how they’re connected. In our latest eBook, AWS Solution Architects Jason Mimick and James Wenzel walk through best practices on how to build effective and professional diagrams.

Download the Free eBook

Top comments (0)

Billboard image

The Next Generation Developer Platform

Coherence is the first Platform-as-a-Service you can control. Unlike "black-box" platforms that are opinionated about the infra you can deploy, Coherence is powered by CNC, the open-source IaC framework, which offers limitless customization.

Learn more

👋 Kindness is contagious

Please leave a ❤️ or a friendly comment on this post if you found it helpful!

Okay