<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: kinuthia john</title>
    <description>The latest articles on DEV Community by kinuthia john (@kinuthia_john_).</description>
    <link>https://dev.to/kinuthia_john_</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1857886%2Fd298caa0-0e7e-410b-86a8-e82bca57af2a.png</url>
      <title>DEV Community: kinuthia john</title>
      <link>https://dev.to/kinuthia_john_</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/kinuthia_john_"/>
    <language>en</language>
    <item>
      <title>Understanding Your Data: The Essentials of Exploratory Data Analysis</title>
      <dc:creator>kinuthia john</dc:creator>
      <pubDate>Sun, 11 Aug 2024 17:14:19 +0000</pubDate>
      <link>https://dev.to/kinuthia_john_/understanding-your-data-the-essentials-of-exploratory-data-analysis-573n</link>
      <guid>https://dev.to/kinuthia_john_/understanding-your-data-the-essentials-of-exploratory-data-analysis-573n</guid>
      <description>&lt;p&gt;Exploratory Data Analysis commonly known as EDA is a crucial step in data analysis as it helps examine and explore the characteristics of a dataset and through this, we are able to gain an understanding of the data&lt;br&gt;
&lt;strong&gt;This step has different processes which are:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt; Data visualization which is the physical representation of the dataset using graphs, charts, and other forms of visualization. This helps us to identify trends in the data and how data in different columns relates to each other. Python Libraries such as seaborn and Matplotlib are greatly used to visualize the data.
&lt;/li&gt;
&lt;/ul&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;sns&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;displot&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;x&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;Temp_C&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;sns&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;displot&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;x&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;Dew_Point_Temp_C&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="n"&gt;kde&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="bp"&gt;True&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The snippet above helps us create bar graphs to visualize the temperature and dew point temperature columns.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
Descriptive analysis of the data is the statistical analysis of the different columns to get values such as the mean, standard deviation, range, the max and minimum values of the columns.This is made possible by the following code:
&lt;/li&gt;
&lt;/ul&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;describe&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;include&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;all&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Cleaning the dataset- removal of errors in the dataset is done in this stage. It helps us deal with missing and inconsistent values thus making it easier for us to get accurate information from the data.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Identifying and removing outliers-outliers are anomalous data points in the dataset and they deviate significantly from the other values.It is important to identify them because they can make it difficult to draw meaningful conclusions as they skew the results.There are various methods to identify outliers and the most common one is the box model.Points outside the box are considered as outliers and need to be removed. &lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Understanding the distribution of data in the dataset-This helps to determine the statistical methods to be used in analysis.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In conclusion, EDA is very important as it helps us to get more information and identify patterns from the data and this helps us in making accurate choices in the statistical methods used in analysis.&lt;/p&gt;

</description>
    </item>
    <item>
      <title>Data science:Wanna master the art of storytelling using data?</title>
      <dc:creator>kinuthia john</dc:creator>
      <pubDate>Sun, 04 Aug 2024 16:52:58 +0000</pubDate>
      <link>https://dev.to/kinuthia_john_/data-sciencewanna-master-the-art-of-storytelling-using-data-3171</link>
      <guid>https://dev.to/kinuthia_john_/data-sciencewanna-master-the-art-of-storytelling-using-data-3171</guid>
      <description>&lt;h2&gt;
  
  
  I know we all have come across the term data science, but have you ever thought about what it is and its impacts on the tech world?
&lt;/h2&gt;




&lt;p&gt;Well, today we'll dive deeper into what the field of data science entails&lt;/p&gt;

&lt;h3&gt;
  
  
  What is data science?
&lt;/h3&gt;

&lt;p&gt;Data science is a tech-related field that uses various scientific methods to identify a problem in society, analyze it, collect meaningful data concerning the domain, clean the data, and structure it so that we can get some meaningful insights and knowledge to solve the specific problem.&lt;br&gt;
&lt;strong&gt;Data analyst&lt;/strong&gt;&lt;br&gt;
This person specializes in the analysis of complex datasets from various sources and can extract information that can be used to make decisions in an organization.&lt;/p&gt;

&lt;h3&gt;
  
  
  What it takes to be a data scientist
&lt;/h3&gt;




&lt;p&gt;The following are among the few key skill requirements for having a successful data science career:&lt;/p&gt;

&lt;h4&gt;
  
  
  Basic computer science knowledge
&lt;/h4&gt;

&lt;p&gt;To be a data scientist, you need to have the fundamental knowledge of computer science which helps you to understand various concepts such as core maths, computer basics, database management and artificial intelligence.&lt;/p&gt;

&lt;h4&gt;
  
  
  programming
&lt;/h4&gt;

&lt;p&gt;One also needs to understand programming languages such as Python, R and SQL which are mostly used by data scientists in their day-to-day life. The knowledge is practically applied in data manipulation and performing complex mathematical functions by use of simple libraries such as numpy in python.&lt;/p&gt;

&lt;h4&gt;
  
  
  Data collection and preparation
&lt;/h4&gt;

&lt;p&gt;A data scientist must have solid skills in collecting and cleaning the collected data for easier analysis.This skill is key because clean data is easier to work with and also make accurate decisions from the data.&lt;/p&gt;

&lt;h4&gt;
  
  
  Data examining
&lt;/h4&gt;

&lt;p&gt;The collected data should be examined to understand its flow and the overall characteristics which helps the data scientist to understand his data well.&lt;/p&gt;

&lt;h4&gt;
  
  
  Data analysis
&lt;/h4&gt;

&lt;p&gt;In order to get insights from data,some data analysis skills must be applied by the scientist so as deep knowledge of statistics and mathematics is required.&lt;/p&gt;

&lt;h4&gt;
  
  
  Data modeling
&lt;/h4&gt;

&lt;p&gt;Data modeling is the process of fitting and developing machine learning models that can predict possible outcomes and future patterns from the data. This requires one to have skills in the usage of various machine-learning models based on the type of outcome they need and the type of data present.&lt;/p&gt;

&lt;h4&gt;
  
  
  Data visualization
&lt;/h4&gt;

&lt;p&gt;Visualization skills are key as they help us create visual presentations of findings and insights gained from the data. The tools used include Tableau and powerbi.&lt;/p&gt;

&lt;p&gt;These are just but a few skills required to get you started in the field. Remember that the field of data is very wide and one requires to practice and learn daily so as to get better and always remember that &lt;strong&gt;Rome was never built in one day&lt;/strong&gt;.&lt;/p&gt;

</description>
    </item>
    <item>
      <title>Data science:Wanna master the art of storytelling using data?</title>
      <dc:creator>kinuthia john</dc:creator>
      <pubDate>Sun, 04 Aug 2024 16:52:56 +0000</pubDate>
      <link>https://dev.to/kinuthia_john_/data-sciencewanna-master-the-art-of-storytelling-using-data-56od</link>
      <guid>https://dev.to/kinuthia_john_/data-sciencewanna-master-the-art-of-storytelling-using-data-56od</guid>
      <description>&lt;h2&gt;
  
  
  I know we all have come across the term data science, but have you ever thought about what it is and its impacts on the tech world?
&lt;/h2&gt;




&lt;p&gt;Well, today we'll dive deeper into what the field of data science entails&lt;/p&gt;

&lt;h3&gt;
  
  
  What is data science?
&lt;/h3&gt;

&lt;p&gt;Data science is a tech-related field that uses various scientific methods to identify a problem in society, analyze it, collect meaningful data concerning the domain, clean the data, and structure it so that we can get some meaningful insights and knowledge to solve the specific problem.&lt;br&gt;
&lt;strong&gt;Data analyst&lt;/strong&gt;&lt;br&gt;
This person specializes in the analysis of complex datasets from various sources and can extract information that can be used to make decisions in an organization.&lt;/p&gt;

&lt;h3&gt;
  
  
  What it takes to be a data scientist
&lt;/h3&gt;




&lt;p&gt;The following are among the few key skill requirements for having a successful data science career:&lt;/p&gt;

&lt;h4&gt;
  
  
  Basic computer science knowledge
&lt;/h4&gt;

&lt;p&gt;To be a data scientist, you need to have the fundamental knowledge of computer science which helps you to understand various concepts such as core maths, computer basics, database management and artificial intelligence.&lt;/p&gt;

&lt;h4&gt;
  
  
  programming
&lt;/h4&gt;

&lt;p&gt;One also needs to understand programming languages such as Python, R and SQL which are mostly used by data scientists in their day-to-day life. The knowledge is practically applied in data manipulation and performing complex mathematical functions by use of simple libraries such as numpy in python.&lt;/p&gt;

&lt;h4&gt;
  
  
  Data collection and preparation
&lt;/h4&gt;

&lt;p&gt;A data scientist must have solid skills in collecting and cleaning the collected data for easier analysis.This skill is key because clean data is easier to work with and also make accurate decisions from the data.&lt;/p&gt;

&lt;h4&gt;
  
  
  Data examining
&lt;/h4&gt;

&lt;p&gt;The collected data should be examined to understand its flow and the overall characteristics which helps the data scientist to understand his data well.&lt;/p&gt;

&lt;h4&gt;
  
  
  Data analysis
&lt;/h4&gt;

&lt;p&gt;In order to get insights from data,some data analysis skills must be applied by the scientist so as deep knowledge of statistics and mathematics is required.&lt;/p&gt;

&lt;h4&gt;
  
  
  Data modeling
&lt;/h4&gt;

&lt;p&gt;Data modeling is the process of fitting and developing machine learning models that can predict possible outcomes and future patterns from the data. This requires one to have skills in the usage of various machine-learning models based on the type of outcome they need and the type of data present.&lt;/p&gt;

&lt;h4&gt;
  
  
  Data visualization
&lt;/h4&gt;

&lt;p&gt;Visualization skills are key as they help us create visual presentations of findings and insights gained from the data. The tools used include Tableau and powerbi.&lt;/p&gt;

&lt;p&gt;These are just but a few skills required to get you started in the field. Remember that the field of data is very wide and one requires to practice and learn daily so as to get better and always remember that &lt;strong&gt;Rome was never built in one day&lt;/strong&gt;.&lt;/p&gt;

</description>
    </item>
  </channel>
</rss>
