<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Auralia Malik</title>
    <description>The latest articles on DEV Community by Auralia Malik (@taurus).</description>
    <link>https://dev.to/taurus</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F975690%2F56f6e177-8852-4e06-8420-3bdf02e855b7.jpeg</url>
      <title>DEV Community: Auralia Malik</title>
      <link>https://dev.to/taurus</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/taurus"/>
    <language>en</language>
    <item>
      <title>Data Engineering for Beginners: A Step-by-Step Guide</title>
      <dc:creator>Auralia Malik</dc:creator>
      <pubDate>Wed, 08 Nov 2023 19:03:46 +0000</pubDate>
      <link>https://dev.to/taurus/data-engineering-for-beginners-a-step-by-step-guide-187</link>
      <guid>https://dev.to/taurus/data-engineering-for-beginners-a-step-by-step-guide-187</guid>
      <description>&lt;p&gt;Before delving into the role and responsibilities of a data engineer, it's beneficial to first explore the various career paths within the data science field. Do you have a clear vision of your desired data science career, or are you still contemplating your specialization? What drives your passion in the data science domain, and what aspects pique your curiosity? If you find these questions somewhat perplexing, don't fret; we will methodically address each of these inquiries. It's common to feel bewildered when you discover the multitude of career options within this expansive field. Does this resonate with you? Let's commence our exploration.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Data Scientist:&lt;/strong&gt; Data scientists are responsible for collecting, cleaning, and analyzing large datasets to extract valuable insights and make data-driven decisions. They use various machine learning and statistical techniques to build predictive models and solve complex problems.Data scientists often work closely with business stakeholders to identify opportunities for leveraging data to drive business growth.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Data Analyst:&lt;/strong&gt; Data analysts focus on examining data to provide actionable insights to their organizations. They perform data cleaning, data visualization, and basic statistical analysis to help businesses understand trends, patterns, and make informed decisions.&lt;br&gt;
Data analysts may work in various industries such as finance, marketing, or healthcare.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Data Engineer:&lt;/strong&gt; Data engineers are responsible for the design, construction, and maintenance of data pipelines and infrastructure. They ensure that data is collected, stored, and made accessible for analysis by data scientists and analysts.&lt;br&gt;
Data engineers work with tools like Hadoop, Spark, and databases to manage and process large volumes of data efficiently.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Data Architect:&lt;/strong&gt; Data architects design the overall structure and organization of data within an organization. They create data models, define data standards, and ensure data is stored, integrated, and accessed effectively.&lt;br&gt;
Data architects play a critical role in establishing data governance and ensuring data quality.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Machine Learning Engineers&lt;/strong&gt;: Machine Learning Engineers are responsible for designing, building, and deploying machine learning models and systems. Their primary focus is on developing algorithms and systems that can learn from and make predictions or decisions based on data. &lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;The field of data science is continually evolving, so there are always new opportunities and roles emerging as technology advances and businesses become more data-driven. It's important to choose a path that aligns with your interests and career goals.&lt;/p&gt;

&lt;p&gt;Let's explore the skills and qualifications required for a career as a data engineer, focusing on their role in ensuring the collection, storage, and accessibility of data for analysis by data scientists and analysts&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;ETL (Extract, Transform, Load)&lt;/strong&gt;: Expertise in ETL processes to extract data from various sources, transform it into the desired format, and load it into data storage or data warehouses.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Data Modeling&lt;/strong&gt;: Ability to create efficient and scalable data models for databases and data warehouses.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Knowledge of distributed systems&lt;/strong&gt; like Hadoop and Spark as well as cloud computing platforms such as Azure and AWS&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Data analysis&lt;/strong&gt;: Understanding of analytics software, specifically Apache Hadoop-based solutions like MapReduce, Hive, Pig and HBase. A primary focus for engineers is to build systems that gather information for use by other analysts or scientists. Having strong analysis skills yourself can help you create such systems and improve them.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;You’ll also need to familiarize yourself with various data-related programs and languages. Here are some known ones:&lt;/p&gt;

&lt;p&gt;Python&lt;br&gt;
Apache Hadoop and Apache Spark&lt;br&gt;
SQL&lt;br&gt;
Amazon Web Services&lt;br&gt;
Azure&lt;/p&gt;

&lt;p&gt;Remember that learning at your own pace and gaining experience over time is totally normal as you make your way through data engineering. You're well on your way to a meaningful and gratifying career in data engineering if you have determination, perseverance, and the willingness to adapt to the changing landscape. Data engineering, with its ever-expanding potential and problems, awaits your exploration and contribution.&lt;/p&gt;

&lt;p&gt;Leave a comment, I'd love to hear from you guys.&lt;/p&gt;

</description>
      <category>dataengineering</category>
      <category>datascience</category>
      <category>roadmap</category>
      <category>roadmaptodataengineering</category>
    </item>
    <item>
      <title>The Complete Guide to Time Series Models</title>
      <dc:creator>Auralia Malik</dc:creator>
      <pubDate>Mon, 23 Oct 2023 10:19:36 +0000</pubDate>
      <link>https://dev.to/taurus/the-complete-guide-to-time-series-models-3h38</link>
      <guid>https://dev.to/taurus/the-complete-guide-to-time-series-models-3h38</guid>
      <description>&lt;p&gt;As we journey through the intricate landscape of time series modeling, it becomes apparent that our quest for understanding is evolving gradually. Let us embark on a comprehensive exploration of time series modeling, and I eagerly await your feedback.&lt;br&gt;
&lt;strong&gt;Defining Time Series&lt;/strong&gt;: A time series is a collection of data points meticulously recorded over time. It's a window into the past that offers us the opportunity to scrutinize the ever-shifting influences on specific variables across distinct time intervals.&lt;br&gt;
&lt;strong&gt;Use &amp;amp; Importance of Time series analysis&lt;/strong&gt; : Helps organizations understand the underlying causes of trends or systemic patterns over time. Using data visualizations, business users can see seasonal trends and dig deeper into why these trends occur. With modern analytics platforms, these visualizations can go far beyond line graphs.&lt;br&gt;
&lt;strong&gt;Type of Time- Series models&lt;/strong&gt;: There are various types of time-series forecasting models that help in predicting future values. &lt;/p&gt;

&lt;p&gt;Time series models exhibit distinct characteristics: &lt;br&gt;
&lt;strong&gt;Trend - T(t):&lt;/strong&gt; a long-term upward or downward change in the average value.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Seasonality - S(t):&lt;/strong&gt; a periodic change to the value that follows an identifiable pattern.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Residual  R(t):&lt;/strong&gt; random fluctuations in the time series data that does not follow any patterns. &lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Types of time-series models are:&lt;/strong&gt; &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;ARMA(Autoregressive + Moving Average)&lt;/strong&gt;: it is a combination of two parts - the model with p autoregressive terms and q moving-average terms. This model contains the AR(p) and MA(q) models.The combination of said models (ARMA) are linear models that work off of an assumption of a stationary input. Under this assumption, they can be used to predict a future occurrence based on previous observations if suitably defined.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--78QFVgYo--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/hc32oo058esnglk6kkjd.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--78QFVgYo--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/hc32oo058esnglk6kkjd.png" alt="Image description" width="800" height="416"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;ARIMA(Autoregressive Integrated Moving Average)&lt;/strong&gt; : It extends from ARMA model and incorporates the integrated component (inverse of differencing).Its used as a forecasting tool to predict how something will act in the future based on past performance. It is used in technical analysis to predict an asset's future performance. &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--CNgD9KrN--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/8ih91wl8v4vhm42wtsxp.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--CNgD9KrN--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/8ih91wl8v4vhm42wtsxp.jpg" alt="Image description" width="800" height="426"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;SARIMA(Seasonal Auto-Regressive Integrated Moving Average)&lt;/strong&gt;: An extension of the ARIMA which addresses the periodic pattern observed in the time series. They are specifically designed to handle data with seasonal patterns.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--dGbOOGah--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/esvp8ornqw5vm3mx1ok8.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--dGbOOGah--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/esvp8ornqw5vm3mx1ok8.png" alt="Image description" width="800" height="396"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;In conclusion, time series modeling equips us with the knowledge to unearth insights, make informed decisions, and forecast future trends. Whether you're a data scientist, a business analyst, or simply someone fascinated by the compelling world of data, this can help you with understanding what time series models are.&lt;/p&gt;

&lt;p&gt;Kindly share your thoughts and insights.&lt;/p&gt;

</description>
      <category>timeseriesm</category>
      <category>machinelearning</category>
      <category>datascience</category>
      <category>dataengineering</category>
    </item>
    <item>
      <title>Exploratory Data Analysis using Data Visualization Techniques.</title>
      <dc:creator>Auralia Malik</dc:creator>
      <pubDate>Fri, 13 Oct 2023 15:32:15 +0000</pubDate>
      <link>https://dev.to/taurus/exploratory-data-analysis-using-data-visualization-techniques-100j</link>
      <guid>https://dev.to/taurus/exploratory-data-analysis-using-data-visualization-techniques-100j</guid>
      <description>&lt;p&gt;Something that's so exciting when it comes to data science is making your hands dirty!, not literally but you get the drill, right? I hope so. Exploratory Data Analysis sounds like such a big scary statement but its not, it literally means exploring data, analysing the dataset that you have. But you see, inasmuch as it sounds so easy, it doesn't mean you skip it. Because if you do, you'll be shown "shege" in short, you will cry till the end. &lt;/p&gt;

&lt;p&gt;As a data scientist/ analyst/ engineer, this is the part that brings us all together, because we need to pass here to go to any next step. EDA in short, includes data preprocessing and data visualization basically. For you to get to visualization, you need to do the preprocessing first. Don't be scared, it's actually very fun. &lt;br&gt;
Now let's dive into it, Shall we?&lt;br&gt;
EDA's importance to a datascientist is: It helps in understanding the data being used better and helps in identifying any outliers.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Bar Plots&lt;/strong&gt;: The most used and common plots that are used when visualizing data. I know you've probably come across barplots in Microsoft Excel. They are also applicable here. However their main purpose is to identify categorical values within a dataset. Categorical values are commonly to describe attributes in a dataset.&lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--_2edB0mQ--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/xou99pnac88qxmisrdpc.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--_2edB0mQ--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/xou99pnac88qxmisrdpc.jpg" alt="Image description" width="800" height="411"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Histograms&lt;/strong&gt;: These are also commonly used in data visualization and they usually help in identifying numerical values in a dataset.&lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--XJeyVOeF--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/0gvvb7dls4t2yanvupix.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--XJeyVOeF--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/0gvvb7dls4t2yanvupix.png" alt="Image description" width="264" height="280"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Box Plots&lt;/strong&gt;: These are the very important in data visualization as they help in identifying outliers in a dataset. For an easier way to look for anomalies in your dataset, always count on boxplots to come through for you. &lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--7uCbx3bL--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/3kqw759863x18cfrfb6q.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--7uCbx3bL--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/3kqw759863x18cfrfb6q.png" alt="Image description" width="392" height="252"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Scatter plots&lt;/strong&gt; : As the name suggests, they have a scattered pattern on them and they are commonly used to identify correlations within the dataset. They also help in identifying outliers. Count on them to do a good job.&lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--DHZvNCI5--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/hc3v2dhwyt72lha34nwk.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--DHZvNCI5--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/hc3v2dhwyt72lha34nwk.png" alt="Image description" width="498" height="293"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Heatmaps&lt;/strong&gt; : They don't necessarily have heat in them as the name suggests, but the pattern kinda reveal heat, you gerrit. If not, here is an example of a dataset I'm working on at the moment &lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--RuX6ZnlG--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/vf0ooskequ60lh8blj60.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--RuX6ZnlG--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/vf0ooskequ60lh8blj60.png" alt="Image description" width="800" height="513"&gt;&lt;/a&gt; You now, get it?&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Word Clouds&lt;/strong&gt;: As from Google's definition, an image composed of words used in a particular text or subject, in which the size of each word indicates its frequency or importance.&lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--sNQ-eVVI--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/ozlzjxoxfy1zlvt7lipg.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--sNQ-eVVI--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/ozlzjxoxfy1zlvt7lipg.jpg" alt="Image description" width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;In my opinion these are the types of plots you'll encounter or use when visualizing data. The others include line plots, pie charts and pair plots, among others. You'll definitely find a use for each of them as you navigate the field of data science. My intention here is to provide some guidance on the ways to visualize data as you progress in this space.&lt;/p&gt;

&lt;p&gt;Remember, exploratory data analysis (EDA) is not about visualizing data; it's about telling a story bringing data to life and gaining insights from your analysis. Keep that in mind. Anyway that's all from me for now. I hope you found my article on EDA techniques interesting. If you did feel free to leave a like or comment as feedback.. If theres anything I may have missed or if you have any suggestions, for improvement please let me know! It helps me grow and learn. Alright then Bye!&lt;/p&gt;

</description>
      <category>eventdriven</category>
      <category>analysis</category>
      <category>python</category>
      <category>data</category>
    </item>
    <item>
      <title>Data Science for Beginners: 2023 - 2024 Complete Roadmap.</title>
      <dc:creator>Auralia Malik</dc:creator>
      <pubDate>Sat, 30 Sep 2023 19:49:50 +0000</pubDate>
      <link>https://dev.to/taurus/data-science-for-beginners-2023-2024-complete-roadmap-3537</link>
      <guid>https://dev.to/taurus/data-science-for-beginners-2023-2024-complete-roadmap-3537</guid>
      <description>&lt;p&gt;Hello there, fellow data enthusiast! I'm thrilled that your curiosity and interest in data science led you to my article. Now, let me ask you a question: Why data science? For me, it was a winding journey through the tech world until I discovered my passion for handling data. Little did I know that data science would become my calling. It's an exhilarating field, but it does require a healthy dose of curiosity and a gentle nudge to truly excel.&lt;/p&gt;

&lt;p&gt;But enough with the chitchat; let's dive right in!&lt;/p&gt;

&lt;p&gt;Is there a roadmap to navigate the vast landscape of data science? You bet there is!&lt;/p&gt;

&lt;p&gt;Just as in life, every journey has its path, and data science is no exception. Before embarking on this exciting journey, there are a few crucial things you need to know. Neglect them, and you might find your interest waning faster than you'd expect.&lt;/p&gt;

&lt;p&gt;Imagine yourself on an exciting road trip with data as your reliable co-pilot in the field of data science. Hold on tight as we explore some crucial stops on the path to data-driven triumph.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Learning Python/ R Programming Languages&lt;/li&gt;
&lt;li&gt;Getting Around in Statistics&lt;/li&gt;
&lt;li&gt;SQL Querying Techniques&lt;/li&gt;
&lt;li&gt;Data Visualization 
5.Uncovering the Mysteries of Machine Learning
6.Cloud computing &lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;There are a few things to consider before going on this quest. Develop a strong sense of curiosity and a strong drive to learn.&lt;br&gt;
As for the rest, consider it part of the thrilling adventure, waiting to be discovered as you navigate this exciting road.&lt;/p&gt;

</description>
      <category>python</category>
      <category>datascience</category>
      <category>roadmap</category>
    </item>
  </channel>
</rss>
