<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Lacey Butler</title>
    <description>The latest articles on DEV Community by Lacey Butler (@frecklefacelace).</description>
    <link>https://dev.to/frecklefacelace</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F345990%2F7e7a4979-f73e-49e3-a13d-9734a5f8780c.jpg</url>
      <title>DEV Community: Lacey Butler</title>
      <link>https://dev.to/frecklefacelace</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/frecklefacelace"/>
    <language>en</language>
    <item>
      <title>How to Get Started with Time-Series Data Analysis: Resources Roundup</title>
      <dc:creator>Lacey Butler</dc:creator>
      <pubDate>Thu, 09 Apr 2020 22:40:57 +0000</pubDate>
      <link>https://dev.to/tigerdata/how-to-get-started-with-time-series-data-analysis-resources-roundup-23o6</link>
      <guid>https://dev.to/tigerdata/how-to-get-started-with-time-series-data-analysis-resources-roundup-23o6</guid>
      <description>&lt;p&gt;Every data point we collect has a timestamp, whether it’s when a customer did something on your website, your infrastructure’s CPU utilization, or when a bus last received maintenance. &lt;/p&gt;

&lt;p&gt;At Team Timescale, we believe that &lt;strong&gt;all&lt;/strong&gt; data is time-series data, and, thus, we’ve built &lt;a href="https://www.timescale.com/products/?utm_source=dev-time-series-roundup&amp;amp;utm_medium=blog&amp;amp;utm_campaign=april-2020-advocacy&amp;amp;utm_content=products" rel="noopener noreferrer"&gt;TimescaleDB&lt;/a&gt;: a time-series database that gives you the tools and services you need to store long-term metrics and gain insight for cost-management, capacity planning, root-cause analysis, and more. &lt;/p&gt;

&lt;p&gt;...and, to help you get started with time-series data analysis, &lt;strong&gt;we’ve rounded up a few of our most popular tutorials, recommended resources, and datasets&lt;/strong&gt;. While this list isn’t exhaustive, it’s comprehensive enough to get you up and running, without leaving you saying “which one should I start with?” (disclaimer: I’m fascinated by psychology and sociology, including  &lt;a href="https://en.wikipedia.org/wiki/The_Paradox_of_Choice" rel="noopener noreferrer"&gt;the paradox of choice theory&lt;/a&gt;).&lt;/p&gt;

&lt;h2&gt;
  
  
  Tutorials &amp;amp; How-Tos
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;(1) &lt;a href="https://docs.timescale.com/latest/tutorials/tutorial-hello-timescale/?utm_source=dev-time-series-roundup&amp;amp;utm_medium=blog&amp;amp;utm_campaign=april-2020-advocacy&amp;amp;utm_content=hello-timescale" rel="noopener noreferrer"&gt;Analyzing Historical NYC Taxicab Data&lt;/a&gt; and &lt;a href="https://docs.timescale.com/latest/tutorials/tutorial-forecasting/?utm_source=dev-time-series-roundup&amp;amp;utm_medium=blog&amp;amp;utm_campaign=april-2020-advocacy&amp;amp;utm_content=forecasting-tutorial" rel="noopener noreferrer"&gt;making predictions&lt;/a&gt;:&lt;/strong&gt; this set of tutorials uses NYC taxicab data to walk you through, step-by-step, how to load data in your database and use SQL to understand past trends (like the number of rides to specific airports, ride duration, and rate). From there, you’ll ask questions and plan for the future (e.g., should NYC plan to have more taxis on the streets before or after 12am on New Year’s Eve?).&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;  Need to brush up on SQL? &lt;a href="https://mode.com/sql-tutorial/" rel="noopener noreferrer"&gt;Check out Mode Analytics’ SQL tutorial&lt;/a&gt; for beginner, intermediate, and advanced lessons that you can mix and match.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;(2) Visualizing Data in &lt;a href="https://docs.timescale.com/latest/tutorials/tutorial-grafana/?utm_source=dev-time-series-roundup&amp;amp;utm_medium=blog&amp;amp;utm_campaign=april-2020-advocacy&amp;amp;utm_content=grafana-tutorial" rel="noopener noreferrer"&gt;Grafana&lt;/a&gt; and &lt;a href="https://docs.timescale.com/latest/tutorials/visualizing-time-series-data-in-tableau/?utm_source=dev-time-series-roundup&amp;amp;utm_medium=blog&amp;amp;utm_campaign=april-2020-advocacy&amp;amp;utm_content=tableau-tutorial" rel="noopener noreferrer"&gt;Tableau&lt;/a&gt;:&lt;/strong&gt; if you’re collecting data and want to analyze it, odds are you also need visualizations: the custom dashboards, reports, graphs, and charts that make it easier to see trends and current status at a glance. Grafana is an open-source visualization tool that’s well-suited to time-series data (and Team Timescale uses it ourselves!), while Tableau is a proprietary solution that’s popular with many Timescale customers and global organizations.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;  These tutorials show you, in detail, how to turn your data into powerful visualizations that help you better understand whatever you’re collecting. You’ll see how to connect each tool to your database - TimescaleDB in this case - run queries, and build the visualizations you’re after. &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;(3) &lt;a href="http://docs.timescale.com/tutorials/tutorial-setup-timescale-prometheus/?utm_source=dev-time-series-roundup&amp;amp;utm_medium=blog&amp;amp;utm_campaign=april-2020-advocacy&amp;amp;utm_content=analyze-prom-tutorial" rel="noopener noreferrer"&gt;Use TimescaleDB to Store Prometheus Metrics Data &amp;amp; Visualize Results in Grafana&lt;/a&gt; (2 parts!):&lt;/strong&gt; &lt;a href="https://prometheus.io/" rel="noopener noreferrer"&gt;Prometheus&lt;/a&gt; is arguably the most popular open-source systems monitoring solution, and, as mentioned above, Grafana is a much-loved open-source visualization engine. If you’re looking for a lower-cost or more flexible way to store your metrics, correlate them to past trends, and plot results in team dashboards, this tutorial has you covered.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;  In this first part, you’ll learn how to set up TimescaleDB, Prometheus, and Grafana. &lt;/li&gt;
&lt;li&gt;  In the second part, you’ll learn how to &lt;span&gt;&lt;span&gt;use your metrics data to answer questions&lt;/span&gt; about infrastructure performance.&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;  Includes a sample dataset, so you can run through without setting up a full monitoring harness 🎉.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;(4) &lt;a href="https://blog.timescale.com/blog/public-dataset-tips-tricks/?utm_source=dev-time-series-roundup&amp;amp;utm_medium=blog&amp;amp;utm_campaign=april-2020-advocacy&amp;amp;utm_content=public-data-tips-blog" rel="noopener noreferrer"&gt;How to Clean (Normalize) Public Datasets&lt;/a&gt;:&lt;/strong&gt; Public datasets can help us gain insight into our businesses, our users, and our world. If you’re new to time-series data analysis, open datasets give you a pretty rad training ground: datasets range from municipal and government data and weather almanacs to tweet sentiment during the Super Bowl or new video game popularity. &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;  But, combining public datasets or joining them with your &lt;em&gt;own&lt;/em&gt; data often requires a series of steps to clean up (or “normalize”) the data. &lt;/li&gt;
&lt;li&gt;  Fear not: this short how-to walks through 2 ways to normalize datasets so you can get to the best part: analyzing and using data to better understand your world.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Videos
&lt;/h2&gt;

&lt;p&gt;The below examples use IoT and monitoring scenarios to demonstrate what time-series data analysis and visualization make possible – but there are &lt;strong&gt;many&lt;/strong&gt; other applicable use cases, from geospatial analysis to using historical data to predict sales or market trends.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;(1) &lt;a href="https://blog.timescale.com/blog/time-series-for-developers-what-the-heck-is-time-series-recap-and-resources/?utm_source=dev-time-series-roundup&amp;amp;utm_medium=blog&amp;amp;utm_campaign=april-2020-advocacy&amp;amp;utm_content=time-series-for-devs-recap-blog" rel="noopener noreferrer"&gt;Time-Series for Developers coding session recap&lt;/a&gt;&lt;/strong&gt; (&lt;a href="https://youtu.be/Tlbep7vanMg" rel="noopener noreferrer"&gt;YouTube recording&lt;/a&gt;): For a primer on just what time-series data is and what makes time-series databases different, this recap blog and session recording breaks down the fundamentals, but keeps it light on theory. You’ll learn (a) what time-series data is in practical terms and (b) the types of questions you can ask and answer with it. &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;  To demo just how powerful time-series analysis can be, Avathar (my fantastic colleague) spends 15-20 mins walking through a mock scenario, where we’re “tasked” with analyzing NYC taxicab data to find ways to cut carbon emissions, suggest routes to travelers, and more.&lt;/li&gt;
&lt;li&gt;  You can follow along with the tutorial and get the dataset &lt;a href="https://docs.timescale.com/latest/tutorials/tutorial-hello-timescale/?utm_source=dev-time-series-roundup&amp;amp;utm_medium=blog&amp;amp;utm_campaign=april-2020-advocacy&amp;amp;utm_content=hello-timescale-tutorial" rel="noopener noreferrer"&gt;here&lt;/a&gt; - it’s our “Hello, World!”-esqe example. &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;(2) &lt;a href="https://youtu.be/vohCnThNsXw" rel="noopener noreferrer"&gt;How to Analyze Your Prometheus Data in SQL 3 Queries You Need to Know&lt;/a&gt;&lt;/strong&gt; &amp;amp; &lt;strong&gt;&lt;a href="https://youtu.be/wm9T7lWCgpE" rel="noopener noreferrer"&gt;How (and why) to use TimescaleDB as a long-term store for your Prometheus metrics&lt;/a&gt;&lt;/strong&gt;: Our resident webinar and demo pro Avthar shows you how to set up a long-term store for Prometheus metrics (and why you should), write custom queries, and spin up Grafana dashboards.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;  You'll set up aggregates to rollup hourly and daily summaries of your metrics, create automated downsampling rules to keep metrics around - without wasting disk space - and see 3 common monitoring queries that you can use and customize right away.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;(3) &lt;a href="https://youtu.be/fY1DgS9e0HM" rel="noopener noreferrer"&gt;IoT Monitoring and Time Series Analysis&lt;/a&gt;&lt;/strong&gt;(YouTube): We demo'd this at AWS re:Invent 2019 to show how to connect NYC’s &lt;a href="https://new.mta.info/developers" rel="noopener noreferrer"&gt;Metropolitan Transportation Authority&lt;/a&gt; data to your database (TimescaleDB in this instance), and use special SQL functions - like time bucketing and gap filling - to monitor all buses running in NYC and display real-time results Grafana. You’ll see how to track when buses go off route, deal with missing data points, and beyond.&lt;/p&gt;

&lt;h2&gt;
  
  
  Datasets
&lt;/h2&gt;

&lt;p&gt;The above examples include a few datasets that you can use to practice your skills, but there are &lt;em&gt;thousands&lt;/em&gt; of options available. A few places to start:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;  &lt;a href="https://datasf.org/opendata/" rel="noopener noreferrer"&gt;San Francisco Open Data&lt;/a&gt; includes comprehensive time-series datasets for health and safety, city services, and much more. Similar projects exist for cities all over the world, including &lt;a href="https://opendata.cityofnewyork.us/" rel="noopener noreferrer"&gt;New York&lt;/a&gt;, &lt;a href="https://data.london.gov.uk/" rel="noopener noreferrer"&gt;London&lt;/a&gt;, &lt;a href="https://data.lacity.org/" rel="noopener noreferrer"&gt;Los Angeles&lt;/a&gt;, &lt;a href="https://data.gov.in/faqs" rel="noopener noreferrer"&gt;various Government of India entities&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;  Beyond cities and municipalities, you can find and access data about &lt;a href="https://www.ncdc.noaa.gov/cdo-web/" rel="noopener noreferrer"&gt;weather&lt;/a&gt;, &lt;a href="https://www.cryptocompare.com/" rel="noopener noreferrer"&gt;cryptocurrency&lt;/a&gt;, and the &lt;a href="https://www.census.gov/data/datasets.html" rel="noopener noreferrer"&gt;United States Census Bureau&lt;/a&gt;. &lt;/li&gt;
&lt;li&gt;  &lt;a href="https://www.kaggle.com/" rel="noopener noreferrer"&gt;Kaggle&lt;/a&gt; offers almost 20K publicly available datasets, with search and sort functionality based on your interests and programming language of choice. You’ll find everything from public education to presidential debate Twitter sentiment and video game sales - as well as tutorials and competitions for all levels. &lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Wrapping up
&lt;/h2&gt;

&lt;p&gt;Time-series data allows us to better understand our world, whatever that means to you (be it your open source project community, your business operations, or the global population writ large). &lt;/p&gt;

&lt;p&gt;🤞I hope this list gives you a place to start or brush up on existing skills. If I've piqued your curiosity and you’d like to connect with like-minded folks, I invite you to &lt;strong&gt;&lt;a href="https://www.timescale.com/meetups/datapub/?utm_source=dev-time-series-roundup&amp;amp;utm_medium=blog&amp;amp;utm_campaign=april-2020-advocacy&amp;amp;utm_content=data-pub-invite" rel="noopener noreferrer"&gt;join us for DataPub - a virtual meetup for data enthusiasts&lt;/a&gt;&lt;/strong&gt;. Our next meeting will be on Tuesday, April 21st at 1pm PT/ 4pm ET/ 8pm GMT. &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;  You’ll hear from community members about their projects, ways developers are using open data to build amazing applications, and most, importantly, get a chance to ask questions and share your own. &lt;/li&gt;
&lt;li&gt;  Everyone’s welcome, so invite friends, family, colleagues, or your pets!&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Can’t make this month’s session?  You’ll receive a recording and follow-up even if you’re unable to attend (we know things come up). &lt;/p&gt;

&lt;p&gt;You can also &lt;strong&gt;&lt;a href="https://www.timescale.com/signup/newsletter/?utm_source=dev-time-series-roundup&amp;amp;utm_medium=blog&amp;amp;utm_campaign=april-2020-advocacy&amp;amp;utm_content=newsletter-subscribe" rel="noopener noreferrer"&gt;subscribe to our newsletter&lt;/a&gt;&lt;/strong&gt;to learn about next month’s date and guest speakers. &lt;/p&gt;

</description>
      <category>beginners</category>
      <category>sql</category>
      <category>database</category>
      <category>postgres</category>
    </item>
  </channel>
</rss>
