<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Santosh Tirunagari</title>
    <description>The latest articles on DEV Community by Santosh Tirunagari (@tsantosh7).</description>
    <link>https://dev.to/tsantosh7</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F177721%2F4a468a6f-069f-400f-a81a-164e062806d8.jpeg</url>
      <title>DEV Community: Santosh Tirunagari</title>
      <link>https://dev.to/tsantosh7</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/tsantosh7"/>
    <language>en</language>
    <item>
      <title>Must have Chrome extensions for Data Scientists</title>
      <dc:creator>Santosh Tirunagari</dc:creator>
      <pubDate>Tue, 09 Jul 2019 21:23:09 +0000</pubDate>
      <link>https://dev.to/tsantosh7/must-have-chrome-extensions-for-data-scientists-249d</link>
      <guid>https://dev.to/tsantosh7/must-have-chrome-extensions-for-data-scientists-249d</guid>
      <description>&lt;p&gt;Browser extensions are software programs that enable you to tailor/add more functionality to your web browser, without altering any of the native code. The success of Google Chrome (with ~ 62.7% browser market share) lies in its extensibility. You can have a plugin or extension for just about everything you may ever possibly want.&lt;/p&gt;

&lt;p&gt;Here are the must-have chrome extensions for data scientists to use in their daily workflows.&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;a href="https://chrome.google.com/webstore/detail/web-scraper/jnhgnonknehpejjnehehllkliplmbmhn?hl=en"&gt;Web Scraper&lt;/a&gt;
&lt;/h2&gt;

&lt;p&gt;Data is what data scientist rely upon for building their models. State and the volume of data are often very crucial in determining the prediction accuracy. Lack of sufficient data (sparse) or learning from poor data source would result in overly simplistic predictions/solutions. Provided with the right tools to procure and analyse, data scientists can take leverage on petabytes of data available on the web. However, in reality, the data extraction from these sources often requires scripts to scrape. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--wFuLAONH--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/v2vc3i6079huuq7rdrd2.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--wFuLAONH--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/v2vc3i6079huuq7rdrd2.jpg" alt=""&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://chrome.google.com/webstore/detail/web-scraper/jnhgnonknehpejjnehehllkliplmbmhn?hl=en"&gt;Web Scraper&lt;/a&gt; is a tool which can extract text, numbers, content from a webpage. Their Chrome browser extension is built, especially for data extraction from web pages in minutes and for free. Using this extension, you can create a plan (sitemap) on how we can traverse a web site and what to extract. Using these sitemaps, the &lt;a href="https://chrome.google.com/webstore/detail/web-scraper/jnhgnonknehpejjnehehllkliplmbmhn?hl=en"&gt;Web Scraper&lt;/a&gt; then navigates the website accordingly and extract relevant data. Scraped data later can be exported as CSV. The core features of &lt;a href="https://chrome.google.com/webstore/detail/web-scraper/jnhgnonknehpejjnehehllkliplmbmhn?hl=en"&gt;Web Scraper&lt;/a&gt; that places this extension to the must-have list are the ability to scrape multiple pages, various data selection types, data extraction from dynamic pages (JavaScript+AJAX) and it's export capabilities.&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;a href="https://chrome.google.com/webstore/detail/decs-code-snippets-manage/mkclebnkdjjpamialfgieminnkdepbcl?hl=en"&gt;DECS&lt;/a&gt;
&lt;/h2&gt;

&lt;p&gt;We all crawl the web in search of code. Data scientist often rely on scripts to perform and automate repetitive tasks. How many times have we painfully repeated steps to land on that same piece of code again? These scripts/code snippets typically used for/in pre-processing or machine learning (classification or regression or clustering) pipelines usually remain the same. So rather than remembering the complete details about a specific pipeline and re-writing the same code across several experiments/applications, it helps to reuse, refactor, repurpose and review the code snippets. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--ojS_HvNY--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/a31v4rj12eta3ng2k0cg.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--ojS_HvNY--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/a31v4rj12eta3ng2k0cg.png" alt=""&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://chrome.google.com/webstore/detail/decs-code-snippets-manage/mkclebnkdjjpamialfgieminnkdepbcl?hl=en"&gt;DECS&lt;/a&gt; is a decentralised and end-to-end encrypted tool for managing code snippets. There are a few code snippets managers available, but data end-to-end encryption and one-click code copy feature set it apart from the rest. A code snippet is intellectual property, and it's essential that modern tools give more importance to privacy and user data ownership ( given a choice I wouldn't prefer to store my code on someone else's cloud/server where they could look into the data).  With &lt;a href="https://chrome.google.com/webstore/detail/decs-code-snippets-manage/mkclebnkdjjpamialfgieminnkdepbcl?hl=en"&gt;DECS&lt;/a&gt; - Code snippets manager, you can save the code snippets fully encrypted (on their server or anywhere you would want to), and use it across several projects. &lt;/p&gt;

&lt;p&gt;Since the data is encrypted, you can store your configs, including the API keys, and is always just a search away, saving a lot of time, effort and money. To add cream on top, with &lt;a href="https://chrome.google.com/webstore/detail/decs-code-snippets-manage/mkclebnkdjjpamialfgieminnkdepbcl?hl=en"&gt;DECS&lt;/a&gt; browser extension, using just a single click, you can capture the snippets that catch your eye (anywhere on a webpage including Stack Overflow) and store them forever for future use.&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;a href="https://chrome.google.com/webstore/detail/diigo-web-collector-captu/pnhplgjpclknigjpccbcnmicgcieojbh?hl=en"&gt;Diigo Web Collector&lt;/a&gt;
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://chrome.google.com/webstore/detail/diigo-web-collector-captu/pnhplgjpclknigjpccbcnmicgcieojbh?hl=en"&gt;Diigo&lt;/a&gt; is a useful extension for annotating, archiving and bookmarking webpages. With this easy-to-use tool, you can bookmark links to archive webpages or read later. Several new papers related to data science on Arxiv, Medium blogs, can be bookmarked as well as can be annotated with highlights &amp;amp; stickies for essential notes.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--4pivckt5--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/2f5my1go4ziz4xcz7xnm.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--4pivckt5--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/2f5my1go4ziz4xcz7xnm.png" alt=""&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The annotations can also be shared via social media for e.g., via Twitter and Linkedin.  Moreover, it can be accessed anywhere, via iPhone, iPad (&lt;a href="https://apps.apple.com/gb/app/diigo-your-learning-simplified/id933773981"&gt;Appstore&lt;/a&gt;), Android (&lt;a href="https://play.google.com/store/apps/details?id=com.diigo.android"&gt;Playstore&lt;/a&gt;). &lt;/p&gt;

&lt;p&gt;All in all, this chrome extension, can be used to create bookmarks, groups to pool findings, share resources and finally for curating content, especially for building your training data.&lt;/p&gt;

&lt;p&gt;These are the three must-have chrome extensions for data scientists to use in their daily workflows. What do you think of the list? Let me know if you have any other chrome extensions that are super useful for a data scientist.&lt;/p&gt;

</description>
      <category>chromeextensions</category>
      <category>motivation</category>
      <category>productivity</category>
      <category>datascientists</category>
    </item>
    <item>
      <title>5 Tools for Data Scientist: Getting started with programming</title>
      <dc:creator>Santosh Tirunagari</dc:creator>
      <pubDate>Sat, 08 Jun 2019 12:11:35 +0000</pubDate>
      <link>https://dev.to/tsantosh7/5-tools-for-data-scientist-getting-started-with-programming-1pjj</link>
      <guid>https://dev.to/tsantosh7/5-tools-for-data-scientist-getting-started-with-programming-1pjj</guid>
      <description>&lt;p&gt;Data Science is a multidisciplinary blend of data analysis, algorithm development, and technology to solve analytically complex data-driven problems. Data Scientists are a new class of analytical data experts who have the technical skills to solve these complex data problems and is undoubtedly the sexiest job of the 21st century. The typical tasks of a data scientist comprise of data collection, data pre-processing, and data analysis. &lt;/p&gt;

&lt;p&gt;Previously, there were not that many tools to assist data scientists, and they relied mostly upon writing their custom code. However, during the past five years, the landscape had changed so much that most of these tasks/routines have been automated using various tools, and there are dozens and dozens of them. However, data scientists often seek out new tools that help them solve and find answers to their complex data problems. For this very reason, many data scientists consider programming knowledge an integral part of data science. Not all data scientist can code, however, it is helpful to be aware of tools that can assist and organize programming. &lt;/p&gt;

&lt;p&gt;Here is an attempt to list the top 5 must-have tools for a data scientist, in no particular order. This list is more focused on data scientists who would like to get their hands dirty with code. &lt;/p&gt;

&lt;p&gt;The tools below would guide you from setting up, actually coding, version control, organising the code and finally exposing your predictive models to the real world.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;a href="https://www.anaconda.com/distribution/"&gt;Anaconda&lt;/a&gt; / Anaconda Navigator (Windows)
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--ZGwCa1n---/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/x837pvzezeohqxjq355s.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--ZGwCa1n---/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/x837pvzezeohqxjq355s.png" alt="Anaconda Navigator (Windows)"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Anaconda is a free and open-source distribution of the Python and R programming languages for scientific computing (data science, machine learning applications, large-scale data processing, predictive analytics, etc.), that aims to simplify package management and deployment.&lt;/p&gt;

&lt;p&gt;Previously, the academic world inclined heavily towards using Matlab. However, due to the advancements, the Python has witnessed recently, and thanks to the open-source community, the wave has now shifted towards the Python ecosystem. Anaconda Navigator is a desktop graphical user interface (GUI) included in Anaconda distribution that allows users to launch applications and manage Conda packages, environments and channels without using command-lines and is no doubt, a must-have programming tool to get started with Python programming in no time.&lt;/p&gt;

&lt;p&gt;Installation of Anaconda-Navigator comes with many packages such as: &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Jupyter Notebook&lt;/li&gt;
&lt;li&gt;Spyder&lt;/li&gt;
&lt;li&gt;Glueviz&lt;/li&gt;
&lt;li&gt;Orange&lt;/li&gt;
&lt;li&gt;Rstudio&lt;/li&gt;
&lt;li&gt;Visual Studio - Code&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Now that you have set up everything you need to get started with writing the code. This next tool would help you code and see the outcome in real time.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Pricing: FREE&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;a href="https://jupyter.org/"&gt;Jupyter Notebook&lt;/a&gt;
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--28PVZfvV--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/6s3mzkmohxrs3t22ng0d.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--28PVZfvV--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/6s3mzkmohxrs3t22ng0d.png" alt="Jupyter Notebook"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;If you are just getting started with Python, Jupyter Notebooks are the best way to learn. They provide “interactiveness” (code as you go) as a web application in which you can create and share documents that contain live code, equations, visualizations as well as text. The Jupyter Notebook is one of the ideal tools to help you to gain the data science skills you need.&lt;/p&gt;

&lt;p&gt;Software/code development can never be complete. It's an ongoing process and evolves with the data. The next tool would help you version control and manage projects enabling collaboration.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Pricing: FREE&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;a href="https://github.com/"&gt;Git&lt;/a&gt;
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--OdvHdKZd--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/9zq16iynqdsdcz97an97.PNG" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--OdvHdKZd--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/9zq16iynqdsdcz97an97.PNG" alt="GitHub"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Git is the most widely used version control system. Github is the world's leading software development platform that uses Git for version control. A version control system is something that records changes to a file or set of files over time so that you can recall specific versions later. Git is an essential tool as it helps you work with others, and it is something you find in many workplaces.&lt;/p&gt;

&lt;p&gt;Using Git, nothing is lost as one can always go back to see previous versions of their programs. It can handle conflicts while synchronizing work done by different people on different machines, so it scales as your team does. Knowing Git makes it easier to contribute to open source development of packages.&lt;/p&gt;

&lt;p&gt;Git and its subversion control systems are vital in keeping the record of entire projects; it is not devised to manage sheer code snippets. Often it's not a good practise committing sensitive information to a GIT repository.&lt;/p&gt;

&lt;p&gt;For this purpose, a code snippet manager can be used. One such unique snippet organizer is DECS (Decentralized Encrypted Code Snippets).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Pricing: FREE / Paid Enterprise Version&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;a href="https://app.decs.xyz/"&gt;DECS&lt;/a&gt;
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--nYUX9Vnp--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/mok4fxtutsvm3dytmzxw.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--nYUX9Vnp--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/mok4fxtutsvm3dytmzxw.png" alt="DECS"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;DECS offers an all-in-one workspace to store securely and tightly control access to code snippets, sensitive information such as to proprietary snippets, tokens, configurations, certificates etc.&lt;/p&gt;

&lt;p&gt;Data analysis pipelines are usually the same, rather than remembering the complete routine about a specific pipeline and re-writing the same code across several experiments/applications, it’s always easier to remember what that pipeline/algorithm is instead.&lt;/p&gt;

&lt;p&gt;So using DECS, one can store the code snippets or routines fully encrypted and use them across several applications just by copy-pasting or downloading. These features help in organizing an infinite amount of valuable information without actually forgetting. Code capture on the go feature of DECS enables you to copy the code from anywhere on the web with just one click. As data stored on DECS is end-to-end encrypted, you can store sensitive data including the API keys, and always is just a search away, saving much time. It is also Decentralised and FREE.&lt;/p&gt;

&lt;p&gt;Next is the tool that helps you expose your work as an API to other external tools/users and helps you scale up.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Pricing: FREE&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;a href="http://flask.pocoo.org/"&gt;Flask&lt;/a&gt;
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--6343RMKE--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/22lw5xn9o6w23pnpszzh.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--6343RMKE--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/22lw5xn9o6w23pnpszzh.png" alt="Flask"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Flask is a handy tool for exposing the machine learning models as web calls and is useful for building microservices. Flask is fun and easy to set up, as it says on the Flask website. That’s true, as this microframework for Python offers a powerful way of annotating Python function with a REST endpoint. So basically using Flask, the machine learning models can be published as an API to be accessible by users/customers/clients or any other 3rd party business applications. This way, one could even commercialize their machine learning models as a web service.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Pricing: FREE&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;These are the tools that can help you get started, up and rolling. What do you think of the list? Let me know if you have any other tools that are super useful for a data scientist.&lt;/p&gt;

</description>
      <category>tools</category>
      <category>datascientist</category>
      <category>programming</category>
      <category>python</category>
    </item>
  </channel>
</rss>
