<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Chamindu Perera</title>
    <description>The latest articles on DEV Community by Chamindu Perera (@chaminduperera).</description>
    <link>https://dev.to/chaminduperera</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1152113%2F452fb671-675b-4c71-8716-f0484c4a5ebc.png</url>
      <title>DEV Community: Chamindu Perera</title>
      <link>https://dev.to/chaminduperera</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/chaminduperera"/>
    <language>en</language>
    <item>
      <title>KNIME Analytics Platform for Data Science-1</title>
      <dc:creator>Chamindu Perera</dc:creator>
      <pubDate>Sat, 02 Sep 2023 19:44:08 +0000</pubDate>
      <link>https://dev.to/chaminduperera/knime-analytics-platform-for-data-science-1-2ie7</link>
      <guid>https://dev.to/chaminduperera/knime-analytics-platform-for-data-science-1-2ie7</guid>
      <description>&lt;h2&gt;
  
  
  KNIME
&lt;/h2&gt;

&lt;p&gt;KNIME Analytics Platform is a Java-based data science tool. It makes data mining applications simpler to build because it supports visual programming: you assemble a workflow out of nodes, which &lt;strong&gt;removes the need for advanced coding skills while still allowing you to write code when you need it&lt;/strong&gt;. It is an extensible platform with a large and varied collection of plugins, and it is commonly used in academic settings. KNIME is cross-platform and can be installed on several operating systems. Early versions of KNIME work with Java 8, but recent versions require &lt;strong&gt;Java 11&lt;/strong&gt; or higher. So, you can install one of the &lt;code&gt;openjdk-11&lt;/code&gt; or &lt;code&gt;openjdk-latest&lt;/code&gt; packages before installing the KNIME platform. After that, you can download KNIME from this &lt;a href="https://www.knime.com/downloads"&gt;link&lt;/a&gt;.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;KNIME Workbench&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The main tab in the KNIME interface is the Workbench. This Workbench is the place where you will be building your workflows. It is also where you'll find all the resources you need to help you build your workflows. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--IoDOXhje--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/7xihyw8bizedpc8atlin.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--IoDOXhje--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/7xihyw8bizedpc8atlin.png" alt="Image description" width="800" height="455"&gt;&lt;/a&gt;&lt;br&gt;
The other tabs are:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;KNIME Explorer&lt;br&gt;
The Explorer is where you manage your workflows, workflow groups, and server connections. It also gives you access to example workflows that demonstrate different KNIME functionalities.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Workflow Coach&lt;br&gt;
This tab recommends nodes that you can use as the next step from where you are now. The recommendations are based on KNIME community usage statistics. You can add nodes from the Workflow Coach to the editor by dragging and dropping them, or by double-clicking.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Node repository&lt;br&gt;
The node repository lists all currently installed nodes. As in the Workflow Coach, you build your workflow by dragging nodes from the node repository to the workflow editor, or by double-clicking them. The repository offers two search modes: crisp search and fuzzy search. The &lt;strong&gt;crisp search&lt;/strong&gt; is the default mode and returns all nodes that either have the search term in their name or are in a subcategory whose name includes the search term. The &lt;strong&gt;fuzzy search&lt;/strong&gt; returns all nodes that are related to the search term.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Description&lt;br&gt;
This tab gives you information about the currently active workflow, or about an individual node selected in either the node repository or the workflow editor. For a workflow, it tells you the purpose of the workflow, what it does, and what you might need to run it, plus links to related blog articles or other web pages.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Console&lt;br&gt;
The console shows any warnings and error messages that relate to your workflow, indicating what is going on under the hood.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
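
&lt;p&gt;The difference between the two search modes of the node repository can be sketched in a few lines of Python. This is only an illustration, not KNIME's implementation: the tiny node list, the &lt;code&gt;crisp_search&lt;/code&gt; and &lt;code&gt;fuzzy_search&lt;/code&gt; helpers, and the use of string similarity for the fuzzy mode are all assumptions made for the example.&lt;/p&gt;

```python
import difflib

# Hypothetical mini repository: node name mapped to its subcategory.
NODES = {
    "CSV Reader": "IO/Read",
    "Excel Reader": "IO/Read",
    "CSV Writer": "IO/Write",
    "Row Filter": "Manipulation/Row",
}


def crisp_search(term):
    """Nodes whose name or subcategory contains the term (case-insensitive)."""
    t = term.lower()
    return [n for n, cat in NODES.items() if t in n.lower() or t in cat.lower()]


def fuzzy_search(term, cutoff=0.6):
    """Nodes whose name is similar to the term (assumed meaning of 'related')."""
    lowered = {n.lower(): n for n in NODES}
    hits = difflib.get_close_matches(term.lower(), list(lowered), n=5, cutoff=cutoff)
    return [lowered[h] for h in hits]


print(crisp_search("read"))        # substring match on name or subcategory
print(fuzzy_search("csv raeder"))  # similarity match, best hit first
```

&lt;p&gt;In this sketch the crisp mode finds nothing for a misspelled term, while the similarity-based mode still ranks the closest node name first.&lt;/p&gt;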

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--vXmn_XVN--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/w918son6x0e95z89lwxq.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--vXmn_XVN--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/w918son6x0e95z89lwxq.png" alt="Image description" width="742" height="452"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h4&gt;
  
  
  Nodes, Data &amp;amp; Workflows
&lt;/h4&gt;

&lt;p&gt;The following image illustrates a node and explains the different parts of a node.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--2dXzEUQ1--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/vofgr9qocuqz6f18h10y.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--2dXzEUQ1--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/vofgr9qocuqz6f18h10y.png" alt="Image description" width="391" height="208"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h4&gt;
  
  
  Data access
&lt;/h4&gt;

&lt;p&gt;Any project starts with accessing the required data. The nodes below show the different ways to access data in KNIME. You can access and blend data from different local and remote file systems: CSV files and other formatted text files, Excel workbooks, proprietary file formats of other tools, and more.&lt;/p&gt;

&lt;h4&gt;
  
  
  Reader Nodes
&lt;/h4&gt;

&lt;p&gt;The common file types with dedicated reader nodes are CSV files, KNIME table files, and Excel files. For a more exhaustive list and description of all KNIME nodes for data access, download the free e-book &lt;a href="https://www.knime.com/knimepress/will-they-blend#:~:text=This%20is%20the%20fourth%20edition,social%20media%2C%20from%20R%20and"&gt;"Will they blend?"&lt;/a&gt;, a collection of blog posts centered around data access &amp;amp; blending.&lt;/p&gt;

&lt;h6&gt;
  
  
  Related Nodes:
&lt;/h6&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--Enk18K2H--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/n5h51yra4cwwnqlnax3b.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--Enk18K2H--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/n5h51yra4cwwnqlnax3b.png" alt="Image description" width="304" height="103"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;1. CSV Reader&lt;br&gt;
2. Table Reader&lt;br&gt;
3. Excel Reader&lt;/p&gt;
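
&lt;p&gt;For readers who think in code, what a reader node such as the CSV Reader does is roughly comparable to the Python sketch below. It is not KNIME code: the sample data and the &lt;code&gt;read_csv_table&lt;/code&gt; helper are made up for illustration, and in KNIME you configure the node through its dialog instead of writing a script.&lt;/p&gt;

```python
import csv
import io

# Hypothetical sample data standing in for a CSV file on disk.
SAMPLE_CSV = "id,name,score\n1,Ann,90\n2,Bob,\n3,Cara,75\n"


def read_csv_table(text):
    """Parse CSV text into a list of row dictionaries keyed by column name."""
    reader = csv.DictReader(io.StringIO(text))
    return [dict(row) for row in reader]


table = read_csv_table(SAMPLE_CSV)
for row in table:
    print(row)
```

&lt;p&gt;Note that the empty score in the second row is read as an empty string here, which is the kind of gap the data cleaning steps have to deal with.&lt;/p&gt;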

&lt;h4&gt;
  
  
  Accessing Databases
&lt;/h4&gt;

&lt;p&gt;In KNIME, you can connect to any JDBC-compliant database and manipulate data directly on the database. At any point, you can read the data into a local KNIME data table, and vice versa.&lt;/p&gt;

&lt;h6&gt;
  
  
  Related Nodes:
&lt;/h6&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--mUvdC_oE--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/tikarpe9yieo4vd2tktd.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--mUvdC_oE--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/tikarpe9yieo4vd2tktd.png" alt="Image description" width="294" height="96"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;1. DB Connector&lt;br&gt;
2. DB Table Selector&lt;br&gt;
3. DB Reader&lt;/p&gt;
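
&lt;p&gt;The connect, select, and read sequence of these three nodes can be sketched in Python as follows. This is an analogy under stated assumptions, not how KNIME works internally: KNIME talks to databases through JDBC, while the sketch uses Python's built-in &lt;code&gt;sqlite3&lt;/code&gt; module with an in-memory database, and the &lt;code&gt;customers&lt;/code&gt; table and its contents are hypothetical.&lt;/p&gt;

```python
import sqlite3

# Step 1 (like DB Connector): open a connection. An in-memory SQLite
# database is used here only so the sketch is self-contained.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER, city TEXT)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?)",
    [(1, "Colombo"), (2, "Kandy"), (3, "Colombo")],
)

# Step 2 (like DB Table Selector): describe the data you want as a query.
# Nothing is copied out of the database yet.
query = "SELECT id, city FROM customers WHERE city = ? ORDER BY id"

# Step 3 (like DB Reader): run the query and pull the result into local memory.
rows = conn.execute(query, ("Colombo",)).fetchall()
conn.close()

print(rows)
```

&lt;p&gt;The point of keeping steps 2 and 3 separate is that the filtering happens on the database, and only the result is read into a local table.&lt;/p&gt;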

&lt;h4&gt;
  
  
  Data Cleaning
&lt;/h4&gt;

&lt;p&gt;After accessing the data from one or more sources, you need to clean and transform it. One purpose of these steps is to reduce irrelevant information and retain the essential information. The possible operations include row and column filtering and transformations on numbers, strings, and rules.&lt;/p&gt;

&lt;p&gt;Another purpose of cleaning data is to reduce bias and enhance clarity in the next steps, such as model training, statistics calculation, and dashboard creation. Missing value handling and outlier detection are helpful operations here.&lt;/p&gt;

&lt;h4&gt;
  
  
  Row &amp;amp; Column Filtering
&lt;/h4&gt;

&lt;p&gt;A good start to transforming data is to exclude unnecessary information. Rows can be removed, for example, if they are duplicates, have missing values, or if they just don't belong to the subset of interest.&lt;/p&gt;

&lt;h6&gt;
  
  
  Related Nodes:
&lt;/h6&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--1yrsIQT9--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/jw16m89ldumuh1lh5q7t.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--1yrsIQT9--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/jw16m89ldumuh1lh5q7t.png" alt="Image description" width="240" height="102"&gt;&lt;/a&gt;&lt;br&gt;
1. Row Filter&lt;br&gt;
2. Column Filter&lt;/p&gt;
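
&lt;p&gt;The idea behind row and column filtering can be sketched in plain Python. The sample rows and the &lt;code&gt;filter_rows&lt;/code&gt; and &lt;code&gt;filter_columns&lt;/code&gt; helpers are hypothetical and only illustrate the concept of dropping duplicates, rows with missing values, and unneeded columns. They do not reproduce the configuration options of the KNIME nodes.&lt;/p&gt;

```python
# Hypothetical rows; None marks a missing value.
rows = [
    {"id": 1, "name": "Ann", "score": 90, "note": "a"},
    {"id": 2, "name": "Bob", "score": None, "note": "b"},
    {"id": 1, "name": "Ann", "score": 90, "note": "a"},
    {"id": 3, "name": "Cara", "score": 75, "note": "c"},
]


def filter_rows(table):
    """Drop rows with a missing value and exact duplicates, keeping order."""
    seen = set()
    kept = []
    for row in table:
        key = tuple(row.items())
        if None in row.values() or key in seen:
            continue
        seen.add(key)
        kept.append(row)
    return kept


def filter_columns(table, keep):
    """Keep only the named columns in every row."""
    return [{col: row[col] for col in keep} for row in table]


cleaned = filter_columns(filter_rows(rows), ["id", "score"])
print(cleaned)
```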

&lt;p&gt;That's it for the basic introduction and the data access and cleaning parts of KNIME for beginners. Stay tuned for the upcoming Data Standardization part, which covers numbers, strings, and rules.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Thank you!&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

</description>
      <category>datascience</category>
      <category>dataengineering</category>
      <category>ai</category>
      <category>machinelearning</category>
    </item>
  </channel>
</rss>
