<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: SRIMATHI S</title>
    <description>The latest articles on DEV Community by SRIMATHI S (@srimathi_s).</description>
    <link>https://dev.to/srimathi_s</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3482132%2F99c3479c-8bc7-44e9-976b-8878bdbb0347.png</url>
      <title>DEV Community: SRIMATHI S</title>
      <link>https://dev.to/srimathi_s</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/srimathi_s"/>
    <language>en</language>
    <item>
      <title>Understanding 6 Common Data Formats in Data Analytics</title>
      <dc:creator>SRIMATHI S</dc:creator>
      <pubDate>Wed, 08 Oct 2025 05:07:42 +0000</pubDate>
      <link>https://dev.to/srimathi_s/understanding-6-common-data-formats-in-data-analytics-3k1c</link>
      <guid>https://dev.to/srimathi_s/understanding-6-common-data-formats-in-data-analytics-3k1c</guid>
      <description>&lt;p&gt;In the world of data analytics, information comes in many different formats — from simple spreadsheets to structured databases and modern big data files. Choosing the right format affects storage efficiency, speed, and compatibility with tools like Python, Spark, or SQL engines.&lt;/p&gt;

&lt;p&gt;In this article, we’ll explore six popular data formats used in analytics:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;CSV (Comma-Separated Values)&lt;/li&gt;
&lt;li&gt;SQL (Relational Table Format)&lt;/li&gt;
&lt;li&gt;JSON (JavaScript Object Notation)&lt;/li&gt;
&lt;li&gt;Parquet (Columnar Storage Format)&lt;/li&gt;
&lt;li&gt;XML (Extensible Markup Language)&lt;/li&gt;
&lt;li&gt;Avro (Row-Based Storage Format)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Sample Dataset&lt;/strong&gt;&lt;br&gt;
Name    Register Number Subject Marks&lt;br&gt;
Anitha  101 Data Analytics  85&lt;br&gt;
Bala    102 Cloud Computing 90&lt;br&gt;
Charan  103 Machine Learning 88&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1.CSV (Comma-Separated Values)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What it is:&lt;br&gt;
CSV is one of the simplest and most widely used data formats. Each row represents a record, and columns are separated by commas. It’s human-readable and supported by almost every data tool.&lt;/p&gt;

&lt;p&gt;Example (data.csv):&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Name,RegisterNumber,Subject,Marks
Anitha,101,Data Analytics,85
Bala,102,Cloud Computing,90
Charan,103,Machine Learning,88
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;2.SQL (Relational Table Format)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What it is:&lt;br&gt;
SQL format represents data stored in a relational database table. Each row is a record, and the structure (columns, types) is defined by a schema.&lt;/p&gt;

&lt;p&gt;Example (data.sql):&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;CREATE TABLE Students (
  Name VARCHAR(50),
  RegisterNumber INT,
  Subject VARCHAR(50),
  Marks INT
);

INSERT INTO Students VALUES
('Anitha', 101, 'Data Analytics', 85),
('Bala', 102, 'Cloud Computing', 90),
('Charan', 103, 'Machine Learning', 88);
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;3.JSON (JavaScript Object Notation)&lt;/strong&gt;&lt;br&gt;
What it is:&lt;br&gt;
JSON is a lightweight format used for structured and semi-structured data. It’s easy for humans to read and easy for machines to parse.&lt;/p&gt;

&lt;p&gt;Example (data.json):&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;[
  {
    "Name": "Anitha",
    "RegisterNumber": 101,
    "Subject": "Data Analytics",
    "Marks": 85
  },
  {
    "Name": "Bala",
    "RegisterNumber": 102,
    "Subject": "Cloud Computing",
    "Marks": 90
  },
  {
    "Name": "Charan",
    "RegisterNumber": 103,
    "Subject": "Machine Learning",
    "Marks": 88
  }
]
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;4.Parquet (Columnar Storage Format)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What it is:&lt;br&gt;
Parquet is a columnar storage format developed for efficient big data processing. Instead of storing row-by-row like CSV, it stores data column-by-column, which saves space and speeds up analytics queries.&lt;/p&gt;

&lt;p&gt;Example (Conceptual View):&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Columns:
Name → ["Anitha", "Bala", "Charan"]
RegisterNumber → [101, 102, 103]
Subject → ["Data Analytics", "Cloud Computing", "Machine Learning"]
Marks → [85, 90, 88]
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;5.XML (Extensible Markup Language)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What it is:&lt;br&gt;
XML uses tags to describe data, similar to HTML. It’s widely used for configuration files and data interchange between systems.&lt;/p&gt;

&lt;p&gt;Example (data.xml):&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;&amp;lt;Students&amp;gt;
  &amp;lt;Student&amp;gt;
    &amp;lt;Name&amp;gt;Anitha&amp;lt;/Name&amp;gt;
    &amp;lt;RegisterNumber&amp;gt;101&amp;lt;/RegisterNumber&amp;gt;
    &amp;lt;Subject&amp;gt;Data Analytics&amp;lt;/Subject&amp;gt;
    &amp;lt;Marks&amp;gt;85&amp;lt;/Marks&amp;gt;
  &amp;lt;/Student&amp;gt;
  &amp;lt;Student&amp;gt;
    &amp;lt;Name&amp;gt;Bala&amp;lt;/Name&amp;gt;
    &amp;lt;RegisterNumber&amp;gt;102&amp;lt;/RegisterNumber&amp;gt;
    &amp;lt;Subject&amp;gt;Cloud Computing&amp;lt;/Subject&amp;gt;
    &amp;lt;Marks&amp;gt;90&amp;lt;/Marks&amp;gt;
  &amp;lt;/Student&amp;gt;
  &amp;lt;Student&amp;gt;
    &amp;lt;Name&amp;gt;Charan&amp;lt;/Name&amp;gt;
    &amp;lt;RegisterNumber&amp;gt;103&amp;lt;/RegisterNumber&amp;gt;
    &amp;lt;Subject&amp;gt;Machine Learning&amp;lt;/Subject&amp;gt;
    &amp;lt;Marks&amp;gt;88&amp;lt;/Marks&amp;gt;
  &amp;lt;/Student&amp;gt;
&amp;lt;/Students&amp;gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;6.Avro (Row-Based Storage Format)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What it is:&lt;br&gt;
Avro is a binary row-based data format developed by Apache. It stores both the schema and the data, making it ideal for streaming and serialization in Hadoop/Spark environments.&lt;/p&gt;

&lt;p&gt;Example (Conceptual View):&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;{
  "type": "record",
  "name": "Student",
  "fields": [
    {"name": "Name", "type": "string"},
    {"name": "RegisterNumber", "type": "int"},
    {"name": "Subject", "type": "string"},
    {"name": "Marks", "type": "int"}
  ],
  "records": [
    {"Name": "Anitha", "RegisterNumber": 101, "Subject": "Data Analytics", "Marks": 85},
    {"Name": "Bala", "RegisterNumber": 102, "Subject": "Cloud Computing", "Marks": 90},
    {"Name": "Charan", "RegisterNumber": 103, "Subject": "Machine Learning", "Marks": 88}
  ]
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Conclusion&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Every data format serves a unique purpose.&lt;br&gt;
Use CSV/JSON for simplicity and readability.&lt;br&gt;
Choose SQL/XML for structured data exchange.&lt;br&gt;
Opt for Parquet/Avro when handling big data at scale.&lt;br&gt;
Understanding these formats helps you pick the right tool for efficient data storage, transfer, and analytics in the cloud. &lt;/p&gt;

</description>
      <category>database</category>
      <category>datascience</category>
      <category>learning</category>
    </item>
    <item>
      <title>Exploring MongoDB</title>
      <dc:creator>SRIMATHI S</dc:creator>
      <pubDate>Fri, 05 Sep 2025 15:49:55 +0000</pubDate>
      <link>https://dev.to/srimathi_s/exploring-mongodb-2nd3</link>
      <guid>https://dev.to/srimathi_s/exploring-mongodb-2nd3</guid>
      <description>&lt;p&gt;As part of exploring modern databases, I decided to work with MongoDB.&lt;br&gt;
It uses JSON-like documents to store data, making it very intuitive to query and manipulate.&lt;br&gt;
Compared to traditional SQL databases, it offers great flexibility and speed for unstructured datasets.&lt;br&gt;
In this post, I’ll share my learning experience and some hands-on tasks I completed.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1.Setting Up MongoDB&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I installed MongoDB (you can either install locally or use MongoDB Atlas Cloud).&lt;br&gt;
After installation, I verified the setup using:&lt;br&gt;
mongod --version&lt;br&gt;
mongo --version &lt;/p&gt;

&lt;p&gt;MongoDB was up and running!&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2.Importing Dataset&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;For this project, I used a sample Yelp Reviews Dataset (JSON format).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3.Insert Records Manually&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
I inserted 10 sample records into the collection:&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fo4pu8dxxhjcu6v0pooua.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fo4pu8dxxhjcu6v0pooua.jpeg" alt="A pie chart showing 40% responded " width="800" height="423"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;4.Queries&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Find Top 5 Businesses with Highest Average Rating.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6sfowazv4d7s6x3ukcak.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6sfowazv4d7s6x3ukcak.jpeg" alt="A pie chart showing 40% responded " width="800" height="420"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
Count How Many Reviews Contain the Word “good”.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fhdf59cxucf58v6thsx9w.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fhdf59cxucf58v6thsx9w.jpeg" alt="A pie chart showing 40% responded " width="800" height="420"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Get All Reviews for a Specific Business ID.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdernugs56ag8hd9ivei1.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdernugs56ag8hd9ivei1.jpeg" alt="A pie chart showing 40% responded " width="800" height="265"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Update a review and delete a record.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fg2hs4si33vb5hja9qa8a.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fg2hs4si33vb5hja9qa8a.jpeg" alt="A pie chart showing 40% responded " width="800" height="421"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Conclusion&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Through this hands-on project, I learned how to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Insert, query, update, and delete records in MongoDB&lt;/li&gt;
&lt;li&gt;Perform aggregation queries&lt;/li&gt;
&lt;li&gt;Use regex for text search&lt;/li&gt;
&lt;li&gt;Export query results to JSON/CSV&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;MongoDB’s flexibility and power make it a great tool for handling real-world datasets.&lt;/p&gt;

</description>
    </item>
  </channel>
</rss>
