<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Sri Vardhan</title>
    <description>The latest articles on DEV Community by Sri Vardhan (@sri_vardhan_a896a0c1b7d72).</description>
    <link>https://dev.to/sri_vardhan_a896a0c1b7d72</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3493768%2F27d56b30-1e80-484f-bdc1-aed40af4bb8b.png</url>
      <title>DEV Community: Sri Vardhan</title>
      <link>https://dev.to/sri_vardhan_a896a0c1b7d72</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/sri_vardhan_a896a0c1b7d72"/>
    <language>en</language>
    <item>
      <title>Common Data Formats Used in Cloud Data Analytics</title>
      <dc:creator>Sri Vardhan</dc:creator>
      <pubDate>Wed, 19 Nov 2025 17:23:52 +0000</pubDate>
      <link>https://dev.to/sri_vardhan_a896a0c1b7d72/common-data-formats-used-in-cloud-data-analytics-1g2b</link>
      <guid>https://dev.to/sri_vardhan_a896a0c1b7d72/common-data-formats-used-in-cloud-data-analytics-1g2b</guid>
      <description>&lt;p&gt;When we work with data in cloud platforms like Google Cloud, AWS, or Azure, we often see different file formats.&lt;br&gt;
Each format is used for a different purpose — some are easy to read, some save a lot of space, and some are perfect for big-data analytics.&lt;/p&gt;

&lt;p&gt;In this post, I’ll explain 6 commonly used data formats with simple definitions and a small sample dataset represented in all formats.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Sample Dataset Used in All Examples&lt;/strong&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Name&lt;/th&gt;
&lt;th&gt;Register_No&lt;/th&gt;
&lt;th&gt;Subject&lt;/th&gt;
&lt;th&gt;Marks&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Sri&lt;/td&gt;
&lt;td&gt;101&lt;/td&gt;
&lt;td&gt;Math&lt;/td&gt;
&lt;td&gt;87&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Vardhan&lt;/td&gt;
&lt;td&gt;102&lt;/td&gt;
&lt;td&gt;Science&lt;/td&gt;
&lt;td&gt;92&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Darsani&lt;/td&gt;
&lt;td&gt;103&lt;/td&gt;
&lt;td&gt;English&lt;/td&gt;
&lt;td&gt;89&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;1️⃣ CSV (Comma Separated Values)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;CSV is the simplest and most widely used data format.&lt;br&gt;
It is a plain text file where each value is separated by a comma.&lt;/p&gt;

&lt;p&gt;✔ Easy to read&lt;br&gt;
✔ Works with Excel, Google Sheets, and almost every tool&lt;br&gt;
✔ Good for small datasets&lt;/p&gt;

&lt;p&gt;✅ Example (CSV)&lt;br&gt;
Name,Register_No,Subject,Marks&lt;br&gt;
Sri,101,Math,87&lt;br&gt;
Vardhan,102,Science,92&lt;br&gt;
Keerthana,103,English,89&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2️⃣ SQL (Relational Table Format)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;SQL format represents data in a table structure (rows and columns).&lt;br&gt;
It is used in relational databases like MySQL, PostgreSQL, and SQL Server.&lt;/p&gt;

&lt;p&gt;✔ Best for structured, organized data&lt;br&gt;
✔ Allows powerful querying using SQL commands&lt;/p&gt;

&lt;p&gt;✅ Example (SQL)&lt;br&gt;
CREATE TABLE Students (&lt;br&gt;
  Name TEXT,&lt;br&gt;
  Register_No INT,&lt;br&gt;
  Subject TEXT,&lt;br&gt;
  Marks INT&lt;br&gt;
);&lt;/p&gt;

&lt;p&gt;INSERT INTO Students VALUES&lt;br&gt;
('Sri', 101, 'Math', 87),&lt;br&gt;
('Vardhan', 102, 'Science', 92),&lt;br&gt;
('Keerthana', 103, 'English', 89);&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3️⃣ JSON (JavaScript Object Notation)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;JSON stores data as key-value pairs.&lt;br&gt;
It is widely used in APIs, web apps, and NoSQL databases like MongoDB.&lt;/p&gt;

&lt;p&gt;✔ Human-readable&lt;br&gt;
✔ Great for semi-structured data&lt;br&gt;
✔ Works well with programming languages&lt;/p&gt;

&lt;p&gt;✅ Example (JSON)&lt;br&gt;
[&lt;br&gt;
  { "Name": "Sri", "Register_No": 101, "Subject": "Math", "Marks": 87 },&lt;br&gt;
  { "Name": "Vardhan", "Register_No": 102, "Subject": "Science", "Marks": 92 },&lt;br&gt;
  { "Name": "Keerthana", "Register_No": 103, "Subject": "English", "Marks": 89 }&lt;br&gt;
]&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;4️⃣ Parquet (Columnar Storage Format)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Parquet is a binary, column-oriented file format used in big-data systems like Spark, Hive, and BigQuery.&lt;/p&gt;

&lt;p&gt;✔ Extremely efficient for analytics&lt;br&gt;
✔ Compresses data well&lt;br&gt;
✔ Reads only required columns → very fast&lt;/p&gt;

&lt;p&gt;Parquet is not displayed as plain text, but here’s the conceptual layout:&lt;/p&gt;

&lt;p&gt;📌 Parquet (Columnar View)&lt;br&gt;
Name:        ["Sri", "Vardhan", "Keerthana"]&lt;br&gt;
Register_No: [101, 102, 103]&lt;br&gt;
Subject:     ["Math", "Science", "English"]&lt;br&gt;
Marks:       [87, 92, 89]&lt;br&gt;
**&lt;br&gt;
5️⃣ XML (Extensible Markup Language)**&lt;/p&gt;

&lt;p&gt;XML stores data using custom tags (similar to HTML).&lt;br&gt;
It was widely used in older systems and is still common in some enterprise applications.&lt;/p&gt;

&lt;p&gt;✔ Self-descriptive&lt;br&gt;
✔ Structured&lt;br&gt;
✔ Used in configurations and data interchange&lt;/p&gt;

&lt;p&gt;✅ Example (XML)&lt;br&gt;
&lt;br&gt;
  &lt;br&gt;
    Sri&lt;br&gt;
    101&lt;br&gt;
    Math&lt;br&gt;
    87&lt;br&gt;
  &lt;/p&gt;

&lt;p&gt;&lt;br&gt;
    Vardhan&lt;br&gt;
    102&lt;br&gt;
    Science&lt;br&gt;
    92&lt;br&gt;
  &lt;/p&gt;

&lt;p&gt;&lt;br&gt;
    Keerthana&lt;br&gt;
    103&lt;br&gt;
    English&lt;br&gt;
    89&lt;br&gt;
  &lt;br&gt;
&lt;/p&gt;

&lt;p&gt;6️⃣ Avro (Row-Based Format with Schema)&lt;/p&gt;

&lt;p&gt;Avro is a binary file format used mostly in streaming systems like Kafka and big-data pipelines.&lt;/p&gt;

&lt;p&gt;✔ Stores data + schema together&lt;br&gt;
✔ Good for fast data serialization&lt;br&gt;
✔ Used in Hadoop ecosystems&lt;/p&gt;

&lt;p&gt;📌 Avro Schema&lt;br&gt;
{&lt;br&gt;
  "type": "record",&lt;br&gt;
  "name": "Student",&lt;br&gt;
  "fields": [&lt;br&gt;
    { "name": "Name", "type": "string" },&lt;br&gt;
    { "name": "Register_No", "type": "int" },&lt;br&gt;
    { "name": "Subject", "type": "string" },&lt;br&gt;
    { "name": "Marks", "type": "int" }&lt;br&gt;
  ]&lt;br&gt;
}&lt;/p&gt;

&lt;p&gt;📌 Avro Data (Readable Preview)&lt;br&gt;
[&lt;br&gt;
  { "Name": "Sri", "Register_No": 101, "Subject": "Math", "Marks": 87 },&lt;br&gt;
  { "Name": "Vardhan", "Register_No": 102, "Subject": "Science", "Marks": 92 },&lt;br&gt;
  { "Name": "Keerthana", "Register_No": 103, "Subject": "English", "Marks": 89 }&lt;br&gt;
]&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Conclusion&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Each data format has its own strengths:&lt;/p&gt;

&lt;p&gt;CSV → simple and universal&lt;/p&gt;

&lt;p&gt;SQL → perfect for structured data&lt;/p&gt;

&lt;p&gt;JSON → great for APIs&lt;/p&gt;

&lt;p&gt;Parquet → best for fast analytics&lt;/p&gt;

&lt;p&gt;XML → still used in many systems&lt;/p&gt;

&lt;p&gt;Avro → excellent for streaming data&lt;/p&gt;

&lt;p&gt;Understanding these formats is essential for working with cloud computing, big data, and modern analytics tools.&lt;/p&gt;

</description>
      <category>dataanalytics</category>
      <category>cloudcomputing</category>
      <category>csv</category>
    </item>
    <item>
      <title>Hands-On with MongoDB: Storing, Querying &amp; Analyzing Data</title>
      <dc:creator>Sri Vardhan</dc:creator>
      <pubDate>Thu, 11 Sep 2025 04:16:45 +0000</pubDate>
      <link>https://dev.to/sri_vardhan_a896a0c1b7d72/hands-on-with-mongodb-storing-querying-analyzing-data-1foc</link>
      <guid>https://dev.to/sri_vardhan_a896a0c1b7d72/hands-on-with-mongodb-storing-querying-analyzing-data-1foc</guid>
      <description>&lt;p&gt;MongoDB is one of the most popular NoSQL databases.&lt;br&gt;
In this blog, I’ll walk you through how I used MongoDB Compass to insert data, run queries, and perform analysis.&lt;/p&gt;

&lt;p&gt;Step 1: Install MongoDB&lt;br&gt;
-&amp;gt;I installed MongoDB Community Server and MongoDB Compass for GUI interaction.&lt;/p&gt;




&lt;p&gt;Step 2: Create Database &amp;amp; Collection&lt;br&gt;
-&amp;gt;Created database: mydb&lt;br&gt;
-&amp;gt;Created collection: reviews&lt;/p&gt;

&lt;p&gt;Inserted 10 documents manually:&lt;br&gt;
{ "business_id": 1, "name": "Pizza Point", "review": "The pizza was good", "rating": 4 }&lt;br&gt;
{ "business_id": 2, "name": "Book World", "review": "Good variety of books", "rating": 5 }&lt;br&gt;
...&lt;/p&gt;




&lt;p&gt;Step 3: Top 5 Businesses by Average Rating&lt;br&gt;
Using the Aggregations tab:&lt;br&gt;
// Stage 1: Group&lt;br&gt;
{ &lt;br&gt;
  "$group": { &lt;br&gt;
    "_id": "$business_id", &lt;br&gt;
    "name": { "$first": "$name" }, &lt;br&gt;
    "avgRating": { "$avg": "$rating" } &lt;br&gt;
  } &lt;br&gt;
}&lt;/p&gt;

&lt;p&gt;// Stage 2: Sort&lt;br&gt;
{ "avgRating": -1 }&lt;/p&gt;

&lt;p&gt;// Stage 3: Limit&lt;br&gt;
{ "$limit": 5 }&lt;/p&gt;




&lt;p&gt;Step 4: Find Reviews Containing “good”&lt;br&gt;
In the Filter bar:&lt;br&gt;
{ "review": { "$regex": "good", "$options": "i" } }&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ftkalb9898k6rgxyk9rar.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ftkalb9898k6rgxyk9rar.png" alt=" " width="800" height="449"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;Step 5: Query Reviews for a Specific Business&lt;br&gt;
{ "business_id": 2 }&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9c4giaj91averq6t68ja.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9c4giaj91averq6t68ja.png" alt=" " width="800" height="685"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;Step 6: Update a Review&lt;br&gt;
{ "business_id": 2 },&lt;br&gt;
{ "$set": { "review": "Excellent variety of books" } }&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqxsd98rn5cj4ge47zl9s.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqxsd98rn5cj4ge47zl9s.png" alt=" " width="800" height="683"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;Step 7: Delete a Record&lt;br&gt;
-&amp;gt;Before deletion&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ff1epcc0wf6hf0nbzxhy3.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ff1epcc0wf6hf0nbzxhy3.png" alt=" " width="800" height="621"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;-&amp;gt;After deletion&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fl1y9louz7eoq8cyx59gx.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fl1y9louz7eoq8cyx59gx.png" alt=" " width="800" height="641"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;Step 8: Export Query Results&lt;br&gt;
In Compass → Aggregations → Export → JSON/CSV&lt;br&gt;
Example: exported top 5 businesses as JSON.&lt;br&gt;
[{&lt;br&gt;
  "_id": {&lt;br&gt;
    "$oid": "68c1aa6ed8d3811853f084ec"&lt;br&gt;
  },&lt;br&gt;
  "business_id": 1,&lt;br&gt;
  "name": "Cafe Delight",&lt;br&gt;
  "rating": 4,&lt;br&gt;
  "review": "good coffee and snacks"&lt;br&gt;
},&lt;br&gt;
{&lt;br&gt;
  "_id": {&lt;br&gt;
    "$oid": "68c1aa6ed8d3811853f084ed"&lt;br&gt;
  },&lt;br&gt;
  "business_id": 2,&lt;br&gt;
  "name": "Food Haven",&lt;br&gt;
  "rating": 5,&lt;br&gt;
  "review": "excellent food and good staff"&lt;br&gt;
},&lt;br&gt;
{&lt;br&gt;
  "_id": {&lt;br&gt;
    "$oid": "68c1aa6ed8d3811853f084ee"&lt;br&gt;
  },&lt;br&gt;
  "business_id": 3,&lt;br&gt;
  "name": "Tech Store",&lt;br&gt;
  "rating": 3,&lt;br&gt;
  "review": "service improved a lot, very good now"&lt;br&gt;
},&lt;br&gt;
{&lt;br&gt;
  "_id": {&lt;br&gt;
    "$oid": "68c1aa6ed8d3811853f084ef"&lt;br&gt;
  },&lt;br&gt;
  "business_id": 4,&lt;br&gt;
  "name": "Book World",&lt;br&gt;
  "rating": 5,&lt;br&gt;
  "review": "good collection of books"&lt;br&gt;
},&lt;br&gt;
]&lt;/p&gt;




&lt;p&gt;Conclusion&lt;br&gt;
This was a quick walkthrough of MongoDB basics using Compass:&lt;br&gt;
-&amp;gt;Inserted documents&lt;br&gt;
-&amp;gt;Ran queries&lt;br&gt;
-&amp;gt;Aggregated results&lt;br&gt;
-&amp;gt;Updated &amp;amp; deleted records&lt;br&gt;
-&amp;gt;Exported JSON/CSV&lt;br&gt;
MongoDB Compass makes working with data super easy without needing to memorize all commands.&lt;/p&gt;




</description>
    </item>
  </channel>
</rss>
