<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: AISHWARYA M 23IT007</title>
    <description>The latest articles on DEV Community by AISHWARYA M 23IT007 (@aishwarya_m23it007_c5d8e).</description>
    <link>https://dev.to/aishwarya_m23it007_c5d8e</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3491352%2F1f498127-7e0e-40ba-90c1-5dd9b3ba2cec.png</url>
      <title>DEV Community: AISHWARYA M 23IT007</title>
      <link>https://dev.to/aishwarya_m23it007_c5d8e</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/aishwarya_m23it007_c5d8e"/>
    <language>en</language>
    <item>
      <title>Data in the Cloud: Understanding 6 Common Data Formats in Analytics</title>
      <dc:creator>AISHWARYA M 23IT007</dc:creator>
      <pubDate>Sat, 11 Oct 2025 03:45:23 +0000</pubDate>
      <link>https://dev.to/aishwarya_m23it007_c5d8e/data-in-the-cloud-understanding-6-common-data-formats-in-analytics-46bi</link>
      <guid>https://dev.to/aishwarya_m23it007_c5d8e/data-in-the-cloud-understanding-6-common-data-formats-in-analytics-46bi</guid>
      <description>&lt;p&gt;When working with data in the cloud or in analytics pipelines, the format of your data matters. Choosing the right format can impact performance, storage cost, and ease of use.&lt;/p&gt;

&lt;p&gt;In this article, we’ll explore six widely used data formats:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;CSV (Comma-Separated Values)&lt;/li&gt;
&lt;li&gt;SQL (Relational Tables)&lt;/li&gt;
&lt;li&gt;JSON (JavaScript Object Notation)&lt;/li&gt;
&lt;li&gt;Parquet (Columnar Storage) &lt;/li&gt;
&lt;li&gt;XML (Extensible Markup Language)&lt;/li&gt;
&lt;li&gt;Avro (Row-based Storage)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;*&lt;em&gt;SAMPLE DATASET: *&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Name    Register_No Subject Marks&lt;br&gt;
Aisha   101 Math    85&lt;br&gt;
Rahul   102 Science 90&lt;br&gt;
Meera   103 English 78&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1️⃣ CSV (Comma-Separated Values)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What is CSV?&lt;br&gt;
CSV is the simplest data format. Each row represents a record, and values are separated by commas. It’s human-readable and widely supported by Excel, Python, R, and databases.&lt;/p&gt;

&lt;p&gt;Dataset in CSV:&lt;/p&gt;

&lt;p&gt;Name,Register_No,Subject,Marks&lt;br&gt;
Aisha,101,Math,85&lt;br&gt;
Rahul,102,Science,90&lt;br&gt;
Meera,103,English,78&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2️⃣ SQL (Relational Table Format)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What is SQL format?&lt;br&gt;
SQL databases store data in tables with rows and columns. Data is structured, and queries are run using SQL commands.&lt;/p&gt;

&lt;p&gt;Dataset in SQL:&lt;/p&gt;

&lt;p&gt;CREATE TABLE Students (&lt;br&gt;
    Name VARCHAR(50),&lt;br&gt;
    Register_No INT,&lt;br&gt;
    Subject VARCHAR(50),&lt;br&gt;
    Marks INT&lt;br&gt;
);&lt;/p&gt;

&lt;p&gt;INSERT INTO Students (Name, Register_No, Subject, Marks) VALUES&lt;br&gt;
('Aisha', 101, 'Math', 85),&lt;br&gt;
('Rahul', 102, 'Science', 90),&lt;br&gt;
('Meera', 103, 'English', 78);&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3️⃣ JSON (JavaScript Object Notation)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What is JSON?&lt;br&gt;
JSON is a lightweight, human-readable data format often used in APIs. It stores data as key-value pairs, arrays, and nested objects.&lt;/p&gt;

&lt;p&gt;Dataset in JSON:&lt;/p&gt;

&lt;p&gt;[&lt;br&gt;
  {&lt;br&gt;
    "Name": "Aisha",&lt;br&gt;
    "Register_No": 101,&lt;br&gt;
    "Subject": "Math",&lt;br&gt;
    "Marks": 85&lt;br&gt;
  },&lt;br&gt;
  {&lt;br&gt;
    "Name": "Rahul",&lt;br&gt;
    "Register_No": 102,&lt;br&gt;
    "Subject": "Science",&lt;br&gt;
    "Marks": 90&lt;br&gt;
  },&lt;br&gt;
  {&lt;br&gt;
    "Name": "Meera",&lt;br&gt;
    "Register_No": 103,&lt;br&gt;
    "Subject": "English",&lt;br&gt;
    "Marks": 78&lt;br&gt;
  }&lt;br&gt;
]&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;4️⃣ Parquet (Columnar Storage Format)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What is Parquet?&lt;br&gt;
Parquet is a binary columnar storage format optimized for big data analytics. Instead of storing row by row, it stores column by column, which speeds up aggregations and reduces storage.&lt;/p&gt;

&lt;p&gt;Dataset in Parquet (conceptual view):&lt;/p&gt;

&lt;p&gt;Columns:&lt;br&gt;
Name      → ["Aisha", "Rahul", "Meera"]&lt;br&gt;
Register_No → [101, 102, 103]&lt;br&gt;
Subject   → ["Math", "Science", "English"]&lt;br&gt;
Marks     → [85, 90, 78]&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;5️⃣ XML (Extensible Markup Language)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What is XML?&lt;br&gt;
XML is a markup language that uses tags to structure data. It’s verbose but still used in enterprise systems and web services.&lt;/p&gt;

&lt;p&gt;Dataset in XML:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;&amp;lt;Students&amp;gt;
  &amp;lt;Student&amp;gt;
    &amp;lt;Name&amp;gt;Aisha&amp;lt;/Name&amp;gt;
    &amp;lt;Register_No&amp;gt;101&amp;lt;/Register_No&amp;gt;
    &amp;lt;Subject&amp;gt;Math&amp;lt;/Subject&amp;gt;
    &amp;lt;Marks&amp;gt;85&amp;lt;/Marks&amp;gt;
  &amp;lt;/Student&amp;gt;
  &amp;lt;Student&amp;gt;
    &amp;lt;Name&amp;gt;Rahul&amp;lt;/Name&amp;gt;
    &amp;lt;Register_No&amp;gt;102&amp;lt;/Register_No&amp;gt;
    &amp;lt;Subject&amp;gt;Science&amp;lt;/Subject&amp;gt;
    &amp;lt;Marks&amp;gt;90&amp;lt;/Marks&amp;gt;
  &amp;lt;/Student&amp;gt;
  &amp;lt;Student&amp;gt;
    &amp;lt;Name&amp;gt;Meera&amp;lt;/Name&amp;gt;
    &amp;lt;Register_No&amp;gt;103&amp;lt;/Register_No&amp;gt;
    &amp;lt;Subject&amp;gt;English&amp;lt;/Subject&amp;gt;
    &amp;lt;Marks&amp;gt;78&amp;lt;/Marks&amp;gt;
  &amp;lt;/Student&amp;gt;
&amp;lt;/Students&amp;gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;6️⃣ Avro (Row-based Storage Format)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What is Avro?&lt;br&gt;
Avro is a row-based binary format developed by Apache. It’s schema-driven and commonly used in Kafka and Hadoop ecosystems. Unlike Parquet, it stores data row by row.&lt;/p&gt;

&lt;p&gt;Dataset in Avro (conceptual view):&lt;/p&gt;

&lt;p&gt;Schema (JSON-based):&lt;/p&gt;

&lt;p&gt;{&lt;br&gt;
  "type": "record",&lt;br&gt;
  "name": "Student",&lt;br&gt;
  "fields": [&lt;br&gt;
    {"name": "Name", "type": "string"},&lt;br&gt;
    {"name": "Register_No", "type": "int"},&lt;br&gt;
    {"name": "Subject", "type": "string"},&lt;br&gt;
    {"name": "Marks", "type": "int"}&lt;br&gt;
  ]&lt;br&gt;
}&lt;/p&gt;

&lt;p&gt;{"Name": "Aisha", "Register_No": 101, "Subject": "Math", "Marks": 85}&lt;br&gt;
{"Name": "Rahul", "Register_No": 102, "Subject": "Science", "Marks": 90}&lt;br&gt;
{"Name": "Meera", "Register_No": 103, "Subject": "English", "Marks": 78}&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Conclusion&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Each data format serves a different purpose:&lt;/p&gt;

&lt;p&gt;CSV → Simple, universal&lt;/p&gt;

&lt;p&gt;SQL → Structured, relational&lt;/p&gt;

&lt;p&gt;JSON → Flexible, great for APIs&lt;/p&gt;

&lt;p&gt;Parquet → Columnar, optimized for analytics&lt;/p&gt;

&lt;p&gt;XML → Structured but verbose&lt;/p&gt;

&lt;p&gt;Avro → Row-based, schema-driven&lt;/p&gt;

&lt;p&gt;In modern cloud data platforms (like AWS, GCP, Azure), Parquet and Avro are heavily used for large-scale analytics, while CSV, JSON, and SQL remain popular for data interchange.&lt;/p&gt;

</description>
      <category>dataengineering</category>
      <category>analytics</category>
      <category>data</category>
      <category>cloud</category>
    </item>
    <item>
      <title>Managing Yelp Reviews in MongoDB Compass</title>
      <dc:creator>AISHWARYA M 23IT007</dc:creator>
      <pubDate>Wed, 10 Sep 2025 06:27:20 +0000</pubDate>
      <link>https://dev.to/aishwarya_m23it007_c5d8e/managing-yelp-reviews-in-mongodb-compass-22oc</link>
      <guid>https://dev.to/aishwarya_m23it007_c5d8e/managing-yelp-reviews-in-mongodb-compass-22oc</guid>
      <description>&lt;p&gt;Working with databases can sometimes feel intimidating, but tools like MongoDB Compass make it much easier by providing a visual interface for managing and analyzing collections. In this blog, we’ll walk through some practical steps using screenshots from a sample yelpDB.reviews collection.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1. Viewing and Aggregating Documents&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The first step is exploring the documents stored inside the reviews collection. MongoDB Compass provides an Aggregations tab that allows us to run aggregation pipelines visually.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F7zevrj0n39kjam6t04nc.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F7zevrj0n39kjam6t04nc.jpg" alt=" " width="800" height="420"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Searching with Filters&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Often, we need to filter reviews to find specific insights. Compass makes this easy with its filter bar.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fb94s4imwrvknyl44sxbo.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fb94s4imwrvknyl44sxbo.jpg" alt=" " width="800" height="420"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Finding Documents by ID&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;If you want to look up a specific review or restaurant entry, you can filter using its business_id&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxtjdupl9m5zevawb9sq0.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxtjdupl9m5zevawb9sq0.jpg" alt=" " width="800" height="265"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;4. Deleting Documents&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Sometimes, data cleanup is necessary. In Compass, you can flag documents for deletion:&lt;br&gt;
Select the document.&lt;br&gt;
Click Delete.&lt;br&gt;
Confirm deletion.&lt;br&gt;
The document will be permanently removed from the collection.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3u2myvb99ogwq0v0lc9x.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3u2myvb99ogwq0v0lc9x.jpg" alt=" " width="800" height="418"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;5. Updating Documents&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Compass also allows editing documents directly:&lt;br&gt;
Click the pencil icon beside a field.&lt;br&gt;
Modify the value (e.g., updating a business_id or correcting a review text).&lt;br&gt;
Click Update to save the change.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwi3owmj9teoo308tlqio.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwi3owmj9teoo308tlqio.jpg" alt=" " width="800" height="421"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;CONCLUSION&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Using MongoDB Compass, we performed key database operations visually:&lt;br&gt;
Aggregating data (average ratings).&lt;br&gt;
Filtering with regex.&lt;br&gt;
Querying by IDs.&lt;br&gt;
Deleting and updating documents.&lt;/p&gt;

</description>
    </item>
  </channel>
</rss>
