<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Sugithaa K</title>
    <description>The latest articles on DEV Community by Sugithaa K (@sugithaa_k).</description>
    <link>https://dev.to/sugithaa_k</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3461168%2F122a1796-366c-4ee9-bd7a-c5d1af3b5a81.png</url>
      <title>DEV Community: Sugithaa K</title>
      <link>https://dev.to/sugithaa_k</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/sugithaa_k"/>
    <language>en</language>
    <item>
      <title>Data in Cloud</title>
      <dc:creator>Sugithaa K</dc:creator>
      <pubDate>Mon, 06 Oct 2025 05:20:05 +0000</pubDate>
      <link>https://dev.to/sugithaa_k/data-in-cloud-apd</link>
      <guid>https://dev.to/sugithaa_k/data-in-cloud-apd</guid>
      <description>&lt;p&gt;Data analytics in the cloud requires efficient ways to store, organize, and process data.&lt;br&gt;
Depending on the use case, different data formats are used — some are human-readable, while others are optimized for speed and scalability.&lt;/p&gt;

&lt;p&gt;In this blog, we’ll explore six popular data formats used in cloud-based data analytics:&lt;br&gt;
CSV, SQL, JSON, Parquet, XML, and Avro.&lt;/p&gt;

&lt;p&gt;For each format, we’ll:&lt;br&gt;
 -&amp;gt;Explain it in simple terms&lt;br&gt;
 -&amp;gt;Show a small dataset &lt;br&gt;
 -&amp;gt;Represent the dataset in that format&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1)CSV (Comma Separated Values)&lt;/strong&gt;&lt;br&gt;
 &lt;strong&gt;Explanation:&lt;/strong&gt;&lt;br&gt;
   CSV is the simplest text-based data format.&lt;br&gt;
   Each row in a CSV file represents one record, and the fields are separated by commas (,).&lt;br&gt;
It’s widely used because it’s easy to create and can be opened in Excel or Google Sheets.&lt;br&gt;
&lt;strong&gt;Our Dataset Example&lt;/strong&gt;&lt;br&gt;
 We’ll use a simple dataset of three students and their marks:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;-&lt;/th&gt;
&lt;th&gt;Name&lt;/th&gt;
&lt;th&gt;Register_No&lt;/th&gt;
&lt;th&gt;Subject&lt;/th&gt;
&lt;th&gt;Marks&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Kavya&lt;/td&gt;
&lt;td&gt;101&lt;/td&gt;
&lt;td&gt;Cloud Computing&lt;/td&gt;
&lt;td&gt;95&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Ravi&lt;/td&gt;
&lt;td&gt;102&lt;/td&gt;
&lt;td&gt;Data Analytics&lt;/td&gt;
&lt;td&gt;88&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Meena&lt;/td&gt;
&lt;td&gt;103&lt;/td&gt;
&lt;td&gt;AI &amp;amp; ML&lt;/td&gt;
&lt;td&gt;91&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;Dataset in CSV Format:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Name,Register_No,Subject,Marks&lt;br&gt;
Kavya,101,Cloud Computing,95&lt;br&gt;
Ravi,102,Data Analytics,88&lt;br&gt;
Meena,103,AI &amp;amp; ML,91&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;-&amp;gt; Advantages:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Easy to read and edit manually.&lt;/li&gt;
&lt;li&gt;Compatible with most software and tools.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;-&amp;gt; Disadvantages:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;No schema (data type information missing).&lt;/li&gt;
&lt;li&gt;Not suitable for complex or nested data.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;2)SQL (Relational Table Format)&lt;/strong&gt;&lt;br&gt;
&lt;strong&gt;Explanation:&lt;/strong&gt;&lt;br&gt;
SQL (Structured Query Language) stores data in tables with defined columns and data types.&lt;br&gt;
It’s used in relational databases such as MySQL, PostgreSQL, or Oracle.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Our Dataset Example&lt;/strong&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;-&lt;/th&gt;
&lt;th&gt;Name&lt;/th&gt;
&lt;th&gt;Register_No&lt;/th&gt;
&lt;th&gt;Subject&lt;/th&gt;
&lt;th&gt;Marks&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Kavya&lt;/td&gt;
&lt;td&gt;101&lt;/td&gt;
&lt;td&gt;Cloud Computing&lt;/td&gt;
&lt;td&gt;95&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Ravi&lt;/td&gt;
&lt;td&gt;102&lt;/td&gt;
&lt;td&gt;Data Analytics&lt;/td&gt;
&lt;td&gt;88&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Meena&lt;/td&gt;
&lt;td&gt;103&lt;/td&gt;
&lt;td&gt;AI &amp;amp; ML&lt;/td&gt;
&lt;td&gt;91&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;Dataset in SQL Format:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;CREATE TABLE Students (&lt;br&gt;
  Name VARCHAR(50),&lt;br&gt;
  Register_No INT,&lt;br&gt;
  Subject VARCHAR(50),&lt;br&gt;
  Marks INT&lt;br&gt;
);&lt;/p&gt;

&lt;p&gt;INSERT INTO Students (Name, Register_No, Subject, Marks) VALUES&lt;br&gt;
('Kavya', 101, 'Cloud Computing', 95),&lt;br&gt;
('Ravi', 102, 'Data Analytics', 88),&lt;br&gt;
('Meena', 103, 'AI &amp;amp; ML', 91);&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;-&amp;gt; Advantages:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Data is structured and well-organized.&lt;/li&gt;
&lt;li&gt;Easy to query and analyze using SQL commands.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;-&amp;gt; Disadvantages:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Requires a database engine.&lt;/li&gt;
&lt;li&gt;Not suitable for unstructured or flexible data.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;3)JSON (JavaScript Object Notation)&lt;/strong&gt;&lt;br&gt;
&lt;strong&gt;Explanation:&lt;/strong&gt;&lt;br&gt;
JSON is a lightweight text-based format used to exchange data between applications.&lt;br&gt;
It stores data as key–value pairs, making it easy for computers and humans to read.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Our Dataset Example&lt;/strong&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;-&lt;/th&gt;
&lt;th&gt;Name&lt;/th&gt;
&lt;th&gt;Register_No&lt;/th&gt;
&lt;th&gt;Subject&lt;/th&gt;
&lt;th&gt;Marks&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Kavya&lt;/td&gt;
&lt;td&gt;101&lt;/td&gt;
&lt;td&gt;Cloud Computing&lt;/td&gt;
&lt;td&gt;95&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Ravi&lt;/td&gt;
&lt;td&gt;102&lt;/td&gt;
&lt;td&gt;Data Analytics&lt;/td&gt;
&lt;td&gt;88&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Meena&lt;/td&gt;
&lt;td&gt;103&lt;/td&gt;
&lt;td&gt;AI &amp;amp; ML&lt;/td&gt;
&lt;td&gt;91&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;Dataset in JSON Format:&lt;/strong&gt;&lt;br&gt;
[&lt;br&gt;
  {&lt;br&gt;
    "Name": "Kavya",&lt;br&gt;
    "Register_No": 101,&lt;br&gt;
    "Subject": "Cloud Computing",&lt;br&gt;
    "Marks": 95&lt;br&gt;
  },&lt;br&gt;
  {&lt;br&gt;
    "Name": "Ravi",&lt;br&gt;
    "Register_No": 102,&lt;br&gt;
    "Subject": "Data Analytics",&lt;br&gt;
    "Marks": 88&lt;br&gt;
  },&lt;br&gt;
  {&lt;br&gt;
    "Name": "Meena",&lt;br&gt;
    "Register_No": 103,&lt;br&gt;
    "Subject": "AI &amp;amp; ML",&lt;br&gt;
    "Marks": 91&lt;br&gt;
  }&lt;br&gt;
]&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;-&amp;gt; Advantages:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Readable and easy to use in web APIs.&lt;/li&gt;
&lt;li&gt;Supports nested structures (objects, arrays).&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;-&amp;gt; Disadvantages:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Slightly larger in size compared to CSV.&lt;/li&gt;
&lt;li&gt;Parsing can be slower for very large files.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;4)Parquet (Columnar Storage Format)&lt;/strong&gt;&lt;br&gt;
&lt;strong&gt;Explanation:&lt;/strong&gt;&lt;br&gt;
Parquet is a binary, columnar storage format designed for big data analytics.&lt;br&gt;
Instead of saving data row by row, it stores data by columns, which reduces storage space and speeds up queries.&lt;br&gt;
It’s used in systems like Apache Spark, Hadoop, AWS Athena, and Google BigQuery.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Our Dataset Example&lt;/strong&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;-&lt;/th&gt;
&lt;th&gt;Name&lt;/th&gt;
&lt;th&gt;Register_No&lt;/th&gt;
&lt;th&gt;Subject&lt;/th&gt;
&lt;th&gt;Marks&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Kavya&lt;/td&gt;
&lt;td&gt;101&lt;/td&gt;
&lt;td&gt;Cloud Computing&lt;/td&gt;
&lt;td&gt;95&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Ravi&lt;/td&gt;
&lt;td&gt;102&lt;/td&gt;
&lt;td&gt;Data Analytics&lt;/td&gt;
&lt;td&gt;88&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Meena&lt;/td&gt;
&lt;td&gt;103&lt;/td&gt;
&lt;td&gt;AI &amp;amp; ML&lt;/td&gt;
&lt;td&gt;91&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;Dataset in Parquet Format (Conceptual Representation):&lt;/strong&gt;&lt;br&gt;
Parquet File (Binary Representation)&lt;/p&gt;

&lt;p&gt;Columns:&lt;br&gt;
Name: [Kavya, Ravi, Meena]&lt;br&gt;
Register_No: [101, 102, 103]&lt;br&gt;
Subject: [Cloud Computing, Data Analytics, AI &amp;amp; ML]&lt;br&gt;
Marks: [95, 88, 91]&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;-&amp;gt; Advantages:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Highly compressed and efficient for analytical queries.&lt;/li&gt;
&lt;li&gt;Excellent performance in cloud big data systems.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;-&amp;gt; Disadvantages:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Not readable manually.&lt;/li&gt;
&lt;li&gt;Needs specific software to open or process.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;XML (Extensible Markup Language)&lt;/strong&gt;&lt;br&gt;
&lt;strong&gt;Explanation:&lt;/strong&gt;&lt;br&gt;
XML represents data using tags similar to HTML.&lt;br&gt;
It’s structured and self-descriptive, making it useful for web services and document storage.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Our Dataset Example&lt;/strong&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;-&lt;/th&gt;
&lt;th&gt;Name&lt;/th&gt;
&lt;th&gt;Register_No&lt;/th&gt;
&lt;th&gt;Subject&lt;/th&gt;
&lt;th&gt;Marks&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Kavya&lt;/td&gt;
&lt;td&gt;101&lt;/td&gt;
&lt;td&gt;Cloud Computing&lt;/td&gt;
&lt;td&gt;95&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Ravi&lt;/td&gt;
&lt;td&gt;102&lt;/td&gt;
&lt;td&gt;Data Analytics&lt;/td&gt;
&lt;td&gt;88&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Meena&lt;/td&gt;
&lt;td&gt;103&lt;/td&gt;
&lt;td&gt;AI &amp;amp; ML&lt;/td&gt;
&lt;td&gt;91&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;Dataset in XML Format:&lt;/strong&gt;&lt;br&gt;
&lt;br&gt;
  &lt;br&gt;
    Kavya&lt;br&gt;
    101&lt;br&gt;
    Cloud Computing&lt;br&gt;
    95&lt;br&gt;
  &lt;br&gt;
  &lt;br&gt;
    Ravi&lt;br&gt;
    102&lt;br&gt;
    Data Analytics&lt;br&gt;
    88&lt;br&gt;
  &lt;br&gt;
  &lt;br&gt;
    Meena&lt;br&gt;
    103&lt;br&gt;
    AI &amp;amp; ML&lt;br&gt;
    91&lt;br&gt;
  &lt;br&gt;
&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;-&amp;gt; Advantages:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Good for hierarchical (tree-like) data.&lt;/li&gt;
&lt;li&gt;Self-descriptive structure.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;-&amp;gt; Disadvantages:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Very verbose (takes more space).&lt;/li&gt;
&lt;li&gt;Slower parsing compared to JSON.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;6)Avro (Row-based Storage Format)&lt;/strong&gt;&lt;br&gt;
&lt;strong&gt;Explanation:&lt;/strong&gt;&lt;br&gt;
Avro is a binary, row-based storage format developed by Apache for use in Hadoop ecosystems.&lt;br&gt;
It stores both data and schema, which makes it great for streaming and scalable systems.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Our Dataset Example&lt;/strong&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;-&lt;/th&gt;
&lt;th&gt;Name&lt;/th&gt;
&lt;th&gt;Register_No&lt;/th&gt;
&lt;th&gt;Subject&lt;/th&gt;
&lt;th&gt;Marks&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Kavya&lt;/td&gt;
&lt;td&gt;101&lt;/td&gt;
&lt;td&gt;Cloud Computing&lt;/td&gt;
&lt;td&gt;95&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Ravi&lt;/td&gt;
&lt;td&gt;102&lt;/td&gt;
&lt;td&gt;Data Analytics&lt;/td&gt;
&lt;td&gt;88&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;-&lt;/td&gt;
&lt;td&gt;Meena&lt;/td&gt;
&lt;td&gt;103&lt;/td&gt;
&lt;td&gt;AI &amp;amp; ML&lt;/td&gt;
&lt;td&gt;91&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;Dataset in Avro Format:&lt;/strong&gt;&lt;br&gt;
&lt;strong&gt;Schema (in JSON format):&lt;/strong&gt;&lt;br&gt;
{&lt;br&gt;
  "type": "record",&lt;br&gt;
  "name": "Student",&lt;br&gt;
  "fields": [&lt;br&gt;
    {"name": "Name", "type": "string"},&lt;br&gt;
    {"name": "Register_No", "type": "int"},&lt;br&gt;
    {"name": "Subject", "type": "string"},&lt;br&gt;
    {"name": "Marks", "type": "int"}&lt;br&gt;
  ]&lt;br&gt;
}&lt;br&gt;
&lt;strong&gt;Data (Conceptual):&lt;/strong&gt;&lt;br&gt;
Row 1: Kavya, 101, Cloud Computing, 95&lt;br&gt;
Row 2: Ravi, 102, Data Analytics, 88&lt;br&gt;
Row 3: Meena, 103, AI &amp;amp; ML, 91&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;-&amp;gt; Advantages:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Compact binary format (saves space).&lt;/li&gt;
&lt;li&gt;Schema evolution supported (easy to change fields).&lt;/li&gt;
&lt;li&gt;Ideal for big data and streaming (Kafka, Hadoop).&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;-&amp;gt; Disadvantages:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Not human-readable.&lt;/li&gt;
&lt;li&gt;Needs Avro libraries to read or write.&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>cloud</category>
      <category>data</category>
      <category>dataengineering</category>
    </item>
    <item>
      <title>Hands-On with MongoDB: Storing, Querying, and Analyzing Data</title>
      <dc:creator>Sugithaa K</dc:creator>
      <pubDate>Wed, 27 Aug 2025 04:53:51 +0000</pubDate>
      <link>https://dev.to/sugithaa_k/hands-on-with-mongodb-storing-querying-and-analyzing-data-d01</link>
      <guid>https://dev.to/sugithaa_k/hands-on-with-mongodb-storing-querying-and-analyzing-data-d01</guid>
      <description>&lt;p&gt;In this tutorial, I explored MongoDB, a NoSQL database, to learn how to store, query, and analyze data. I worked with a sample dataset of business reviews and performed common operations like insertion, aggregation, search, update, and deletion.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Step 1: Setup MongoDB:&lt;/strong&gt;&lt;br&gt;
 I installed MongoDB Compass and connected to my local database. Here’s how the dashboard looks:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6xov5v1uvup6e0pfb7sf.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6xov5v1uvup6e0pfb7sf.jpg" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Step 2: Insert Sample Data&lt;/strong&gt;&lt;br&gt;
  I inserted 10 sample business reviews manually into the collection using Compass in JSON mode. Here are the documents:[&lt;br&gt;
  { "business_id": 1, "name": "Cafe One", "rating": 4, "review": "Good food and service" },&lt;br&gt;
  { "business_id": 2, "name": "Pizza Place", "rating": 5, "review": "Excellent pizza, good staff" },&lt;br&gt;
  { "business_id": 3, "name": "Tea Corner", "rating": 3, "review": "Average taste but good location" },&lt;br&gt;
  { "business_id": 4, "name": "Burger Hub", "rating": 2, "review": "Not good, very slow service" },&lt;br&gt;
  { "business_id": 5, "name": "Sushi World", "rating": 5, "review": "Fresh sushi, good experience" },&lt;br&gt;
  { "business_id": 6, "name": "Taco House", "rating": 4, "review": "Good tacos and friendly staff" },&lt;br&gt;
  { "business_id": 7, "name": "Pasta Point", "rating": 3, "review": "Average pasta but good ambience" },&lt;br&gt;
  { "business_id": 8, "name": "Biryani Express", "rating": 5, "review": "Good biryani, loved it" },&lt;br&gt;
  { "business_id": 9, "name": "Coffee Bar", "rating": 4, "review": "Good coffee, nice place to relax" },&lt;br&gt;
  { "business_id": 10, "name": "Ice Cream Land", "rating": 5, "review": "Very good flavors and service" }&lt;br&gt;
]&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9mmpmlj4t3ftaq6karg8.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9mmpmlj4t3ftaq6karg8.jpg" alt=" " width="800" height="670"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Step 3: Running Queries&lt;/strong&gt;&lt;br&gt;
  &lt;strong&gt;3.1 Top 5 Businesses with Highest Average Rating&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Aggregation query:&lt;/strong&gt;&lt;br&gt;
 [&lt;br&gt;
  { "$group": { "_id": "$business_id", "avgRating": { "$avg": "$rating" } } },&lt;br&gt;
  { "$sort": { "avgRating": -1 } },&lt;br&gt;
  { "$limit": 5 }&lt;br&gt;
]&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flc93hw0gtk67go6f428w.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flc93hw0gtk67go6f428w.jpg" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Explanation:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I used an aggregation pipeline to group by business_id and calculate the average rating, then sorted descending to find the top 5.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3.2 Count Reviews Containing “Good”&lt;/strong&gt;&lt;br&gt;
&lt;strong&gt;Filter query:&lt;/strong&gt;&lt;br&gt;
  { "review": { "$regex": "good", "$options": "i" } }&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0kxb37uprivkzfmut8i3.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0kxb37uprivkzfmut8i3.jpg" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Explanation:&lt;/strong&gt;&lt;br&gt;
Using a regex filter, I found all reviews containing the word ‘good’ (case-insensitive).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3.3 Get Reviews for a Specific Business&lt;/strong&gt;&lt;br&gt;
Query example for business_id = 2:&lt;br&gt;
  { "business_id": 2 }&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdfup4bj032j62jx7q2nn.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdfup4bj032j62jx7q2nn.jpg" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Explanation:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This query retrieves all reviews for a specific business.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3.4 Update a Review&lt;/strong&gt;&lt;br&gt;
  &lt;strong&gt;Query example:&lt;/strong&gt;&lt;br&gt;
     { "$set": { "review": "Updated review: Really good service and tasty food!" } }&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F7fjd4fz1azslx3xsenje.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F7fjd4fz1azslx3xsenje.jpg" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Explanation:&lt;/strong&gt;&lt;br&gt;
  I updated the review for business_id = 1 to reflect a more detailed feedback.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3.5 Delete a Record&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fbqeml0gda82hmhhlmnkw.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fbqeml0gda82hmhhlmnkw.jpg" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;br&gt;
 &lt;strong&gt;Explanation:&lt;/strong&gt;&lt;br&gt;
   I deleted one record from the collection to demonstrate the deletion operation.&lt;/p&gt;

</description>
    </item>
  </channel>
</rss>
