<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Naveen Gokul</title>
    <description>The latest articles on DEV Community by Naveen Gokul (@naveengokul).</description>
    <link>https://dev.to/naveengokul</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3602120%2Fb3fc2597-a3f4-4137-8ac4-8973f4fcaac4.jpg</url>
      <title>DEV Community: Naveen Gokul</title>
      <link>https://dev.to/naveengokul</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/naveengokul"/>
    <language>en</language>
    <item>
      <title>🧹 Data Cleaning Challenge with Pandas (Google Colab)</title>
      <dc:creator>Naveen Gokul</dc:creator>
      <pubDate>Sat, 08 Nov 2025 05:12:16 +0000</pubDate>
      <link>https://dev.to/naveengokul/data-cleaning-challenge-with-pandas-google-colab-4llj</link>
      <guid>https://dev.to/naveengokul/data-cleaning-challenge-with-pandas-google-colab-4llj</guid>
      <description>&lt;h1&gt;
  
  
  🧹 Data Cleaning Challenge with Pandas (Google Colab)
&lt;/h1&gt;

&lt;p&gt;Data cleaning is one of the most crucial steps in any data science or analytics project. In this challenge, I worked on a real-world dataset from &lt;strong&gt;Kaggle&lt;/strong&gt; with over &lt;strong&gt;100,000 rows&lt;/strong&gt;, performing various &lt;strong&gt;Pandas operations&lt;/strong&gt; to clean, preprocess, and prepare it for further analysis.&lt;/p&gt;




&lt;h2&gt;
  
  
  📂 Dataset Details
&lt;/h2&gt;

&lt;p&gt;For this challenge, I selected the &lt;strong&gt;E-commerce Sales Dataset&lt;/strong&gt; from Kaggle containing around &lt;strong&gt;120,000 rows and 12 columns&lt;/strong&gt;.&lt;br&gt;&lt;br&gt;
It includes data such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;🧾 Order ID
&lt;/li&gt;
&lt;li&gt;👤 Customer Name
&lt;/li&gt;
&lt;li&gt;🛒 Product &amp;amp; Quantity
&lt;/li&gt;
&lt;li&gt;💰 Sales &amp;amp; Discount
&lt;/li&gt;
&lt;li&gt;🌍 Region
&lt;/li&gt;
&lt;li&gt;📅 Order Date
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Before Cleaning:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Rows → 120,000
&lt;/li&gt;
&lt;li&gt;Columns → 12
&lt;/li&gt;
&lt;li&gt;File format → &lt;code&gt;.csv&lt;/code&gt;
&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  ⚙️ Tools &amp;amp; Environment
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Python 3&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Google Colab&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Libraries:&lt;/strong&gt; Pandas, NumPy, Matplotlib
&lt;/li&gt;
&lt;/ul&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;
python
from google.colab import files
uploaded = files.upload()

import pandas as pd
df = pd.read_csv('ecommerce_sales.csv')
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

</description>
    </item>
    <item>
      <title>My Experience on NoSQL Data Analysis Using Dataset From Kaggle</title>
      <dc:creator>Naveen Gokul</dc:creator>
      <pubDate>Sat, 08 Nov 2025 05:01:00 +0000</pubDate>
      <link>https://dev.to/naveengokul/my-experience-on-nosql-data-analysis-using-dataset-from-kaggle-51jf</link>
      <guid>https://dev.to/naveengokul/my-experience-on-nosql-data-analysis-using-dataset-from-kaggle-51jf</guid>
      <description>&lt;p&gt;``# 🧩 MongoDB Atlas: Insert, Query, Update, Delete, and Export Data&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Author:&lt;/strong&gt; NAVEEN GOKUL S&lt;br&gt;
&lt;strong&gt;Date:&lt;/strong&gt; November 2025&lt;br&gt;
&lt;strong&gt;Topic:&lt;/strong&gt; &lt;em&gt;Data Engineering Assignment — MongoDB CRUD Operations&lt;/em&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  🗂️ Step 1: Setting up MongoDB Atlas
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;Go to &lt;a href="https://www.mongodb.com/atlas" rel="noopener noreferrer"&gt;&lt;strong&gt;MongoDB Atlas&lt;/strong&gt;&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;Create a free cluster (use the &lt;strong&gt;Shared Tier&lt;/strong&gt; option).&lt;/li&gt;
&lt;li&gt;Under &lt;strong&gt;Network Access&lt;/strong&gt;, add your IP:&lt;/li&gt;
&lt;/ol&gt;

&lt;ul&gt;
&lt;li&gt;Click &lt;strong&gt;Network Access → Add IP Address → Allow access from anywhere (0.0.0.0/0)&lt;/strong&gt;.

&lt;ol&gt;
&lt;li&gt;Create a database user and remember the credentials.
Example:&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;code&gt;`&lt;br&gt;
   Username: 22cs098&lt;br&gt;
   Password: NAVEEN&lt;br&gt;
   `&lt;/code&gt;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Once your cluster is ready, click &lt;strong&gt;“Connect → Connect using MongoDB Shell”&lt;/strong&gt; and copy the connection string.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fl4kkyurycaly48yuwi7i.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fl4kkyurycaly48yuwi7i.png" alt=" " width="800" height="622"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  💻 Step 2: Connect from Mongo Shell
&lt;/h2&gt;

&lt;p&gt;Open &lt;strong&gt;PowerShell&lt;/strong&gt; or &lt;strong&gt;Command Prompt&lt;/strong&gt;, then run:&lt;/p&gt;

&lt;p&gt;&lt;code&gt;`bash&lt;br&gt;
mongosh "mongodb+srv://m0.wpjmxqh.mongodb.net/" --apiVersion 1 --username 22cs098_db_user&lt;br&gt;
`&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;Then enter your password when prompted:&lt;/p&gt;

&lt;p&gt;&lt;code&gt;`&lt;br&gt;
Enter password: NAVEEN&lt;br&gt;
`&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;If connection succeeds, you’ll see:&lt;/p&gt;

&lt;p&gt;&lt;code&gt;`&lt;br&gt;
Atlas atlas-xxxx-shard-0 [primary]&amp;gt;&lt;br&gt;
`&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F1am7tlzgbyn70eqdfiol.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F1am7tlzgbyn70eqdfiol.png" alt=" " width="800" height="425"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  📥 Step 3: Create a Database and Insert Records
&lt;/h2&gt;

&lt;p&gt;Switch to a database (it will auto-create):&lt;/p&gt;

&lt;p&gt;&lt;code&gt;`javascript&lt;br&gt;
use businessDB&lt;br&gt;
`&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;Insert 10 sample business review records:&lt;/p&gt;

&lt;p&gt;&lt;code&gt;`javascript&lt;br&gt;
db.reviews.insertMany([&lt;br&gt;
  { "business_id": "B001", "name": "Cafe Aroma", "rating": 4.6, "review": "Good food and fast service!", "date": "2025-11-07" },&lt;br&gt;
  { "business_id": "B002", "name": "Pizza Palace", "rating": 4.8, "review": "Amazing crust and cheese quality!", "date": "2025-11-07" },&lt;br&gt;
  { "business_id": "B003", "name": "Tea Time", "rating": 4.2, "review": "Nice ambience and friendly staff.", "date": "2025-11-07" },&lt;br&gt;
  { "business_id": "B004", "name": "Sweet Treats", "rating": 3.9, "review": "Desserts were good but service was slow.", "date": "2025-11-07" },&lt;br&gt;
  { "business_id": "B005", "name": "Veggie Delight", "rating": 4.1, "review": "Healthy food with good taste.", "date": "2025-11-07" },&lt;br&gt;
  { "business_id": "B006", "name": "Burger Hub", "rating": 4.9, "review": "Best burgers ever!", "date": "2025-11-07" },&lt;br&gt;
  { "business_id": "B007", "name": "Ocean Dine", "rating": 4.7, "review": "Fresh seafood and great view.", "date": "2025-11-07" },&lt;br&gt;
  { "business_id": "B008", "name": "Spice Route", "rating": 3.8, "review": "Food was okay, but spicy.", "date": "2025-11-07" },&lt;br&gt;
  { "business_id": "B009", "name": "Bakers Street", "rating": 4.5, "review": "Good pastries and coffee.", "date": "2025-11-07" },&lt;br&gt;
  { "business_id": "B010", "name": "Quick Bite", "rating": 4.0, "review": "Good service and clean place.", "date": "2025-11-07" }&lt;br&gt;
])&lt;br&gt;
`&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fg3dr6xruk55m8xaed1ev.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fg3dr6xruk55m8xaed1ev.png" alt=" " width="800" height="423"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  🔍 Step 4: Queries
&lt;/h2&gt;

&lt;h3&gt;
  
  
  🏆 4.1 Top 5 Businesses by Rating
&lt;/h3&gt;

&lt;p&gt;&lt;code&gt;`javascript&lt;br&gt;
db.reviews.find().sort({ rating: -1 }).limit(5)&lt;br&gt;
`&lt;/code&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  🔤 4.2 Count of Reviews Containing “good”
&lt;/h3&gt;

&lt;p&gt;&lt;code&gt;`javascript&lt;br&gt;
db.reviews.countDocuments({ review: /good/i })&lt;br&gt;
`&lt;/code&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  🏪 4.3 Get Reviews for a Specific Business ID
&lt;/h3&gt;

&lt;p&gt;&lt;code&gt;`javascript&lt;br&gt;
db.reviews.find({ business_id: "B005" })&lt;br&gt;
`&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fi1f1hj4yeznvaa3au6yg.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fi1f1hj4yeznvaa3au6yg.png" alt=" " width="800" height="416"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  ✏️ Step 5: Update and Delete
&lt;/h2&gt;

&lt;h3&gt;
  
  
  ✏️ Update a Review
&lt;/h3&gt;

&lt;p&gt;&lt;code&gt;`javascript&lt;br&gt;
db.reviews.updateOne(&lt;br&gt;
  { business_id: "B005" },&lt;br&gt;
  { $set: { rating: 4.3, review: "Updated: Great taste and fresh ingredients!" } }&lt;br&gt;
)&lt;br&gt;
`&lt;/code&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  🗑️ Delete a Record
&lt;/h3&gt;

&lt;p&gt;&lt;code&gt;`javascript&lt;br&gt;
db.reviews.deleteOne({ business_id: "B010" })&lt;br&gt;
`&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqwsb1m5uavitwrx24vr4.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqwsb1m5uavitwrx24vr4.png" alt=" " width="800" height="123"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  📤 Step 6: Export Data to JSON/CSV
&lt;/h2&gt;

&lt;p&gt;Exit Mongo shell:&lt;/p&gt;

&lt;p&gt;&lt;code&gt;`bash&lt;br&gt;
exit&lt;br&gt;
`&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;Then run the following &lt;strong&gt;from PowerShell (not inside mongosh)&lt;/strong&gt; 👇&lt;/p&gt;

&lt;h3&gt;
  
  
  📄 Export as CSV
&lt;/h3&gt;

&lt;p&gt;&lt;code&gt;`bash&lt;br&gt;
mongoexport --uri="mongodb+srv://22cs098_db_user:NAVEEN@m0.wpjmxqh.mongodb.net/businessDB" --collection=reviews --type=csv --fields=business_id,name,rating,review,date --out=reviews.csv&lt;br&gt;
`&lt;/code&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  📦 Export as JSON
&lt;/h3&gt;

&lt;p&gt;&lt;code&gt;`bash&lt;br&gt;
mongoexport --uri="mongodb+srv://22cs098_db_user:NAVEEN@m0.wpjmxqh.mongodb.net/businessDB" --collection=reviews --out=reviews.json&lt;br&gt;
`&lt;/code&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  📊 Step 7: View the Exported Files
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Open &lt;strong&gt;reviews.csv&lt;/strong&gt; in Excel or VS Code.&lt;/li&gt;
&lt;li&gt;Open &lt;strong&gt;reviews.json&lt;/strong&gt; in any text editor.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgpxp8fln9t5vv7zbgvz1.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgpxp8fln9t5vv7zbgvz1.png" alt=" " width="800" height="420"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  ✅ Step 8: Summary
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Operation&lt;/th&gt;
&lt;th&gt;Command Type&lt;/th&gt;
&lt;th&gt;Example&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Insert&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;code&gt;insertMany()&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;Add 10 reviews&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Query&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;find()&lt;/code&gt;, &lt;code&gt;countDocuments()&lt;/code&gt;
&lt;/td&gt;
&lt;td&gt;Search data&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Update&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;code&gt;updateOne()&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;Modify rating/review&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Delete&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;code&gt;deleteOne()&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;Remove record&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Export&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;code&gt;mongoexport&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;CSV/JSON output&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;




&lt;h2&gt;
  
  
  🎯 Final Thoughts
&lt;/h2&gt;

&lt;p&gt;MongoDB Atlas makes it easy to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Manage cloud-hosted databases&lt;/li&gt;
&lt;li&gt;Perform CRUD operations&lt;/li&gt;
&lt;li&gt;Export results in multiple formats&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This project demonstrates all essential MongoDB operations — perfect for &lt;strong&gt;Data Engineering&lt;/strong&gt; and &lt;strong&gt;Database Management&lt;/strong&gt; learning tasks.&lt;/p&gt;

</description>
      <category>database</category>
      <category>analytics</category>
      <category>tutorial</category>
      <category>mongodb</category>
    </item>
  </channel>
</rss>
