<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: ivanmartiolme</title>
    <description>The latest articles on DEV Community by ivanmartiolme (@ivanmartiolme).</description>
    <link>https://dev.to/ivanmartiolme</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3936331%2Fe665cfb8-148b-422e-bea5-e9e3a25dfb11.png</url>
      <title>DEV Community: ivanmartiolme</title>
      <link>https://dev.to/ivanmartiolme</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/ivanmartiolme"/>
    <language>en</language>
    <item>
      <title>Customer Review Analysis with NLP: Sentiment Analysis and Topic Modeling using Python</title>
      <dc:creator>ivanmartiolme</dc:creator>
      <pubDate>Sun, 17 May 2026 13:44:41 +0000</pubDate>
      <link>https://dev.to/evolve-space/customer-review-analysis-with-nlp-sentiment-analysis-and-topic-modeling-using-python-2mim</link>
      <guid>https://dev.to/evolve-space/customer-review-analysis-with-nlp-sentiment-analysis-and-topic-modeling-using-python-2mim</guid>
      <description>&lt;h1&gt;
  
  
  Customer Review Analysis with NLP: Sentiment Analysis and Topic Modeling using Python
&lt;/h1&gt;

&lt;p&gt;Customer reviews contain valuable information about user satisfaction, service quality, and business performance. However, manually analyzing thousands of reviews is inefficient and often impossible at scale.&lt;/p&gt;

&lt;p&gt;In this project, I developed a Natural Language Processing (NLP) pipeline to automatically analyze customer reviews and extract actionable business insights from unstructured text data.&lt;/p&gt;

&lt;p&gt;The full project repository is available here:&lt;br&gt;
&lt;strong&gt;GitHub:&lt;/strong&gt; &lt;a href="https://github.com/ivanmartiolme/Proyecto-Master-DataScience-Evolve-IvanMartinez" rel="noopener noreferrer"&gt;https://github.com/ivanmartiolme/Proyecto-Master-DataScience-Evolve-IvanMartinez&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Project Objectives
&lt;/h2&gt;

&lt;p&gt;The main goals of this project were:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Clean and preprocess customer review text&lt;/li&gt;
&lt;li&gt;Perform sentiment analysis to classify reviews as positive or negative&lt;/li&gt;
&lt;li&gt;Identify recurring discussion topics automatically&lt;/li&gt;
&lt;li&gt;Analyze sentiment by topic&lt;/li&gt;
&lt;li&gt;Generate business insights from textual data&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Tools and Technologies
&lt;/h2&gt;

&lt;p&gt;The project was developed using:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Python&lt;/li&gt;
&lt;li&gt;Pandas&lt;/li&gt;
&lt;li&gt;NumPy&lt;/li&gt;
&lt;li&gt;Scikit-learn&lt;/li&gt;
&lt;li&gt;Transformers (Hugging Face)&lt;/li&gt;
&lt;li&gt;Datasets (Hugging Face)&lt;/li&gt;
&lt;li&gt;Matplotlib&lt;/li&gt;
&lt;li&gt;Jupyter Notebook&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Methodology
&lt;/h2&gt;

&lt;h3&gt;
  
  
  1. Data Collection and Preparation
&lt;/h3&gt;

&lt;p&gt;Customer reviews were loaded into a structured DataFrame for analysis. A full text preprocessing pipeline was applied, including:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Lowercasing&lt;/li&gt;
&lt;li&gt;URL removal&lt;/li&gt;
&lt;li&gt;Punctuation and number removal&lt;/li&gt;
&lt;li&gt;Whitespace normalization&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This ensured better model performance and cleaner topic extraction.&lt;/p&gt;

&lt;h3&gt;
  
  
  2. Sentiment Analysis
&lt;/h3&gt;

&lt;p&gt;A pretrained Transformer-based sentiment analysis model was used to classify each review as positive or negative.&lt;/p&gt;

&lt;p&gt;This allowed measurement of overall customer satisfaction and helped identify problematic areas.&lt;/p&gt;

&lt;h3&gt;
  
  
  3. Topic Modeling
&lt;/h3&gt;

&lt;p&gt;To uncover the most frequent themes discussed by customers, TF-IDF vectorization and Non-negative Matrix Factorization (NMF) were applied.&lt;/p&gt;

&lt;p&gt;This revealed recurring topics such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Airport parking and transportation&lt;/li&gt;
&lt;li&gt;Customer service experience&lt;/li&gt;
&lt;li&gt;Pricing and additional fees&lt;/li&gt;
&lt;li&gt;Refund and cancellation issues&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  4. Insight Generation
&lt;/h3&gt;

&lt;p&gt;Combining sentiment analysis with topic modeling made it possible to detect which topics generated the most negative feedback and where service improvements could be prioritized.&lt;/p&gt;

&lt;h2&gt;
  
  
  Key Learnings
&lt;/h2&gt;

&lt;p&gt;This project demonstrated how NLP techniques can transform large volumes of unstructured customer feedback into meaningful business intelligence.&lt;/p&gt;

&lt;p&gt;It also reinforced practical skills in:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Text preprocessing&lt;/li&gt;
&lt;li&gt;Transformer-based sentiment analysis&lt;/li&gt;
&lt;li&gt;Topic modeling&lt;/li&gt;
&lt;li&gt;Data storytelling&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;Customer review analysis using NLP can significantly improve decision-making and customer experience strategies.&lt;/p&gt;

&lt;p&gt;This project was developed as part of the Master in Data Science at Evolve.&lt;/p&gt;

&lt;p&gt;Portfolio web:&lt;br&gt;
&lt;a href="https://ivanmartiolme.github.io/Proyecto-Master-DataScience-Evolve-IvanMartinez/" rel="noopener noreferrer"&gt;https://ivanmartiolme.github.io/Proyecto-Master-DataScience-Evolve-IvanMartinez/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Linkedin:&lt;br&gt;
&lt;a href="https://www.linkedin.com/pulse/an%C3%A1lisis-de-rese%C3%B1as-clientes-con-nlp-sentimiento-y-martinez-olmedo-rutae/" rel="noopener noreferrer"&gt;https://www.linkedin.com/pulse/an%C3%A1lisis-de-rese%C3%B1as-clientes-con-nlp-sentimiento-y-martinez-olmedo-rutae/&lt;/a&gt;&lt;/p&gt;

</description>
      <category>python</category>
      <category>machinelearning</category>
      <category>nlp</category>
      <category>datascience</category>
    </item>
  </channel>
</rss>
