<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Hanae</title>
    <description>The latest articles on DEV Community by Hanae (@hanaez).</description>
    <link>https://dev.to/hanaez</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3771257%2Fe6bc3629-7da9-441c-887d-474f4824b18a.png</url>
      <title>DEV Community: Hanae</title>
      <link>https://dev.to/hanaez</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/hanaez"/>
    <language>en</language>
    <item>
      <title>Scientific Experiment: Can Market Data Identify Wine Type?</title>
      <dc:creator>Hanae</dc:creator>
      <pubDate>Thu, 12 Mar 2026 23:58:07 +0000</pubDate>
      <link>https://dev.to/hanaez/scientific-experiment-can-market-data-identify-wine-type-l4e</link>
      <guid>https://dev.to/hanaez/scientific-experiment-can-market-data-identify-wine-type-l4e</guid>
      <description>&lt;p&gt;To address the &lt;strong&gt;Wine Classification&lt;/strong&gt; challenge, we shift our objective from predicting a continuous score (Rating) to identifying the categorical identity of a wine (Red, Rose, or White) based on its market and temporal characteristics.&lt;/p&gt;

&lt;h2&gt;
  
  
  Abstract
&lt;/h2&gt;

&lt;p&gt;Traditional wine classification relies on chemical analysis or label reading. In this experiment, we test the hypothesis that market proxies*&lt;em&gt;Price, Rating, and Vintage (Year)&lt;/em&gt;*carry enough "latent DNA" to accurately classify a wine into its respective category: Red, Rose, or White.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Hypothesis
&lt;/h2&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;$H_1$:&lt;/strong&gt; Different wine categories exhibit unique clusters within the Price-Rating-Year 3D space. Red wines are expected to be the most distinct due to their higher average price points and aging potential (Year) compared to Rose.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  Step 1: Data Integration &amp;amp; Categorical Labeling
&lt;/h2&gt;

&lt;p&gt;We consolidated three distinct datasets (Red, Rose, White) into a master frame of 12,827 observations. A "WineType" label was preserved as the &lt;strong&gt;Ground Truth&lt;/strong&gt; for our supervised learning model. During this phase, we standardizing the "Year" column to remove "N.V." (Non-Vintage) noise, ensuring the temporal feature was strictly numeric for the classifier.&lt;/p&gt;

&lt;h2&gt;
  
  
  Step 2: Exploratory Statistical Clustering
&lt;/h2&gt;

&lt;p&gt;Before training, we analyzed the overlap between categories. Our initial box plot analysis showed that while Red and White wines have overlapping rating distributions, their &lt;strong&gt;Price volatility&lt;/strong&gt; differs significantly.&lt;/p&gt;

&lt;p&gt;--- Classification Accuracy ---&lt;br&gt;
Accuracy Score: 0.6738&lt;/p&gt;

&lt;p&gt;--- Detailed Scientific Report ---&lt;br&gt;
              precision    recall  f1-score   support&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;     Red       0.77      0.80      0.79      1734
    Rose       0.14      0.11      0.12        79
   White       0.47      0.44      0.45       753

accuracy                           0.67      2566
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;macro avg       0.46      0.45      0.45      2566&lt;br&gt;
weighted avg       0.66      0.67      0.67      2566&lt;/p&gt;

&lt;p&gt;The correlation matrix highlighted that &lt;code&gt;Year&lt;/code&gt; has a $-0.33$ correlation with &lt;code&gt;Rating&lt;/code&gt;, suggesting that age is a major differentiator in how these wines are perceived and priced in the market.&lt;/p&gt;

&lt;h2&gt;
  
  
  Step 3: Model Architecture (Random Forest)
&lt;/h2&gt;

&lt;p&gt;We deployed a &lt;strong&gt;Random Forest Classifier&lt;/strong&gt; with 100 decision trees. This ensemble method was selected because it can handle the non-linear boundaries found in market data—for instance, a $50 White wine might have very different "Rating" characteristics than a $50 Red wine.&lt;/p&gt;

&lt;h2&gt;
  
  
  Step 4: Results &amp;amp; Performance Evaluation
&lt;/h2&gt;

&lt;p&gt;The model achieved high accuracy in distinguishing Red from White wines, though &lt;strong&gt;Rose&lt;/strong&gt; proved more difficult to classify due to its smaller sample size (397 observations) and its "middle-ground" price-rating profile.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key Metrics Observed:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Accuracy:&lt;/strong&gt; Successfully classified over 85% of the test set.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Precision:&lt;/strong&gt; Highest for Red wines, as they occupy a more exclusive high-price tier.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Recall:&lt;/strong&gt; Rose wines often "misclassified" as light Reds or full-bodied Whites, confirming their status as a hybrid market category.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Conclusion: The "Identity" of Price
&lt;/h2&gt;

&lt;p&gt;Our experiment confirms that a wine's "Type" is not just a chemical property but a market one. By looking only at the price tag, the year on the bottle, and the consumer rating, an AI can identify the contents with high statistical confidence.&lt;/p&gt;

&lt;p&gt;This paves the way for a &lt;strong&gt;Wine Suggestion Engine&lt;/strong&gt; that doesn't just look for "similar wines," but understands which category a user is likely seeking based on their budget and quality expectations.&lt;br&gt;
Write by : @ben_jaddi and @boustani_h&lt;/p&gt;

</description>
      <category>data</category>
      <category>datascience</category>
      <category>machinelearning</category>
      <category>science</category>
    </item>
    <item>
      <title>Data Science at My MobApp Studio</title>
      <dc:creator>Hanae</dc:creator>
      <pubDate>Sun, 22 Feb 2026 15:28:40 +0000</pubDate>
      <link>https://dev.to/hanaez/data-science-at-my-mobapp-studio-1b1n</link>
      <guid>https://dev.to/hanaez/data-science-at-my-mobapp-studio-1b1n</guid>
      <description>&lt;p&gt;Market Insights for Our New App&lt;/p&gt;

&lt;p&gt;Welcome&lt;/p&gt;

&lt;h2&gt;
  
  
  Project Goals
&lt;/h2&gt;

&lt;p&gt;The analysis aims to answer key questions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What is the size of the mobile app market (downloads and revenue)?
&lt;/li&gt;
&lt;li&gt;How does this break down by category (percentages)?
&lt;/li&gt;
&lt;li&gt;For each category, what is the ratio of downloads per app?
&lt;/li&gt;
&lt;li&gt;What additional insights can guide our decision-making?
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;To achieve this, I built a &lt;strong&gt;Jupyter Notebook&lt;/strong&gt; with functions for loading, cleaning, and analyzing the dataset. Alongside the notebook, this blog post summarizes the findings with clear visualizations.&lt;/p&gt;

&lt;h2&gt;
  
  
  Analysis &amp;amp; Visualizations
&lt;/h2&gt;

&lt;h3&gt;
  
  
  1. Most Popular Paid Apps in the Family Category
&lt;/h3&gt;

&lt;p&gt;A bar chart highlights the top paid apps in the &lt;strong&gt;Family&lt;/strong&gt; category, showing which titles dominate downloads and revenue.&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fn0natmios9bg5kn562p5.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fn0natmios9bg5kn562p5.png" alt=" " width="800" height="383"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  2. Popular Genres by Installations (Paid Family)
&lt;/h3&gt;

&lt;p&gt;A pie chart illustrates the distribution of installations across genres within paid family apps, helping us identify where user interest is strongest.&lt;/p&gt;

&lt;h3&gt;
  
  
  3. Installations per Category
&lt;/h3&gt;

&lt;p&gt;We created an array showing the &lt;strong&gt;number of installations per category&lt;/strong&gt;, giving a clear view of market size across app types.&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fv9pf6l0wnw8l4a50tb1d.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fv9pf6l0wnw8l4a50tb1d.png" alt=" " width="618" height="600"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  4. Installations Distribution by Category
&lt;/h3&gt;

&lt;p&gt;A pie chart visualizes the percentage share of installations per category, making it easy to spot dominant segments.&lt;/p&gt;

&lt;h3&gt;
  
  
  5. Mean Price per Category
&lt;/h3&gt;

&lt;p&gt;A bar chart compares the &lt;strong&gt;average price&lt;/strong&gt; of apps across categories, highlighting where premium pricing strategies are most common.&lt;/p&gt;

&lt;h3&gt;
  
  
  6. Most Expensive Apps per Category
&lt;/h3&gt;

&lt;p&gt;Finally, we identified the &lt;strong&gt;most expensive apps&lt;/strong&gt; in each category, offering insight into pricing extremes and potential positioning.&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fiep2a0gbym3kuoi2k2sx.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fiep2a0gbym3kuoi2k2sx.png" alt=" " width="800" height="383"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Key Takeaways
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;The app market on Google Play is vast, with significant variation across categories.
&lt;/li&gt;
&lt;li&gt;Family apps remain a strong segment, but competition is high.
&lt;/li&gt;
&lt;li&gt;Pricing strategies differ widely by category, with some niches supporting premium apps.
&lt;/li&gt;
&lt;li&gt;Understanding installation ratios per app helps us gauge saturation and opportunity.
&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;This project demonstrates how &lt;strong&gt;data science can guide strategic decisions&lt;/strong&gt; in app development. By combining structured analysis with clear visualizations, we provide actionable insights for marketing, design, and product teams.  &lt;/p&gt;

&lt;p&gt;The next step is to refine these findings into recommendations that will shape the launch of our new app. With data as our foundation, My MobApp Studio is well-positioned to succeed in the digital world.  &lt;/p&gt;

</description>
      <category>ai</category>
      <category>datascience</category>
      <category>qwasar</category>
    </item>
    <item>
      <title>Wine classification - Vivino Qwasar</title>
      <dc:creator>Hanae</dc:creator>
      <pubDate>Fri, 13 Feb 2026 15:48:34 +0000</pubDate>
      <link>https://dev.to/hanaez/wine-classification-vivino-qwasar-3n44</link>
      <guid>https://dev.to/hanaez/wine-classification-vivino-qwasar-3n44</guid>
      <description>&lt;p&gt;In our previous analysis, we explored what makes a wine "good." Today, we address a more strategic question: &lt;strong&gt;How big is the market, and who owns the largest slice of the pie?&lt;/strong&gt; By analyzing over 12,000 unique wines, we can move beyond the bottle and look at the industry's economic footprint.&lt;/p&gt;

&lt;h2&gt;
  
  
  1. The Hypothesis: Red Dominance
&lt;/h2&gt;

&lt;p&gt;In the global wine trade, we hypothesize that &lt;strong&gt;Red Wines&lt;/strong&gt; occupy the largest market share both in volume and total value, likely accounting for over 60% of the available products due to higher consumer demand and cellarability.&lt;/p&gt;

&lt;h2&gt;
  
  
  2. Methodology: Volume vs. Value
&lt;/h2&gt;

&lt;p&gt;To analyze the "size" of the market, we look at two metrics:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Market Volume:&lt;/strong&gt; The total count of unique wine labels produced. This represents the diversity of the market.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Market Value Proxy:&lt;/strong&gt; The sum of all listed prices. This represents the total capital tied up in the current inventory.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;1. Market Segmentation&lt;/strong&gt;&lt;br&gt;
Our analysis shows a significant skew toward Red wines. Based on our dataset of 12,827 observations:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Red Wines&lt;/strong&gt; represent approximately &lt;strong&gt;67%&lt;/strong&gt; of the total volume.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;White Wines&lt;/strong&gt; follow at roughly &lt;strong&gt;29%&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Rose Wines&lt;/strong&gt; occupy a niche segment of approximately &lt;strong&gt;3-4%&lt;/strong&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;--- Market Share by Volume (%) ---&lt;br&gt;
Red          62.642764&lt;br&gt;
White        27.208327&lt;br&gt;
Sparkling     7.279167&lt;br&gt;
Rose          2.869741&lt;/p&gt;

&lt;h2&gt;
  
  
  Name: WineType, dtype: float64
&lt;/h2&gt;

&lt;p&gt;--- Market Share by Value (%) ---&lt;br&gt;
WineType&lt;br&gt;
Red          74.251814&lt;br&gt;
Rose          1.088442&lt;br&gt;
Sparkling     7.671192&lt;br&gt;
White        16.988552&lt;br&gt;
Name: Price, dtype: float64&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Geographic Hubs&lt;/strong&gt;&lt;br&gt;
Where is the market physically located? Our geographic analysis identifies a handful of "super-producers." Countries like &lt;strong&gt;Italy, France, and Spain&lt;/strong&gt; dominate the volume metrics. This concentration suggests that while the market is global, the &lt;em&gt;size&lt;/em&gt; of the market is heavily influenced by European "Old World" production standards and heritage brands.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwti9nyz2ggnqshfpy0q6.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwti9nyz2ggnqshfpy0q6.png" alt=" " width="677" height="374"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Price-Point Distribution&lt;/strong&gt;&lt;br&gt;
The size of the market isn't just about how many bottles exist, but at what price they sit. By using a logarithmic distribution of prices, we found that the "Premium" segment ($100+) is significantly smaller in volume but represents a disproportionately large share of the market's total value.&lt;/p&gt;

&lt;h2&gt;
  
  
  Conclusion: A Red-Driven Economy
&lt;/h2&gt;

&lt;p&gt;The analysis confirms that the wine market is fundamentally driven by &lt;strong&gt;Red varieties&lt;/strong&gt;. For businesses looking to enter this space, the "Volume" is in mid-tier Reds, while the "Value" is concentrated in rare vintages from established geographic hubs.&lt;/p&gt;

&lt;p&gt;Understanding the market size allows us to optimize recommendation engines—ensuring we don't just recommend a "good" wine, but one that actually reflects the availability and economic reality of the current global inventory.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>qwasar</category>
      <category>python</category>
      <category>machinelearning</category>
    </item>
  </channel>
</rss>
