<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Deblina Roy</title>
    <description>The latest articles on DEV Community by Deblina Roy (@deblina111).</description>
    <link>https://dev.to/deblina111</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3845732%2F33c48aaf-c04b-4208-8637-e3109c1740d2.jpeg</url>
      <title>DEV Community: Deblina Roy</title>
      <link>https://dev.to/deblina111</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/deblina111"/>
    <language>en</language>
    <item>
      <title>🧬 Not All Heart Disease Is the Same - So I Built an AI to Prove It</title>
      <dc:creator>Deblina Roy</dc:creator>
      <pubDate>Sat, 28 Mar 2026 10:18:36 +0000</pubDate>
      <link>https://dev.to/deblina111/not-all-heart-disease-is-the-same-so-i-built-an-ai-to-prove-it-22ic</link>
      <guid>https://dev.to/deblina111/not-all-heart-disease-is-the-same-so-i-built-an-ai-to-prove-it-22ic</guid>
      <description>&lt;h2&gt;
  
  
  &lt;strong&gt;The Question That Started It All&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;A few months ago, while diving into cardiovascular data, I found myself asking a simple question: &lt;strong&gt;Why do two heart patients with the exact same diagnosis respond so differently to the same treatment?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Clinically, they are labeled the same. But biologically? That didn’t feel right. Coming from a background in Microbiology, I’ve always been fascinated by the invisible "mechanics" of the cell. Transitioning into Data Science at Northwestern allowed me to finally quantify those mechanics at scale.&lt;/p&gt;

&lt;p&gt;I started imagining three distinct patients:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Patient A:&lt;/strong&gt; Struggles because of inherited genetic and metabolic "fuel" issues.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Patient B:&lt;/strong&gt; Damage is driven by a hyper-active immune system (inflammation).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Patient C:&lt;/strong&gt; The heart is slowly stiffening due to excessive scar tissue (fibrosis).&lt;/p&gt;

&lt;p&gt;That’s the gap this project aims to bridge. Yet, we often treat them with a "one-size-fits-all" approach. I built this AI portal to prove we can do better.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Multi-Omics Integration&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;If the disease is different at a molecular level, then the data already knows and we just aren’t looking at all the layers at once. Instead of using one dataset, I decided to integrate four biological "chapters" of the same story:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Genomics:&lt;/strong&gt; The blueprint you’re born with (&lt;a href="https://www.ebi.ac.uk/gwas/" rel="noopener noreferrer"&gt;GWAS Catalog&lt;/a&gt;)&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Transcriptomics:&lt;/strong&gt; What your genes are actually doing (&lt;a href="https://gtexportal.org/home/" rel="noopener noreferrer"&gt;GTEx Portal&lt;/a&gt;).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Proteomics:&lt;/strong&gt; The functional machinery (Proteins).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Metabolomics:&lt;/strong&gt; The downstream biochemical consequences.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;[&lt;/strong&gt; &lt;strong&gt;Experience the Portal:&lt;/strong&gt; &lt;a href="https://multi-omic-heart-disease.streamlit.app/" rel="noopener noreferrer"&gt;https://multi-omic-heart-disease.streamlit.app/&lt;/a&gt; &lt;strong&gt;]&lt;/strong&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;Incremental Pipeline Validation&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;I tested each integration layer sequentially using &lt;strong&gt;&lt;a href="https://link.springer.com/article/10.15252/msb.20178124" rel="noopener noreferrer"&gt;MOFA+ (Multi-Omics Factor Analysis)&lt;/a&gt;&lt;/strong&gt;:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Phase&lt;/th&gt;
&lt;th&gt;Data Layers&lt;/th&gt;
&lt;th&gt;Silhouette Score&lt;/th&gt;
&lt;th&gt;Change&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Phase 1&lt;/td&gt;
&lt;td&gt;Genomics + Transcriptomics&lt;/td&gt;
&lt;td&gt;0.0659&lt;/td&gt;
&lt;td&gt;Baseline&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Phase 2&lt;/td&gt;
&lt;td&gt;+ Proteomics&lt;/td&gt;
&lt;td&gt;0.1247&lt;/td&gt;
&lt;td&gt;+88%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Phase 3&lt;/td&gt;
&lt;td&gt;+ Metabolomics&lt;/td&gt;
&lt;td&gt;0.1834&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;+178% total&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;This 178% improvement in clustering quality proved that biological signals are not just additive, they are synergistic.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fviv2cew70doaz3y0lcso.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fviv2cew70doaz3y0lcso.png" alt="Performance Comparison" width="800" height="508"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Figure 1:&lt;/strong&gt; &lt;em&gt;Evolution of the pipeline from Phase 1 (Baseline) to Phase 3 (+178% cluster separation).&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;Three Molecular Subtypes&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;The model identified three disease subtypes with distinct biomarker patterns:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Subtype 0: Energy Metabolism&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Mitochondrial dysfunction phenotype with reduced ATP production.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Aspect&lt;/th&gt;
&lt;th&gt;Details&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Root Cause&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Genetic predisposition + mitochondrial dysfunction&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Biomarker Profile&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;High PRS, Troponin I, NT-proBNP&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Clinical Feature&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Reduced cardiac output&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Treatment Target&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Metabolic support, AMPK activators&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;Subtype 1: Inflammatory&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Immune dysregulation phenotype with elevated pro-inflammatory markers.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Aspect&lt;/th&gt;
&lt;th&gt;Details&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Root Cause&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Autoimmune-mediated cardiac inflammation&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Biomarker Profile&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Elevated IL-6, CRP, TNF-α&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Clinical Feature&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Immune dysregulation&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Treatment Target&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Anti-inflammatory drugs, TNF inhibitors&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;Subtype 2: Fibrotic&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Pathological fibrosis phenotype with excessive collagen accumulation.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Aspect&lt;/th&gt;
&lt;th&gt;Details&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Root Cause&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Pathological cardiac fibrosis&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Biomarker Profile&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Elevated TGF-β, TIMP1&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Clinical Feature&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Diastolic dysfunction, stiffness&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Treatment Target&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Anti-fibrotic agents (Finerenone, SGLT2i)&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fjusi1y57ifnct2kxy4az.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fjusi1y57ifnct2kxy4az.png" alt="Heatmap showing biomarker profiles for each of the 3 subtypes" width="800" height="619"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Figure 2:&lt;/strong&gt; &lt;em&gt;The "Molecular Fingerprint" of Heart Disease. This heatmap reveals how specific biomarkers from Genomic risk (PRS) to Inflammatory cytokines (IL-6) cluster into three distinct, actionable subtypes. Green indicates high expression, proving that each subtype requires a different therapeutic focus.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;The proof: Key Metrics&lt;/strong&gt;
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Metric&lt;/th&gt;
&lt;th&gt;Result&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Patient Samples&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;387 real patients&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Integrated Features&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;50+ across 4 omics layers&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Cross-Validation Accuracy&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;94.2%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;AUC-ROC Score&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;0.947 (Excellent)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Balanced Accuracy&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;91.8%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Clustering Improvement&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;+178% (Silhouette: 0.0659 → 0.1834)&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fttozt57c8v536kf8s8sb.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fttozt57c8v536kf8s8sb.png" alt="Bar chart of Accuracy (94.2%), AUC-ROC (0.947), Balanced Accuracy (91.8%)" width="800" height="474"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Figure 3:&lt;/strong&gt; &lt;em&gt;Model performance metrics across cross-validation folds demonstrate robust stratification capability.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;These aren't vanity metrics; they represent &lt;strong&gt;real predictive power&lt;/strong&gt; for stratifying patients into actionable subtypes.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Technical Stack&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Data Integration&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;PCA for dimensionality reduction&lt;/li&gt;
&lt;li&gt;MOFA+ for probabilistic factor analysis&lt;/li&gt;
&lt;li&gt;Variance filtering for feature selection&lt;/li&gt;
&lt;li&gt;Layer-wise normalization&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;ML Pipeline&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;K-means clustering (k=3)&lt;/li&gt;
&lt;li&gt;Random Forest classification&lt;/li&gt;
&lt;li&gt;SHAP for feature importance&lt;/li&gt;
&lt;li&gt;Stratified cross-validation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Visualization &amp;amp; Deployment&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Streamlit for interactive portal&lt;/li&gt;
&lt;li&gt;Plotly for dynamic visualizations&lt;/li&gt;
&lt;li&gt;GitHub for version control&lt;/li&gt;
&lt;li&gt;Streamlit Cloud for live deployment&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;The Portal: Patient-Friendly Design&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F1tkxtv03znfol64ywbs9.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F1tkxtv03znfol64ywbs9.png" alt="Screenshot of the live portal showing predictor page" width="800" height="524"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Fig 3: &lt;em&gt;The interactive portal guides patients through symptom checklist, biomarker entry, and personalized risk assessment.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;One challenge: &lt;strong&gt;making clinical AI understandable to non-scientists&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;I implemented 6 accessibility features:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1. Symptom Checklist First&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Before entering biomarker numbers, patients check symptoms they experience. This helps them understand early on what to look for.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Visual Biomarker Meters&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Instead of just numbers, each biomarker shows:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Color gradient (green → yellow → red)&lt;/li&gt;
&lt;li&gt;Status indicator (Low/Moderate/High)&lt;/li&gt;
&lt;li&gt;Plain English explanation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;3. Risk Rating&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;🟢 HIGH CONFIDENCE (94.2%)
   "The model is very confident in this result"

🟡 MODERATE CONFIDENCE (65%)
   "Confirm with your doctor"

🔴 LOW CONFIDENCE (45%)
   "Need additional testing"
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;4. "What This Means For You"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;For each subtype, the portal shows:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Common symptoms to watch&lt;/li&gt;
&lt;li&gt;Lifestyle changes that help&lt;/li&gt;
&lt;li&gt;Medications your doctor might suggest&lt;/li&gt;
&lt;li&gt;5 questions to ask your cardiologist&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;5. You vs. Average Comparison&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"How do my markers compare to typical patients with this subtype?"&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;6. Trust &amp;amp; Credibility&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Why should patients believe this?&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Based on 387 real patient samples&lt;/li&gt;
&lt;li&gt;94.2% validation accuracy&lt;/li&gt;
&lt;li&gt;Reviewed by cardiologists&lt;/li&gt;
&lt;li&gt;BUT: This is NOT a diagnosis. See your doctor.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Data Sources&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;GTEx Project&lt;/strong&gt;: Gene expression in healthy heart tissue&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;GWAS Catalog&lt;/strong&gt;: Genetic variants associated with heart disease&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Clinical cohorts&lt;/strong&gt;: Real patient biomarker data&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Public databases&lt;/strong&gt;: Protein and metabolite information&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;Key Insights&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;1. Multi-Omics &amp;gt; Single-Omics&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;No single data layer provides complete molecular classification. Integration improves discrimination power by 178%.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Explainability is Essential for Clinical Adoption&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Model performance metrics alone don't guarantee clinical utility. Patient-friendly explanations and confidence scoring are equally important.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Normalization Prevents Layer Dominance&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Biological datasets have different scales. Independent normalization per layer prevents high-variance omics from overwhelming low-variance layers.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;4. Validation is Non-Negotiable&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Stratified cross-validation, AUC-ROC, balanced accuracy to measure everything. For prognosis in healthcare, accuracy directly impacts patient outcomes.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;5. Domain Knowledge Improves Model Interpretation&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Understanding that IL-6 indicates ongoing inflammation helps explain &lt;em&gt;why&lt;/em&gt; subtypes cluster together. Biological plausibility validates model decisions.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Next Steps &amp;amp; Future Directions&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Longitudinal tracking (how subtypes evolve over time)&lt;/li&gt;
&lt;li&gt;Imaging integration (echocardiography, cardiac MRI)&lt;/li&gt;
&lt;li&gt;Survival prediction per subtype&lt;/li&gt;
&lt;li&gt;Personalized drug response prediction&lt;/li&gt;
&lt;li&gt;HIPAA compliance for real patient deployment&lt;/li&gt;
&lt;li&gt;Clinical validation studies (prospective)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;📚 Open Science &amp;amp; Reproducibility&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Everything is open-source on GitHub:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Complete ML pipeline (Python scripts)
&lt;/li&gt;
&lt;li&gt;5 Jupyter analysis notebooks
&lt;/li&gt;
&lt;li&gt;Sample datasets &amp;amp; visualizations
&lt;/li&gt;
&lt;li&gt;Full documentation (literature review, methods, concepts)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;👉 &lt;strong&gt;&lt;a href="https://github.com/roy-deblina/multi-omic-heart-disease" rel="noopener noreferrer"&gt;Explore the Repository&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;Conclusion &amp;amp; Clinical Implications&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Multi-omic integration enables molecular stratification of clinically-labeled disease.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Current cardiac diagnosis relies on ejection fraction and symptoms alone. These are downstream manifestations of three distinct underlying mechanisms:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Metabolic dysfunction&lt;/li&gt;
&lt;li&gt;Immune dysregulation&lt;/li&gt;
&lt;li&gt;Fibrotic remodeling&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Each requires different therapeutic targeting. This work demonstrates that existing clinical biomarkers, when integrated computationally, can reveal actionable patient subtypes before specialized testing.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key Takeaways&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Heart disease diagnosis shouldn't be a guessing game. By moving beyond the surface and integrating 4 layers of molecular data, we can identify these "hidden" disease types before they lead to irreversible damage. This isn't just data science; it's the future of precision cardiology.&lt;/p&gt;

&lt;h2&gt;
  
  
  🤝 Let's Connect!
&lt;/h2&gt;

&lt;p&gt;I'm currently a Data Science graduate student at Northwestern University, and I'd love to hear your thoughts on precision cardiology and explainable AI.&lt;/p&gt;

&lt;p&gt;LinkedIn: &lt;a href="https://www.linkedin.com/in/deblina555/" rel="noopener noreferrer"&gt;https://www.linkedin.com/in/deblina555/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Drop your comments below!&lt;/strong&gt; 👇&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;📖 Key References&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Argelaguet et al. (2018).&lt;/strong&gt; Multi-Omics Factor Analysis. &lt;em&gt;Molecular Systems Biology&lt;/em&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Subramanian et al. (2005).&lt;/strong&gt; Gene Set Enrichment Analysis. &lt;em&gt;PNAS&lt;/em&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Lundberg &amp;amp; Lee (2017).&lt;/strong&gt; A Unified Approach to Interpreting Model Predictions. &lt;em&gt;NeurIPS&lt;/em&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  🔗 Datasets Used
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;a href="https://gtexportal.org" rel="noopener noreferrer"&gt;GTEx Portal&lt;/a&gt; - Gene expression reference&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.ebi.ac.uk/gwas" rel="noopener noreferrer"&gt;GWAS Catalog&lt;/a&gt; - Genetic variants&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://www.ukbiobank.ac.uk" rel="noopener noreferrer"&gt;UK Biobank&lt;/a&gt; - Population data&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>machinelearning</category>
      <category>python</category>
      <category>datascience</category>
      <category>healthcare</category>
    </item>
  </channel>
</rss>
