<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Ranak Ghosh</title>
    <description>The latest articles on DEV Community by Ranak Ghosh (@ranak_ghosh_59b3080779a92).</description>
    <link>https://dev.to/ranak_ghosh_59b3080779a92</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3663277%2Fdb14d876-7a5c-4a9d-afe9-964de378eef5.jpg</url>
      <title>DEV Community: Ranak Ghosh</title>
      <link>https://dev.to/ranak_ghosh_59b3080779a92</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/ranak_ghosh_59b3080779a92"/>
    <language>en</language>
    <item>
      <title>BiasAwareFeedback: Detecting Textual Bias with NLP (Mini-Research Project)</title>
      <dc:creator>Ranak Ghosh</dc:creator>
      <pubDate>Mon, 15 Dec 2025 15:56:01 +0000</pubDate>
      <link>https://dev.to/ranak_ghosh_59b3080779a92/biasawarefeedback-detecting-textual-bias-with-nlp-mini-research-project-1og</link>
      <guid>https://dev.to/ranak_ghosh_59b3080779a92/biasawarefeedback-detecting-textual-bias-with-nlp-mini-research-project-1og</guid>
      <description>&lt;h1&gt;
  
  
  Bias-Aware Automated Feedback System for Student Writing
&lt;/h1&gt;

&lt;h2&gt;
  
  
  Limitations, Reproducibility, and Research Positioning
&lt;/h2&gt;




&lt;h3&gt;
  
  
  A. System Limitations
&lt;/h3&gt;

&lt;p&gt;Despite promising qualitative results, this project has several important limitations:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Model Dependence&lt;/strong&gt;&lt;br&gt;
The bias detection component relies on a locally hosted large language model (LLaMA 3 via Ollama). While this enables free, offline experimentation, it introduces variability in outputs depending on model version, prompt phrasing, and inference temperature.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Non-deterministic Outputs&lt;/strong&gt;&lt;br&gt;
Since large language models are generative, identical inputs may yield slightly different outputs across runs. This limits strict reproducibility of exact results, although trends and qualitative behaviors remain consistent.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Synthetic Evaluation Data&lt;/strong&gt;&lt;br&gt;
Many bias tests rely on synthetically modified text (e.g., demographic swap tests). While this is common in fairness research, it may not fully capture real-world linguistic complexity.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Lack of Human Evaluation&lt;/strong&gt;&lt;br&gt;
This project does not include large-scale human annotation or expert evaluation of feedback quality. Results are therefore primarily machine- and prompt-based.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Resource Constraints&lt;/strong&gt;&lt;br&gt;
The project was intentionally designed to run on consumer-grade hardware (4–8GB VRAM). As a result, model size and inference depth are limited compared to cloud-based systems.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;




&lt;h3&gt;
  
  
  B. Reproducibility Strategy
&lt;/h3&gt;

&lt;p&gt;Although full determinism is not guaranteed, the project emphasizes &lt;strong&gt;procedural reproducibility&lt;/strong&gt;, meaning that another researcher can follow the same steps and reach comparable conclusions.&lt;/p&gt;

&lt;p&gt;Reproducibility is ensured through:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Open-source code hosted on GitHub&lt;/li&gt;
&lt;li&gt;Explicit dependency listing (&lt;code&gt;requirements.txt&lt;/code&gt;)&lt;/li&gt;
&lt;li&gt;Clear directory structure (&lt;code&gt;src/&lt;/code&gt;, &lt;code&gt;paper/&lt;/code&gt;, &lt;code&gt;results/&lt;/code&gt;)&lt;/li&gt;
&lt;li&gt;Prompt templates embedded directly in the source code&lt;/li&gt;
&lt;li&gt;Local inference via Ollama (no API keys required)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;To reproduce the experiments:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Install Ollama and download the LLaMA 3 model&lt;/li&gt;
&lt;li&gt;Clone the GitHub repository&lt;/li&gt;
&lt;li&gt;Run the bias detection module on provided sample texts&lt;/li&gt;
&lt;li&gt;Observe qualitative differences across biased vs neutral inputs&lt;/li&gt;
&lt;/ol&gt;




&lt;h3&gt;
  
  
  C.  Research Ethics and Safety Considerations
&lt;/h3&gt;

&lt;p&gt;Bias analysis inherently involves sensitive topics such as gender, race, and socioeconomic status. To mitigate harm:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;No personal data is used&lt;/li&gt;
&lt;li&gt;All test sentences are synthetic or anonymized&lt;/li&gt;
&lt;li&gt;Outputs are framed as &lt;strong&gt;analytical observations&lt;/strong&gt;, not judgments&lt;/li&gt;
&lt;li&gt;The system avoids reinforcing stereotypes by explicitly labeling detected bias&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This aligns with responsible AI research practices.&lt;/p&gt;




&lt;h3&gt;
  
  
  D.  Intended Contributions
&lt;/h3&gt;

&lt;p&gt;Although small in scale, this project contributes the following:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;A &lt;strong&gt;fully local, free bias analysis pipeline&lt;/strong&gt; using modern LLMs&lt;/li&gt;
&lt;li&gt;A practical demonstration of fairness-aware NLP principles&lt;/li&gt;
&lt;li&gt;A reproducible template for student-led AI ethics research&lt;/li&gt;
&lt;li&gt;A bridge between theory (bias/fairness) and deployment (local inference)&lt;/li&gt;
&lt;/ul&gt;




&lt;h3&gt;
  
  
  E.  Positioning as a Research Mini-Project
&lt;/h3&gt;

&lt;p&gt;This work is intentionally framed as a &lt;strong&gt;research-style mini project&lt;/strong&gt;, not a production system. Its value lies in:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Clear research motivation&lt;/li&gt;
&lt;li&gt;Explicit assumptions and limitations&lt;/li&gt;
&lt;li&gt;Structured experimentation&lt;/li&gt;
&lt;li&gt;Ethical awareness&lt;/li&gt;
&lt;li&gt;Transparent reporting&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These qualities are central to undergraduate research programs and academic evaluation.&lt;/p&gt;




&lt;h3&gt;
  
  
  F. Future Work
&lt;/h3&gt;

&lt;p&gt;Several extensions are possible:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Quantitative benchmarking with labeled bias datasets&lt;/li&gt;
&lt;li&gt;Human evaluation studies&lt;/li&gt;
&lt;li&gt;Prompt optimization experiments&lt;/li&gt;
&lt;li&gt;Cross-model comparisons&lt;/li&gt;
&lt;li&gt;Integration with educational writing tools&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;&lt;strong&gt;Summary:&lt;/strong&gt;&lt;br&gt;
This section demonstrates that the project is not only functional but also &lt;em&gt;scientifically reasoned&lt;/em&gt;, ethically grounded, and reproducible—key qualities of credible research.&lt;/p&gt;

</description>
      <category>machinelearning</category>
      <category>nlp</category>
      <category>python</category>
      <category>biasdetection</category>
    </item>
  </channel>
</rss>
