DEV Community

Hassan Ahmed

Posted on Jul 17, 2025

Fine-Tuning DeepSeek-R1 for Medical QA Using LoRA + Chain-of-Thought

#llm #deepseek #deeplearning #ai

Just published a step-by-step guide on how I fine-tuned DeepSeek-R1 to handle medical question answering using LoRA and chain-of-thought prompting.

✅ Lightweight fine-tuning with Unsloth
✅ Clinical reasoning dataset
✅ Before vs. after results that actually show improvement

Read the full post here:
👉 https://medium.com/@hassanahmed.dev1/fine-tuning-deepseek-r1-for-medical-question-answering-using-lora-d5f74e747e98

Feel free to check out the GitHub repo too!

machinelearning #llm #genai #ai #deeplearning #medtech #huggingface #transformers

Top comments (0)

Subscribe

Hassan Ahmed

Started with curiosity about how machines understand language now diving deep into AI/ML, LLMs, and NLP through hands-on learning. Still growing, still building. Open to global AI opportunities.

Location

Karachi Pakistan
Education

Learned programming fundamentals, web dev, and Python via Khan Academy & freeCodeCamp.
Pronouns

He/Him
Joined

Jun 18, 2025