Just published a step-by-step guide on how I fine-tuned DeepSeek-R1 to handle medical question answering using LoRA and chain-of-thought prompting.
β
Lightweight fine-tuning with Unsloth
β
Clinical reasoning dataset
β
Before vs. after results that actually show improvement
Read the full post here:
π https://medium.com/@hassanahmed.dev1/fine-tuning-deepseek-r1-for-medical-question-answering-using-lora-d5f74e747e98
Feel free to check out the GitHub repo too!
Top comments (0)