DEV Community

Jimmy Guerrero for Voxel51

Posted on May 2

Computer Vision Meetup: Who Needs RLHF When You Have SFT?

#computervision #ai #machinelearning #datascience

This talk will center on Reinforcement Learning from Human Feedback (RLHF) and, more importantly, on why it is needed at all over Supervised Fine-Tuning (SFT). It will also explain, in accessible terms, some of the open problems in RLHF that academic research is currently working on.

Speaker: Srishti Gureja is an ML engineer and researcher broadly interested in two things: ML efficiency techniques, including but not limited to designing algorithms that make maximum use of the hardware at hand, and LLM alignment using literature from RL. She is currently working with Eleuther AI and Alex Havrilla from Georgia Tech on better, simpler methods for aligning language models. Her full-time job is as an ML Engineer at Writesonic, a YC-backed startup.

Not a Meetup member? Sign up to attend the next event:

https://voxel51.com/computer-vision-ai-meetups/

Recorded on May 2, 2024 at the AI, Machine Learning and Data Science Meetup.
