Alignment Faking in Large Language Models: Could AI Be Deceiving Us? Jainil Prajapati, Dec 30 '24. #aisafety #llms #largelanguagemodels #reinforcementlearnin. 18 min read
Deep Dive: OpenAI's o1 - The Dawn of Deliberate AI. Rohit Agarwal for Portkey, Dec 9 '24. #o1models #systemcard #chainofthought #aisafety. 8 min read