Direct Preference OptimizationJinen SetpalMar 28, 2024Slides VideoRLHF Theory Deep LearningJinen SetpalML Research Intern @ AppleDeep Learning Optimization Theory & Intrinsic Interpretability