Direct Preference OptimizationJinen SetpalMar 28, 2024Slides VideoRLHF Theory Deep LearningJinen SetpalECE PhD @ PurdueResearch under Interpretable Domain Generalization.