MACa.3

Self-Play Preference Optimization for Language Model Alignment

Yue Wu, UCLA, United States; Zhiqing Sun, CMU, United States; Huizhuo Yuan, Kaixuan Ji, UCLA, United States; Yiming Yang, CMU, United States; Quanquan Gu, UCLA, United States

Session:
MACa: Mathematics in Generative AI Lecture

Track:
Adaptive Systems, Machine Learning, and Data Analytics

Location:
Chapel

Presentation Time:
Mon, 28 Oct, 09:05 - 09:30 PT (UTC -7)

Presentation
Discussion
Resources
No resources available.
Contacts