Mon, 28 Oct, 08:15 - 08:40 PT (UTC -8)
MACa.1: White-Box Transformers via Sparse Rate Reduction
Yaodong Yu, University of California, Berkeley, United States; Sam Buchanan, Toyota Technological Institute at Chicago, United States; Druv Pai, University of California, Berkeley, United States; Tianzhe Chu, ShanghaiTech University, China; Ziyang Wu, University of California, Berkeley, United States; Shengbang Tong, New York University, United States; Ben Haeffele, Johns Hopkins University, United States; Yi Ma, University of California, Berkeley and University of Hong Kong, Hong Kong SAR of China