TA4a.1: Tensor low-rank approximation of value functions in multi-task RL
Sergio Rozada, King Juan Carlos University, Spain; Santiago Paternain, Rensselaer Polytechnic Institute, United States; Juan Andrés Bazerque, University of Pittsburgh, United States; Antonio G. Marques, King Juan Carlos University, Spain
TA4a.3: Bimodal Bandits: Max-Mean Regret Minimization
Adit Jain, Cornell University, United States; Sujay Bhatt, JP Morgan Chase and Co., United States; Vikram Krishnamurthy, Cornell University, United States; Alec Koppel, JP Morgan Chase and Co., United States