MLSP-L3: Deep Learning Architectures for Large-Scale Models
Oral
Tue, 5 May, 14:00 - 16:00
Location: Room 117
Session Type: Oral
Track: Machine Learning for Signal Processing [ML]
Click the to view the manuscript on IEEE Xplore Open Preview
Tue, 5 May, 14:00 - 14:20

MLSP-L3.1: ILSA: Information Loss-guided Sparsity Allocation for Pruning Large Language Models

Lin Li, Yan Wang, Zhuopeng Wang, Feilong Bao, Inner Mongolia University, China
Tue, 5 May, 14:20 - 14:40

MLSP-L3.2: EMOE: EIGENBASIS-GUIDED ROUTING FOR MIXTURE-OF-EXPERTS

Anzhe Cheng, Shukai Duan, Shixuan Li, Chenzhong Yin, Mingxi Cheng, Shahin Nazarian, Paul Thompson, Paul Bogdan, University of Southern California, United States of America
Tue, 5 May, 14:40 - 15:00

MLSP-L3.3: Intrinsic Semantic Consistency Enhancement for Robust Hierarchical Understanding in VLMs

Zhongze Wu, Central South University, China; Yitian Long, Fudan University, China; Feng Yang, Southeast University, China; Yueyi Luo, Central South University, China; Shan You, SenseTime Research, China; Xiu Su, Jun Long, Central South University, China
Tue, 5 May, 15:00 - 15:20

MLSP-L3.4: Dynamic Self-Distillation Former for Weakly Supervised Semantic Segmentation

Fanxuan Kong, Jun Lu, Heilongjiang University, China
Tue, 5 May, 15:20 - 15:40

MLSP-L3.5: ADAPTIVE SHARED EXPERTS WITH LORA-BASED MIXTURE OF EXPERTS FOR MULTI-TASK LEARNING

Minghao Yang, Ren Togo, Guang Li, Takahiro Ogawa, Miki Haseyama, Hokkaido University, Japan
Tue, 5 May, 15:40 - 16:00

MLSP-L3.6: nGPT as a Scalable Architecture for Speech Recognition and Translation

Nune Tadevosyan, Nithin Rao Koluguri, Monica Sekoyan, Piotr Zelasko, Nikolay Karpov, Jagadeesh Balam, Boris Ginsburg, NVIDIA, Armenia