ASPS-P17: Resource-Efficient Machine Learning I
Fri, 8 May, 14:00 - 16:00
Location: Poster Area 32
Session Type: Poster
Track: Applied Signal Processing Systems [AS]

ASPS-P17.1: STAR Meets Linear Attention: Linear Complexity-Preserving Enhanced Attention Mechanism for Vision Transformer

Senqi Guan, Wenxin Liang, Moyan Tian, Guanglu Wang, Linlin Zong, Xinyue Liu, Xianchao Zhang, Dalian University of Technology, China

ASPS-P17.2: ENABLING ON-DEVICE LIFE-THREATENING ARRHYTHMIA DETECTION VIA PERSONALIZED ADAPTIVE INFERENCE FOR IMPLANTABLE DEVICES

Yanting Shi, Yiyang Shi, Shaopu Shi, Yushu Chen, Ran Tang, Ziming Wang, Shandong University, China; Rongfeng Zhang, Xiaojie Wang, Yunlong Xia, First Affiliated Hospital of Dalian Medical University, China; Xiuzhen Cheng, Shandong University, China

ASPS-P17.3: MIX-CLAP: ADAPTIVE FUSION OF KNOWLEDGE-DISTILLED AUDIO EMBEDDINGS FOR NOISE-AWARE AUDIO-LANGUAGE MODELS

Wataru Kohno, Shaobo Han, NEC Laboratories America, Inc., United States of America; Noriyuki Tonami, NEC Corporation, Japan; Tingfeng Li, NEC Laboratories America, Inc., United States of America; Jingchen Sun, NEC Laboratories America, Inc., The State University of New York at Buffalo, United States of America; Ting Wang, NEC Laboratories America, Inc., United States of America

ASPS-P17.4: S²VD: A SUBSPACE-AWARE SVD METHOD FOR EFFICIENT LLM COMPRESSION

Hao Zhou, Jiapeng Guan, Dalian University of Technology, China; Jie Zhang, Southeast University, China; Ran Wei, Lancaster University, United Kingdom of Great Britain and Northern Ireland; Xudong Zhao, Dalian University of Technology, China; Zhe Jiang, Southeast University, China; Xiangyang Ji, Tsinghua University, China

ASPS-P17.5: RDQ: Learnable Kronecker Rotation Matrix Decomposition for Efficient Large Language Model Quantization

Chenhui Zhu, Jinhao Liu, Hongxu Jiang, Beihang University, China; Kang Zhao, Tsinghua University, China; Runhua Zhang, Beihang University, China

ASPS-P17.6: UNILORA: A UNIFIED FRAMEWORK FOR EFFICIENT AND SECURE LORA MANAGEMENT IN MULTI-TENANT LLM INFERENCE

Zechao Lin, Xingbin Wang, Dan Meng, Rui Hou, Chinese Academy of Sciences, China

ASPS-P17.7: Fractal Generative Distillation

Junhao Wang, Jilin University, China; Guoxi Xu, Alibaba Group, China; Huimin Tong, Hikvision Digital Technology Co., Ltd., China; Gaochao Xu, Jilin University, China; Yiwei Chen, Chinese Academy of Sciences, University of Science and Technology of China, China

ASPS-P17.8: DUAL-PATH COMPRESSION FOR REAL-TIME MULTIMODAL CLICKBAIT DETECTION: QUANTIZATION AND DISTILLATION

Haoqian Song, Haoran Yin, Fuwen Zhao, Yiran Du, Xuan Luo, Xindi Ma, Chaoyuan Zuo, Nankai University, China

ASPS-P17.10: DISTILLATION-BASED LAYER DROPPING (DLD): EFFECTIVE END-TO-END FRAMEWORK FOR DYNAMIC SPEECH NETWORKS

Abdul Hannan, University of Trento, Italy; Daniele Falavigna, Fondazione Bruno Kessler, Italy; Shah Nawaz, Johannes Kepler University, Austria; Mubashir Noman, Mohamed bin Zayed University of Artificial Intelligence, United Arab Emirates; Markus Schedl, Johannes Kepler University, Austria; Alessio Brutti, Fondazione Bruno Kessler, Italy