SLP-L10: Efficient Learning and Inference for LLMs I
Wed, 6 May, 16:30 - 18:30
Location: Room 115
Session Type: Oral
Track: Speech and Language Processing [SL]
Wed, 6 May, 16:50 - 17:10
SLP-L10.2: TSQLORA: TOWARDS SENSITIVITY AND QUALITY LOW-RANK ADAPTATION FOR EFFICIENT FINE-TUNING
Wed, 6 May, 17:10 - 17:30
SLP-L10.3: I-LORA: AN ADAPTIVE RANK ALLOCATION APPROACH USING INTEGRATED GRADIENTS
Wed, 6 May, 17:50 - 18:10
SLP-L10.5: MIDAS: A Dynamic Cross-GPU KV Cache Offloading Framework For LLM On GPU Cluster Systems
Wed, 6 May, 18:10 - 18:30