MLSP-L14.5

MATHHALU: A BENCHMARK FOR MATHEMATICAL REASONING PROCESS HALLUCINATION DETECTION IN LARGE REASONING MODELS

Bo Zhang, Rocket Force University of Engineering, China; Cong Gao, Nankai University, China; Bingxu Han, Shandong University, China; Minghao Hu, Zhunchen Luo, Jun Zhang, AMS, Center of Information Research, China; Wen Yao, AMS, Defense Innovation Institute, China; Xiaoying Bai, Guotong Geng, AMS, Center of Information Research, China; Zhong Wang, Rocket Force University of Engineering, China

Session:
MLSP-L14: Adversarial Attacks and Robust Learning Oral

Track:
Machine Learning for Signal Processing [ML]

Location:
Room 120+121

Presentation Time:
Wed, 6 May, 15:20 - 15:40
