SLP-P38.10
USER-LEVEL SAFETY ALIGNMENT
Ziyang Zhang, Shandong University, China; Liqiang Jing, University of Texas at Dallas, United States of America
Session: SLP-P38: Question Answering: Benchmarking and Reliability Poster
Track: Speech and Language Processing [SL]
Location: Poster Area 28
Presentation Time: Thu, 7 May, 14:00 - 16:00
Session SLP-P38
SLP-P38.1: MAIA: A MULTIDIMENSIONAL BENCHMARK FOR ASSESSING MEDICAL AI AGENTS
Xin Ding, Jiaxin Ding, Yule Xie, Xinbing Wang, Shanghai Jiao Tong University, China; Biao Peng, Yan Li, Yichen Li, Andrew Huang, Yinan Wang, Noahai (Cayman) Limited, Singapore
SLP-P38.2: Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Yutao Hou, Yajing Luo, Shanghai University of Finance and Economics, China; Zhiwen Ruan, Southern University of Science and Technology, China; Hongru Wang, The Chinese University of Hong Kong, China; Weifeng Ge, Fudan University, China; Yun Chen, Shanghai University of Finance and Economics, China; Guanhua Chen, Southern University of Science and Technology, China
SLP-P38.3: DETECTWILD: IN-THE-WILD AI-GENERATED TEXT DETECTION BENCHMARK
Linqiu Zhang, Shanghai University of Finance and Economics, China; Xinnan Guo, Ant Group, China; Zixuan Zhao, Shuyang Cai, Shanghai University of Finance and Economics, China; Qianle Wang, ByteDance, China; Wanqing Xu, Xuan Lin, Ant Group, China; Wanyun Cui, Shanghai University of Finance and Economics, China
SLP-P38.4: MEGATEMPQA: A MILLION-SCALE TEMPORAL QUESTION-ANSWER DATASET FOR REDUCING LLM HALLUCINATIONS
Haseeb Javed, Khan Muhammad, Hayoung Oh, Farman Ali, Sungkyunkwan University, Republic of Korea
SLP-P38.5: HI-READER: A HIERARCHICAL COGNITIVE FRAMEWORK FOR MULTI-PAGE DOCUMENT VISUAL QUESTION ANSWERING
Quancai Liu, Haihui Fan, Jinchao Zhang, Xiangfang Li, Chuanrong Li, Bo Li, Institute of Information Engineering, Chinese Academy of Sciences, China
SLP-P38.6: CONSISTENCY-AWARE LEARNING FOR UNBIASED VISUAL QUESTION ANSWER
Xinyu Jiang, Qiang Lu, Liang Zhao, Shandong Jiaotong University, China; Yunfei Long, Queen Mary University of London, United Kingdom of Great Britain and Northern Ireland; Zhenfang Zhu, Shandong Jiaotong University, China; Jianyong Chai, Shandong Rail Transit Information Co., China
SLP-P38.7: Hierarchical Voting Decoder for Resolving Knowledge Conflicts
Leike An, Shuai Chen, Zhen Li, Kun Wu, Shan Yang, Xiangyu Pei, Jibin Wang, China Mobile Information Technology Center, China; Shaozhe Liu, Peking University, China
SLP-P38.8: COREANCHOR-QA: CENTER-ANCHORED AND SELF-IMPROVING FOR QUESTION-ANSWER GENERATION
Mengting Huang, Qiming Fu, Jianping Chen, Hongjie Wu, You Lu, Yunzhe Wang, Suzhou University of Science and Technology, China
SLP-P38.9: ReTools: Reflection-Enhanced Tool Invocation for Domain-Specific QA
Fuan Dong, Tao Zhou, Men Zhang, College of Computer Science, Nankai University, China; Xuan Pan, School of Artificial Intelligence, Tiangong University, China; Dongming Zhao, Artificial Intelligence Industry Research Institute, China Mobile Communications Group Tianjin Co., Ltd, Tianjin, China; Lei Xu, Continuous Education College, Nankai University, China; Yanlong Wen, Xiaojie Yuan, College of Computer Science, Nankai University, China
SLP-P38.10: USER-LEVEL SAFETY ALIGNMENT
Ziyang Zhang, Shandong University, China; Liqiang Jing, University of Texas at Dallas, United States of America