APSIPA 2021
Authors
Paper Submission
Paper Kit
Program
Technical Program
Paper Search
Presentation Instructions
Presentation Instructions
Presentation Upload
Sponsors
APSIPA 2021 Attendee Access
Technical Program
Session OD-SLA-8
Paper OD-SLA-8.6
OD-SLA-8.6
Improving End-To-End Modeling for Mispronunciation Detection with Effective Augmentation Mechanisms
Tien-Hong Lo, Yao-Ting Sung, Berlin Chen, National Taiwan Normal University, Taiwan
Session:
Speech Recognition and Spoken Language Processing
Track:
Speech, Language, and Audio (SLA)
Session Time:
Fri, 17 Dec, 15:20 - 17:00 Japan Standard Time (UTC +9)
Fri, 17 Dec, 06:20 - 08:00 Coordinated Universal Time
Fri, 17 Dec, 01:20 - 03:00 Eastern Standard Time (UTC -5)
Thu, 16 Dec, 22:20 - 00:00 Pacific Standard Time (UTC -8)
Session Chair:
Hsin-Min Wang, Academia Sinica
Presentation
Not logged in.
Not logged in.
Discussion
Not logged in.
Resources
Not logged in.
Session OD-SLA-8
FR3.OD-A.1: Enriching Under-Represented Named Entities for Improved Speech Recognition
Tingzhi Mao, Hao Huang, Aishan Wumaier, Xinjiang University, China; Yerbolat Khassanov, Nazarbayev University, Kazakhstan; Van Tung Pham, Haihua Xu, Eng Siong Chng, Nanyang Technological University, Singapore, Singapore
FR3.OD-A.2: Ensemble of One Model: Creating Model Variations for Transformer with Layer Permutation
Andrew Liaw, Jia-Hao Hsu, Chung-Hsien Wu, National Cheng Kung University, Taiwan
FR3.OD-A.3: UNCERTAINTY ESTIMATION IN AUTOMATIC PRONUNCIATION ASSESSMENT WITH PSEUDO SAMPLES BASED ON DEEP KERNEL LEARNING
Binghuai Lin, Liyuan Wang, Tencent Technology Co., Ltd, China
FR3.OD-A.4: RETRIEVAL-ORIENTED E2E ASR MODELING FOR IMPROVED QUERY-BY-EXAMPLE SPOKEN TERM DETECTION
Takumi Kurokawa, Atsuhiko Kai, Shizuoka University, Japan
FR3.OD-A.5: Multilingual Approach to Joint Speech and Accent Recognition with DNN-HMM Framework
Yizhou Peng, Jicheng Zhang, Haobo Zhang, Hao Huang, Xinjiang University, China; Haihua Xu, Eng Siong Chng, Nanyang Technological University, Singapore; Sheng Li, National Institute of Information and Communications Technology, Japan
FR3.OD-A.6: Improving End-To-End Modeling for Mispronunciation Detection with Effective Augmentation Mechanisms
Tien-Hong Lo, Yao-Ting Sung, Berlin Chen, National Taiwan Normal University, Taiwan
FR3.OD-A.7: Zero-shot Domain Adaptation with Inference Relation Paths for Spoken Language Understanding
Sixia Li, Jianwu Dang, Japan Advanced Institute of Science and Technology, Japan
FR3.OD-A.8: End to End Spoken Language Understanding Using Partial Disentangled Slot Embedding
Tan Liu, Wu Guo, University of Science and Technology of China, China
FR3.OD-A.9: MULTIPLE DEEP LEARNING MODELS AND ARCHITECTURES WITH DIFFERENT NUMBERS OF STATES USED TO IMPROVE RETRIEVAL ACCURACY OF QUERY-BY-EXAMPLE
Kazuki Hatakeyama, Kazunori Kojima, Yoshiaki Itoh, Iwate Prefectural University, Japan; Masahiro Nishino, TOYOTA SYSTEMS CORPORATION, Japan; Shi-wook Lee, AIST, Japan
FR3.OD-A.10: Separable Temporal Convolution plus Temporally Pooled Attention for Lightweight High-performance Keyword Spotting
Shenghua Hu, Jing Wang, Wenjing Yang, Beijing Institute of Technology, China; Yujun Wang, Xiaomi Inc., China
FR3.OD-A.11: END-TO-END SPONTANEOUS SPEECH RECOGNITION USING HESITATION LABELING
Koharu Horii, Norihide Kitaoka, Toyohashi University of Technology, Japan; Meiko Fukuda, Ryota Nishimura, Tokushima University, Japan; Kengo Ohta, National Institute of Technology, Anan College, Japan; Atsunori Ogawa, Nippon Telegraph and Telephone Corporation, Japan
FR3.OD-A.12: UNSUPERVISED SPOKEN TERM DISCOVERY USING WAV2VEC 2.0
Yu Iwamoto, Takahiro Shinozaki, Tokyo Institute of Technology, Japan
FR3.OD-A.13: EFFECT OF PERCEPTUAL TRAINING WITH NOISE ON CHINESE LEARNERS’ ENGLISH CONSONANT RECEPTION THRESHOLDS
Jian Gong, Yameng Yu, William Bellamy, Feng Wang, Xiaoli Ji, Jiangsu University of Science and Technology, China
FR3.OD-A.14: MULTI-VIEW CONVOLUTION FOR LIPREADING
Tsubasa Maeda, Satoshi Tamura, Gifu university, Japan
FR3.OD-A.15: OLR 2021 CHALLENGE: DATASETS, RULES AND BASELINES
Binling Wang, Wenxuan Hu, Jing Li, Yiming Zhi, Zheng Li, Qingyang Hong, Lin Li, Xiamen University, China; Dong Wang, Tsinghua University, China; Liming Song, Cheng Yang, Speechocean, China
FR3.OD-A.16: CROSS-UTTERANCE RERANKING MODELS WITH BERT AND GRAPH CONVOLUTIONAL NETWORKS FOR CONVERSATIONAL SPEECH RECOGNITION
Shih-Hsuan Chiu, Tien-Hong Lo, Fu-An Chao, Berlin Chen, National Taiwan Normal University, Taiwan