F-1-1: Emotion, Dialect, and Age Recognition |
All times are in New Zealand Time (UTC +13) |
| Presentation Time: Tuesday, December 8, 12:30 - 14:00 Check your Time Zone |
| F-1-1.1: DIALECT-AWARE MODELING FOR END-TO-END JAPANESE DIALECT SPEECH RECOGNITION |
| Ryo Imaizumi; Tokyo Metropolitan University |
| Ryo Masumura; Nippon Telegraph and Telephone Corporation |
| Sayaka Shiota; Tokyo Metropolitan University |
| Hitoshi Kiya; Tokyo Metropolitan University |
| F-1-1.2: ACOUSTIC AND TEXTUAL DATA AUGMENTATION FOR CODE-SWITCHING SPEECH RECOGNITION IN UNDER-RESOURCED LANGUAGE |
| I-Ting Hsieh; National Cheng Kung University |
| Chung-Hsien Wu; National Cheng Kung University |
| Chun-Huang Wang; National Cheng Kung University |
| F-1-1.3: SPEAKER-INVARIANT PSYCHOLOGICAL STRESS DETECTION USING ATTENTION-BASED NETWORK |
| Hyeon-Kyeong Shin; Yonsei University |
| Hyewon Han; Yonsei University |
| Kyunggeun Byun; Yonsei University |
| Hong-Goo Kang; Yonsei University |
| F-1-1.4: SENSING WITH CONTEXTS: CRYING REASON CLASSIFICATION FOR INFANT CARE CENTER WITH ENVIRONMENTAL FUSION |
| Chun-Min Chang; National Tsing Hua University |
| Huan-Yu Chen; National Tsing Hua University |
| Hsiang-Chun Chen; National Tsing Hua University |
| Chi-Chun Lee; National Tsing Hua University |
| F-1-1.5: SPEAKER AGE ESTIMATION USING AGE-DEPENDENT INSENSITIVE LOSS |
| Yuki Kitagishi; NTT |
| Hosana Kamiyama; NTT |
| Atsushi Ando; NTT |
| Naohiro Tawara; NTT |
| Takeshi Mori; NTT |
| Satoshi Kobashikawa; NTT |
| F-1-1.6: DEEP MULTILAYER PERCEPTRONS FOR DIMENSIONAL SPEECH EMOTION RECOGNITION |
| Bagus Tris Atmaja; JAIST |
| Masato Akagi; Japan Advanced Institute of Science and Technology |