ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC)
Call for Participation
As cars become an indispensable part of human daily life, a secure and comfortable driving environment is more and more attractive. The touch-based interaction in the traditional cockpit is easy to distract the driver’s attention, leading to inefficient operations and potential security risks. Thus, the concept of intelligent cockpit is gradually on the rise.
The intelligent cockpit aims to achieve a seamless driving experience for people by integrating multimodal intelligent interactions, like speech, gestures, body, etc., with different driving functions, like commands recognition, entertainment, navigation, etc. As a natural human-computer interaction method, a robust speech or command recognition system is crucial to the intelligent cockpit. Although speech recognition has achieved great progress in lots of applications, there are still many challenges in the driving scenario. First of all, the acoustic environment of the cockpit is complex. Since the cockpit is a closed and irregular space, it has special room impulse response (RIR), resulting in special reverberation conditions. In addition, there are various kinds of noise during driving from both inside and outside, such as wind, engine, wheel, background music and interfering speaker, etc. Secondly, the main content of intelligent cockpit speech interaction is the user’s command recognition, which includes controlling the air conditioner, playing songs, navigating, etc. These commands may involve a large number of named entities such as contacts, singer names and point of interest (POI).
Nowadays there is a large amount of open-source data for speech recognition, and the model trained with open-source data has achieved good performance in many applications. However, such models often show poor performance in the intelligent cockpit scene because of the special acoustic environment and content characteristics. Therefore, we launch the Intelligent Cockpit Speech Recognition Challenge (ICSRC), in which we will release an intelligent cockpit dataset and aim to explore speech recognition techniques in intelligent cockpit scenes. The corpus consists of 20 hours of real-world recorded data collected by a Hi-Fi microphone placed in a car in different driving conditions. This competition consists of 2 tracks with different limits of model configurations.
Timeline
- September 3rd, 2022: Release of the Eval data.
September 10th, 2022: Registration deadline, the due date for participants to join the challenge.
September 11th,2022: Release of the baseline system and open the leaderboard for ranking on the Eval data.
September 28th, 2022: Release of the Test data.
September 30th, 2022: Final submission deadline.
October 8th, 2022: Release of the results and rankings.
October 14th, 2022: Paper submission deadline.
October 24th, 2022: Camera-ready paper submission deadline.
December 11th − 14th, 2022: ISCSLP main conference, challenge session and award ceremony
Organizers
- Lei Xie, Professor, Northwestern Polytechnical University, China lxie@nwpu.edu.cn
- Longbiao Wang, Professor, Tianjin University, China longbiao wang@tju.edu.cn
- Eng Siong Chng, Associate Professor, Nanyang Technological University aseschng@ntu.edu.sg
- Hui Bu, CEO, AIShell , China buhui@aishelldata.com
- Binbin Zhang, WeNet Open Source Community, China binbzha@qq.com