Skip to main content

Automatic Speech Recognition and Understanding Over Noisy Air Traffic Control VHF Channels in Singapore

Paper ID

SIDs-2024-024

Conference

SESAR Innovation Days

Year

2024

Theme

Speech Recognition

Project Name

Keywords:

Robust Automatic Speech Recognition; Natural Language Processing; Air Traffic Control Communications; Spoken Language Understanding; Signal Processing

Authors

Phuong Tuan Dat, Luong Trung Tuan, Jayakrishnan Melur Madhathil and Tran Huy Dat

DOI

https://doi.org/10.61009/SID.2024.1.13

Abstract

In recent years, the demand for Automatic Speech Recognition (ASR) and Spoken Language Understanding (SLU) systems within the Air Traffic Control (ATC) domain has been increasing, especially systems that can be applied in practice. These systems are essential for reducing the workload of pilots and air traffic control officers (ATCOs) and ensuring the utmost accuracy in communication between pilots and ATCOs. However, ATC remains a low-resource and challenging domain. This paper presents our work on developing an ASR engine and an SLU system for ATC in Singapore, addressing these challenges. We introduce the Singapore Air Traffic Control (S-ATC) dataset, aimed at fostering research in this demanding field. We then discuss our contributions to constructing an efficient ASR system tailored for the ATC domain. Experimental results are provided to evaluate the effectiveness of combining an ASR system with a Natural Language Processing (NLP) model versus an End-to-End system for the SLU tasks in this specific domain. Additionally, we try to implement a model for Speaker Role Detection (SRD) task and propose ideas to enhance the efficiency of these systems in the ATC domain in the future.