Cloud-based Automatic Speech Recognition Systems for Southeast Asian Languages
Abstract
This paper provides an overall introduction of our Automatic Speech Recognition (ASR) systems for Southeast Asian languages. As not much existing work has been carried out on such regional languages, a few difficulties should be addressed before building the systems: limitation on speech and text resources, lack of linguistic knowledge, etc. This work takes Bahasa Indonesia and Thai as examples to illustrate the strategies of collecting various resources required for building ASR systems.
- Publication:
-
arXiv e-prints
- Pub Date:
- October 2022
- DOI:
- 10.48550/arXiv.2210.03580
- arXiv:
- arXiv:2210.03580
- Bibcode:
- 2022arXiv221003580W
- Keywords:
-
- Computer Science - Computation and Language;
- Electrical Engineering and Systems Science - Audio and Speech Processing;
- I.2.7
- E-Print:
- Published by the 2017 IEEE International Conference on Orange Technologies (ICOT 2017)