Neural Text Normalization in Speech-to-Text Systems with Rich Features

Tran, Oanh Thi and Bui, Viet The (2021) Neural Text Normalization in Speech-to-Text Systems with Rich Features. Applied Artificial Intelligence, 35 (3). pp. 193-205. ISSN 0883-9514

[thumbnail of Neural Text Normalization in Speech to Text Systems with Rich Features.pdf] Text
Neural Text Normalization in Speech to Text Systems with Rich Features.pdf - Published Version

Download (3MB)

Abstract

This paper presents the task of normalizing Vietnamese transcribed texts in Speech-to-Text (STT) systems. The main purpose is to develop a text normalizer that automatically converts proper nouns and other context-specific formatting of the transcription such as dates, time, and numbers into their appropriate expressions. To this end, we propose a solution that exploits deep neural networks with rich features followed by manually designed rules to recognize and then convert these text sequences. We also introduce a new corpus of 13 K spoken sentences to facilitate the process of the text normalization. The experimental results on this corpus are quite promising. The proposed method yields 90.67% in the F1 score in recognizing sequences of texts that need converting. We hope that this initial work will inspire other follow-up research on this important but unexplored problem.

Item Type: Article
Subjects: Asian STM > Computer Science
Depositing User: Managing Editor
Date Deposited: 17 Jun 2023 05:16
Last Modified: 31 Oct 2023 04:48
URI: http://journal.send2sub.com/id/eprint/1765

Actions (login required)

View Item
View Item