Paper
11 October 2023 Research on English speaking assessment algorithms based on deep learning
Hongyan Cai, Lei Liu, Juan Zhang
Author Affiliations +
Proceedings Volume 12800, Sixth International Conference on Computer Information Science and Application Technology (CISAT 2023); 1280011 (2023) https://doi.org/10.1117/12.3003946
Event: 6th International Conference on Computer Information Science and Application Technology (CISAT 2023), 2023, Hangzhou, China
Abstract
The mainstream deep learning spoken language evaluation algorithms are based on speech recognition to perform mispronunciation determination. In the process of speech recognition, Deep Neural Network-Hidden Markov Model (DNN-HMM) has better performance than Gaussian Mixture Model-Hidden Markov Model (GMM-HMM). In this paper, the structure of speech recognition system is analyzed and the current mainstream English speaking evaluation algorithms are analyzed, and it is concluded that the current mainstream evaluation algorithms are basically not end-to-end and have low evaluation metrics. Therefore, this paper designs an end-to-end speech evaluation model based on the theory of deep learning, and conducts experiments for this model under the TIMIT speech dataset to verify and analyze the advantages of this paper's model in temporal modeling, which shows that the speech recognition model designed in this paper has better performance in spoken language evaluation.
(2023) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Hongyan Cai, Lei Liu, and Juan Zhang "Research on English speaking assessment algorithms based on deep learning", Proc. SPIE 12800, Sixth International Conference on Computer Information Science and Application Technology (CISAT 2023), 1280011 (11 October 2023); https://doi.org/10.1117/12.3003946
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Data modeling

Detection and tracking algorithms

Education and training

Systems modeling

Speech recognition

Acoustics

Feature extraction

Back to Top