Deep Learning For Speech Analysis And Evaluation

dc.contributor.advisor	Nguyen, Thi Thanh Sang
dc.contributor.author	Nguyen, Phu Vinh
dc.date.accessioned	2024-03-15T03:20:30Z
dc.date.available	2024-03-15T03:20:30Z
dc.date.issued	2021
dc.identifier.uri	http://keep.hcmiu.edu.vn:8080/handle/123456789/4568
dc.description.abstract	Speech analysis involves learning and recognizing the features of an audio clip that can be used to analyze a language. The procedure converts audio files to spectrograms and then applies a neural network to learn and recognize the features of the audio. The main objective of this thesis is to analyze the languages spoken by the various speakers recorded in Mozilla’s Common Voice and VIVOS Corpus datasets. The analysis uses 10 seconds of each recorded utterance. The datasets are split into training and test sets. The test results show an overall accuracy of 99%.	en_US
dc.language.iso	en	en_US
dc.subject	Deep learning	en_US
dc.title	Deep Learning For Speech Analysis And Evaluation	en_US
dc.type	Thesis	en_US
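
The abstract describes converting audio to spectrograms before a neural network sees the data. The following is a minimal sketch of that conversion step only, not the thesis's actual preprocessing, which this record does not specify. The function name `spectrogram`, the frame length, the hop size, and the synthetic sine-wave input are all assumptions chosen for illustration; a real pipeline would load recorded utterances and would likely use an FFT library rather than the naive transform shown here.

```python
import cmath
import math


def spectrogram(samples, frame_len=64, hop=32):
    """Return a list of magnitude spectra, one per Hann-windowed frame.

    Hypothetical helper for illustration; parameters are assumptions.
    """
    # Hann window reduces spectral leakage at the frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / (frame_len - 1))
              for n in range(frame_len)]
    n_bins = frame_len // 2 + 1  # keep non-negative frequencies only
    frames = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = [samples[start + n] * window[n] for n in range(frame_len)]
        # Naive DFT, O(frame_len^2): fine for a sketch, too slow for real audio.
        spectrum = [
            abs(sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                    for n in range(frame_len)))
            for k in range(n_bins)
        ]
        frames.append(spectrum)
    return frames


# Synthetic stand-in for an utterance: a 1000 Hz sine sampled at 8000 Hz.
# With 64-sample frames the bin spacing is 8000 / 64 = 125 Hz.
sample_rate = 8000
tone_hz = 1000
signal = [math.sin(2 * math.pi * tone_hz * t / sample_rate) for t in range(512)]

spec = spectrogram(signal)
# Index of the strongest frequency bin in the first frame.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
```

Each row of `spec` is one time frame and each column one frequency bin, so the result can be treated as a 2-D image-like input for a neural network classifier.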