Text independent speaker recognition based on MFCC and machine learning
Date
2021
Publisher
Selçuk Üniversitesi
Access Rights
info:eu-repo/semantics/openAccess
Abstract
Speaker recognition (SR) is the process of identifying a human speaker's voice from a set of speech samples using artificial intelligence. SR models are used in voice-based security platforms and authentication tasks. In this paper, a text-independent speaker recognition model is developed for a problem with 60 different speakers. Extracting features that distinguish the speakers is a key step in the model design phase. In this study, Mel-frequency cepstral coefficients (MFCC), the most common method for obtaining short-time features, are used to extract features from the speech signals. The classification performance of the proposed model and of 11 commonly used machine learning methods is evaluated on the Audio-MNIST dataset, and the results are presented comparatively. A classification rate of 97.1% was achieved with the SVM classifier, with precision, recall and F-score values of 98.0%, 97.1% and 97.4%, respectively. The results show that the proposed model performs well for all classes and is widely applicable to different types of speaker datasets.
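The MFCC feature extraction step named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The sampling rate, frame length, hop size, filter count, number of coefficients, the synthetic test signals, and all function names below are assumptions chosen for illustration only.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sr):
    # Triangular filters whose centers are equally spaced on the mel scale.
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):      # rising edge
            fb[i - 1, k] = (k - left) / (center - left)
        for k in range(center, right):     # falling edge
            fb[i - 1, k] = (right - k) / (right - center)
    return fb

def mfcc(signal, sr, n_mfcc=13, n_filters=26, frame_len=256, hop=128):
    # Pre-emphasis boosts high frequencies (0.97 is a conventional value).
    emphasized = np.append(signal[0], signal[1:] - 0.97 * signal[:-1])
    # Slice the signal into overlapping short-time frames and window them.
    n_frames = 1 + (len(emphasized) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = emphasized[idx] * np.hamming(frame_len)
    # Power spectrum -> mel filterbank energies -> log compression.
    power = np.abs(np.fft.rfft(frames, n=frame_len, axis=1)) ** 2 / frame_len
    log_mel = np.log(power @ mel_filterbank(n_filters, frame_len, sr).T + 1e-10)
    # Unnormalized DCT-II decorrelates the log energies; keep the first n_mfcc.
    n = np.arange(n_filters)
    basis = np.cos(np.pi * np.arange(n_mfcc)[:, None] * (2 * n + 1) / (2 * n_filters))
    return log_mel @ basis.T               # shape: (n_frames, n_mfcc)

# Two synthetic "voices" with different pitch (assumed stand-ins for speech).
sr = 8000
t = np.arange(sr) / sr
voice_a = sum(np.sin(2 * np.pi * 120 * h * t) / h for h in range(1, 6))
voice_b = sum(np.sin(2 * np.pi * 220 * h * t) / h for h in range(1, 6))

feats_a = mfcc(voice_a, sr)
feats_b = mfcc(voice_b, sr)
# One fixed-length vector per utterance: the mean over frames (an assumption).
vec_a = feats_a.mean(axis=0)
vec_b = feats_b.mean(axis=0)
print(feats_a.shape, vec_a.shape)
```

Averaging over frames yields one fixed-length vector per utterance regardless of what was said, which is one simple way to obtain text-independent input for a classifier such as an SVM. The abstract does not state how frame-level features were aggregated, so this step is a guess.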
Keywords
Speaker Recognition, Text-Independent, Human Voice, MFCC, Performance Analysis, Machine Learning
Source
Selcuk University Journal of Engineering Sciences
Volume
20
Issue
3
Citation
Hizlisoy, S., & Arslan, R. S. (2021). Text independent speaker recognition based on MFCC and machine learning. Selcuk University Journal of Engineering Sciences, 20(3), 73-78.