Derin öğrenme ile sesli komut tanıma

Ana Sayfa
→
Enstitüler
→
Fen Bilimleri Enstitüsü
→
Elektrik-Elektronik Mühendisliği A.B.D. Koleksiyonu
→
Öğe Göster

Derin öğrenme ile sesli komut tanıma

Ateş, Emre

Tarih: 2019

Özet:

Sesli komut tanıma sistemleri, insan-makine etkileşimde yaygın olarak kullanılmaktadır. Ses tanımada akustik ve dil modelleri kullanılmaktadır. Ses tanımadaki başarım bu iki modele bağlıdır. Veritabanı dile ait özellikleri ne kadar iyi kapsarsa modelleme de o kadar iyi olmaktadır. Akustik ve dil modellemesi için çok disiplinli bir çalışma yapılması gereklidir. Bu tez çalışmasında, ses tanıma sistemlerinin daha kolay geliştirilmesi için derin öğrenme tabanlı yöntemin uçtan uca ses tanımada kullanılması ve sesli komut tanıma başarımlarının elde edilmesi üzerinde çalışılmıştır. Bu çalışmada, RNN yapısı ile Speech Commands Dataset içerisindeki temel komut kelimelerinin sınıflandırma başarımları elde edilmiştir. Deneysel çalışma sonunda %70,63 dogru sınıflandırma başarımı elde edilmiştir.

Voice command recognition systems are widely used in human-machine interaction. Acoustic and language models are used in voice recognition. The performance of voice recognition depends on these two models. The better the database covers the language features, the better the modeling. A multidisciplinary study is required for acoustics and language modeling. In this thesis, it has been studied to use end-to-end voice recognition method and to achieve voice command recognition achievements in order to develop voice recognition systems more easily. In this study, classification performance of basic command words in Speech Commands Dataset is obtained with RNN structure. At the end of the experimental study, 70.63% correct classification performance was obtained.

Tüm öğe kaydını göster