Türkçe sürekli konuşma tanıma sistemlerinin internet protokolu üzerindeki başarımların araştırılması ve eniyilenmsi

Ana Sayfa
→
Enstitüler
→
Fen Bilimleri Enstitüsü
→
Elektrik-Elektronik Mühendisliği A.B.D. Koleksiyonu
→
Öğe Göster

Türkçe sürekli konuşma tanıma sistemlerinin internet protokolu üzerindeki başarımların araştırılması ve eniyilenmsi

Uzun, İsmail

Tarih: 2007

Özet:

Bu çalısmada Türkçe sürekli konusma tanıma sistemlerinin Internet protokolu (IP) üzerinde dağıtık yapı modeli kullanılarak paket kayıplı durumlarda basarımları arastırılmıs, paket kayıp telafi yöntemlerinin basarımları karsılastırılmıs ve uyarlamalı kestirim yöntemi gelistirilerek basarımın zaman içerisinde iyilestirilmesi hedeflenmistir. IP üzerinden ses iletimi genellikle güvenli olmayan UDP protokolu kullanılarak paketler halinde iletilir. Paketlerin iletimi sırasında yönlendirici ve ağ geçitlerindeki sıkısıklık ve is istasyonlarındaki yük fazlalığı gibi nedenlerden dolayı kayıplar olusabilmektedir. IP üzerinden konusma tanıma sistemlerinin basarımlarını arttırmak için kayıp paketlerin telafi edilmesi gerekir. Bu çalısmada, sürekli konusma tanıma sistemi olarak ses öznitelik vektörlerinin iletildiği dağıtık yapı modeli kulanılmıs ve ses öznitelik vektörleri UDP paketleri seklinde iletilmistir. Alıcı tarafında paket kayıp telafi sistemi tanımlanmıstır. Telafi sistemi, en uygun yöntemin seçildiği ön islem asaması, telafi asaması ve uyarlamalı yöntemler için parametrelerin güncellendiği ardislem asamalarından olusmaktadır. Kayıp telafi sistemi için dinamik tampon bellek tasarımı yapılmıs ve uygulanacak telafi yöntemi seçimi için ölçütler belirlenmistir. Telafi yöntemi olarak aradeğerleme ve kestirim yöntemleri uygulanmıstır. Kestirim yöntemlerinde mevcut uygulamalarda ses öznitelik vektörlerinin genis anlamda durağan kabullenmesi yapılmakta ve eğitim amaçlı seçilen veriden elde edilen ortalama değer ve ortak değisinti değerleri telafi asamasında kullanılmaktadır. Bu çalısmada, ses öznitelik vektörlerinin zaman içerisindeki ortalama değeri ve ortak değisinti değerleri uyarlamalı olarak güncellenmektedir. Böylece, telafi sistemi her yeni konusmada kendini güncelleyerek basarımını arttırmaktadır.

In this study, the performance of Turkish continuous speech recognition (TCSR) over the Internet protocol with distributed architecture is investigated, packet loss compensation (PLC) algorithms are compared and an adaptive estimation method where system performance increases adaptively is developed. In voice over IP (VoIP), voice is usually transmitted as data packets using the unreliable UDP protocol. During transmission, packet losses may occur depending on congestion in routers and gateways. Packet losses should be compensated to increase the performance of continuous speech recognition over IP. A distributed speech recognition architecture is used where only speech feature vectors are carried over UDP. For the receiver side, a PLC system, consisting of preprocessing, compensation and post-processing stages, is designed. In the pre-processing stage, speech feature vectors are buffered and compensation method is determined depending on packet loss type. The compensation method is then applied in the compensation stage. In the post processing stage, the mean and covariance of speech feature vectors are updated adaptively and the outputs are later used in estimation methods to increase the performance. A dynamic buffer is designed for the compensation and criteria to choose the method for the compensation of packet losses is defined. In this study, interpolation and estimation techniques are investigated for the PLC algorithm. In the current estimation methods which are used for PLC, training data are assumed to be wide sense stationary. The estimation method is enhanced by adaptation of mean and covariance parameters with each new speech packet.

Tüm öğe kaydını göster