Multivoice Text-to-Speech Synthesis for Natural-Language Interfaces of Intelligent Systems

Zahariev, V. A.; Petrovsky, A. A.

Please use this identifier to cite or link to this item: https://libeldoc.bsuir.by/handle/123456789/12073

Title:	Multivoice Text-to-Speech Synthesis for Natural-Language Interfaces of Intelligent Systems
Other Titles:	Мультиголосовой синтез речи по тексту для построения естественно-языковых интерфейсов интеллектуальных систем
Authors:	Zahariev, V. A. Petrovsky, A. A.
Keywords:	материалы конференций;text-to-speech system;voice conversion;natural- language interface
Issue Date:	2017
Publisher:	БГУИР
Citation:	Zahariev, V. A. Multivoice Text-to-Speech Synthesis for Natural-Language Interfaces of Intelligent Systems / V. A. Zahariev, A. A. Petrovsky // Открытые семантические технологии проектирования интеллектуальных систем = Open Semantic Technologies for Intelligent Systems (OSTIS-2017) : материалы международной научно-технической конференции (Минск, 16-18 февраля 2017 года) / редкол.: В. В. Голенков (отв. ред.) [и др.]. – Минск : БГУИР, 2017. – С. 167–170.
Abstract:	The paper considers the application of voice conversion technology for developing multivoice text-to-speech synthesis (MVTTS) in natural-language interfaces of intelligent systems. The main features of the proposed integrated architecture of MVTTS system are presented. Voice conversion model based on multiple regression mapping function and Gaussian mixture models, as well as the method of text-independent learning based on hidden Markov models and modiﬁed Viterbi algorithm are observed. Experimental evaluation of the effectiveness of the proposed solutions on the characteristics of naturality and similarity of synthesized speech has been done.
Alternative abstract:	В докладе рассмотрены вопросы применения технологии конверсии голоса для построения мультиголосовых систем синтез речи по тексту (МГСРТ) для создания естественно-языковых интерфейсов интеллектуальных систем. Особенности предлагаемой интегрированной архитектуры МГСРТ на базе технологии конверсии голоса, функция конверсии голоса на основе модели Гауссовых смесей и множественной регрессионной функции отображения, а также метод текстонезависимого обучения на базе скрытых Марковских моделей и модифицированного алгоритма Витерби. Приведены экспериментальные оценки эффективности предлагаемых решений по характеристикам узнаваемости и натуральности синтезируемой речи.
URI:	https://libeldoc.bsuir.by/handle/123456789/12073
Appears in Collections:	OSTIS-2017

Files in This Item:

File	Description	Size	Format
Zahariev_Multivoice.PDF		368.79 kB	Adobe PDF	View/Open

Show full item record Google Scholar