Skip navigation
Please use this identifier to cite or link to this item: https://libeldoc.bsuir.by/handle/123456789/30401
Title: Convolutional neural network with semantically meaningful activations for speech analysis
Authors: Vashkevich, R.
Azarov, E. S.
Keywords: материалы конференций;semantic speech analysis;voice activity detection;convolution neural network;VAD;CNN
Issue Date: 2018
Publisher: БГУИР
Citation: Vashkevich, R. Convolutional neural network with semantically meaningful activations for speech analysis / R. Vashkevich, E. S. Azarov // Открытые семантические технологии проектирования интеллектуальных систем = Open Semantic Technologies for Intelligent Systems (OSTIS-2018) : материалы международной научно-технической конференции (Минск, 15 - 17 февраля 2018 года) / редкол. : В. В. Голенков (отв. ред.) [и др.]. – Минск : БГУИР, 2018. – С. 227 – 230.
Abstract: Semantic analysis of speech is more prospective compared to analysis of text since speech contains more in- formation that is important for understanding. The most im- portant distinguishing feature of speech is intonation, which is inaccessible in the text analysis. For successful semantic analysis of speech it is necessary to transform the speech signal into features with semantic interpretation. The mathematical apparatus of convolutional neural networks (CNN) seems suitable to implement this kind of transformation. However there is a scalability problem that makes it hard to combine many CNN’s in a single solution. To overcome this we propose to develop a CNN model with semantically meaningful activations i.e. the model that is capable of semantic interpretation of its internal states. The ultimate goal of the transform is to extract all semantically meaningful information, however the present work is confined to voice activity detection (VAD) and intonation extraction. Unlike other VADs based on artificial neural networks, the proposed model does not require a lot of computing resources and has a comparable or even better performance.
URI: https://libeldoc.bsuir.by/handle/123456789/30401
Appears in Collections:OSTIS-2018

Files in This Item:
File Description SizeFormat 
Vashkevich_Convolutional.PDF201.88 kBAdobe PDFView/Open
Show full item record Google Scholar

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.