|Title:||Sound Signal Invariant DAE Neural Network-Based Quantizer Architecture of Audio/Speech Coder Using the Matching Pursuit Algorithm|
|Authors:||Avramov, V. V.|
Herasimovich, V.
Petrovsky, A. A.
|Keywords:||Publications of scientists;Stepwise activation function;Audio/speech coding;Quantization;Neural network|
|Citation:||Avramov, V. Sound Signal Invariant DAE Neural Network-Based Quantizer Architecture of Audio/Speech Coder Using the Matching Pursuit Algorithm / V. Avramov, V. Herasimovich, A. Petrovsky // Advances in Neural Networks – ISNN 2018. Lecture Notes in Computer Science. – 2018. – Vol. 10878. – P. 511–520. – DOI: 10.1007/978-3-319-92537-0_59.|
|Abstract:||The paper is devoted to the development of a quantization algorithm based on the neural network framework. The research is considered in the context of a scalable real-time audio/speech coder based on the perceptually adaptive matching pursuit algorithm. The encoder parameterizes each input sound signal frame with a set of real numbers that need to be compactly represented in binary form, i.e. quantized. The neural network approach is well suited to this goal because the data is quantized as a whole vector rather than element by element, so correlations between the elements of the input coded vector can be exploited effectively. A deep autoencoder (DAE) neural network-based architecture for the quantization part of the encoding algorithm is presented, and its structure and learning features are described. The conducted experiments demonstrate a high compression ratio together with high reconstructed signal quality for the developed audio/speech coder quantization scheme.|
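The abstract describes quantizing a frame's matching-pursuit parameters as a whole vector through a DAE bottleneck with a stepwise activation. A minimal NumPy sketch of that idea follows; the dimensions, the level count, and the random weights are illustrative assumptions, not values from the paper, and the sigmoid-plus-rounding form of the stepwise activation is one plausible choice among several.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative dimensions (assumptions, not taken from the paper).
FRAME_DIM = 16      # matching-pursuit parameters per signal frame
CODE_DIM = 4        # DAE bottleneck width
LEVELS = 8          # steps of the stepwise activation (3-bit codes)

# Randomly initialized weights stand in for trained DAE parameters.
W_enc = rng.standard_normal((CODE_DIM, FRAME_DIM)) * 0.1
W_dec = rng.standard_normal((FRAME_DIM, CODE_DIM)) * 0.1

def stepwise(x, levels=LEVELS):
    """Stepwise activation: squash to (0, 1), then snap to a uniform grid.

    The snapped values can be stored as log2(levels)-bit integers,
    which is what turns the bottleneck into a quantizer.
    """
    y = 1.0 / (1.0 + np.exp(-x))              # sigmoid squashing
    return np.round(y * (levels - 1)) / (levels - 1)

def encode(frame):
    """Map a frame's parameter vector to a quantized code vector."""
    return stepwise(W_enc @ frame)

def decode(code):
    """Reconstruct the parameter vector from the quantized code."""
    return W_dec @ code

frame = rng.standard_normal(FRAME_DIM)
code = encode(frame)
recon = decode(code)

# Every code entry lies exactly on the LEVELS-point grid.
assert np.allclose(code * (LEVELS - 1), np.round(code * (LEVELS - 1)))
```

Because the whole frame vector passes through one bottleneck, the (trained) encoder can exploit correlations between parameters, which is the advantage over quantizing each real number separately.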
|Appears in Collections:||Publications in foreign editions|
|Avramov_Sound.pdf||84.71 kB||Adobe PDF|