Skip navigation
Please use this identifier to cite or link to this item: https://libeldoc.bsuir.by/handle/123456789/61457
Full metadata record
DC FieldValueLanguage
dc.contributor.authorChen Zhengyu-
dc.coverage.spatialМинскen_US
dc.date.accessioned2025-09-05T07:45:32Z-
dc.date.available2025-09-05T07:45:32Z-
dc.date.issued2025-
dc.identifier.citationChen Zhengyu. Software for recognizing speaker by voice / Chen Zhengyu // Информационная безопасность : сборник материалов 61-й научной конференции аспирантов, магистрантов и студентов БГУИР, Минск, 21–25 апреля 2025 г. / Белорусский государственный университет информатики и радиоэлектроники. – Минск, 2025. – С. 13–17.en_US
dc.identifier.urihttps://libeldoc.bsuir.by/handle/123456789/61457-
dc.description.abstract. FBank (Filter Bank) is a front-end processing algorithm that processes audio in a way similar to the human ear and extracts features to improve the performance of speech recognition. The system uses an efficient context-aware masking-based network, CAM++, which uses a densely connected time-delay neural network (D-TDNN) as the backbone and adopts a novel multi-granularity pooling to capture different levels of context information.Based on the respective advantages of FBank and CAM++ models, this study designs a software for recognizing speaker by voice and implements the system through pytorch.en_US
dc.language.isoenen_US
dc.publisherБГУИРen_US
dc.subjectматериалы конференцийen_US
dc.subjectspeaker recognitionen_US
dc.subjectfeature Extractionen_US
dc.subjectneural networksen_US
dc.titleSoftware for recognizing speaker by voiceen_US
Appears in Collections:Информационная безопасность : материалы 61-й научной конференции аспирантов, магистрантов и студентов (2025)

Files in This Item:
File Description SizeFormat 
Chen_Zhengyu_Software.pdf299.19 kBAdobe PDFView/Open
Show simple item record Google Scholar

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.