From words to vectors: text vectorization techniques in natural language processing

Krez, K. S.

Please use this identifier to cite or link to this item: https://libeldoc.bsuir.by/handle/123456789/59949

Title:	From words to vectors: text vectorization techniques in natural language processing
Authors:	Krez, K. S.
Keywords:	материалы конференций;encoding;vectorization;text;algorithm;analysis
Issue Date:	2025
Publisher:	БГУИР
Citation:	Krez, K. S. From words to vectors: text vectorization techniques in natural language processing / K. S. Krez // Электронные системы и технологии : сборник материалов 61-й научной конференции аспирантов, магистрантов и студентов БГУИР, Минск, 21–25 апреля 2025 г. / Белорусский государственный университет информатики и радиоэлектроники ; редкол.: Д. В. Лихаческий [и др.]. – Минск, 2025. – С. 60–62.
Abstract:	This paper discusses the process of text vectorization, which is a key step in Natural Language Processing (NLP). The main vectorization methods are described, including One-Hot Encoding, Bag of Words, TF-IDF and Word Embeddings. The advantages and disadvantages of each method are analyzed, as well as their application in various NLP tasks. Text vectorization converts textual data into numerical vectors that can be processed by machine learning algorithms. This is necessary because computers cannot directly interpret text.
URI:	https://libeldoc.bsuir.by/handle/123456789/59949
Appears in Collections:	Электронные системы и технологии : материалы 61-й конференции аспирантов, магистрантов и студентов (2025)

Files in This Item:

File	Description	Size	Format
Krez_From.pdf		315.02 kB	Adobe PDF	View/Open

Show full item record Google Scholar