Title: | From words to vectors: text vectorization techniques in natural language processing |
Authors: | Krez, K. S. |
Keywords: | материалы конференций;encoding;vectorization;text;algorithm;analysis |
Issue Date: | 2025 |
Publisher: | БГУИР |
Citation: | Krez, K. S. From words to vectors: text vectorization techniques in natural language processing / K. S. Krez // Электронные системы и технологии : сборник материалов 61-й научной конференции аспирантов, магистрантов и студентов БГУИР, Минск, 21–25 апреля 2025 г. / Белорусский государственный университет информатики и радиоэлектроники ; редкол.: Д. В. Лихаческий [и др.]. – Минск, 2025. – С. 60–62. |
Abstract: | This paper discusses the process of text vectorization, which is a key step in Natural Language Processing (NLP). The main vectorization methods are described, including One-Hot Encoding, Bag of Words, TF-IDF and Word Embeddings. The advantages and disadvantages of each method are analyzed, as well as their application in various NLP tasks. Text vectorization converts textual data into numerical vectors that can be processed by machine learning algorithms. This is necessary because computers cannot directly interpret text. |
URI: | https://libeldoc.bsuir.by/handle/123456789/59949 |
Appears in Collections: | Электронные системы и технологии : материалы 61-й конференции аспирантов, магистрантов и студентов (2025)
|