Agent-space transfer learning for sign-based world model

Rovbo, M. A.

Please use this identifier to cite or link to this item: https://libeldoc.bsuir.by/handle/123456789/38682

Title:	Agent-space transfer learning for sign-based world model
Other Titles:	Передача опыта в задаче обучения в пространстве агента для знаковых моделей мира
Authors:	Rovbo, M. A.
Keywords:	материалы конференций;transfer learning;reinforcement learning;semiotic control;robotics
Issue Date:	2020
Publisher:	БГУИР
Citation:	Rovbo, M. A. Agent-space transfer learning for sign-based world model / Maksim A. Rovbo // Открытые семантические технологии проектирования интеллектуальных систем = Open Semantic Technologies for Intelligent Systems (OSTIS-2020) : сборник научных трудов / Белорусский государственный университет информатики и радиоэлектроники ; редкол.: В. В. Голенков (гл. ред.) [и др.]. – Минск, 2020. – Вып. 4. – С. 327–332.
Abstract:	Several algorithms were studied for transfer learning problem for a foraging task. These algorithms are suitable for a semiotic control system to facilitate experience transfer between two learning agents with different agentspace descriptions of the state in a predicate form. They require that the target agent’s description of the task is a subset of the source agent’s description. The algorithms are based on the Q-learning algorithm and uses a custom transfer function to initialize the target agent’s state-action table, which matches the corresponding predicates. The test problem used to test the algorithms is a foraging task, where an agent must gather randomly generated food in a grid world. The results show that they provide improvement in the learning curve for the target agent after transferring experience from an agent with more input predicates. The improvement is inconsistent and sometimes does not bring noticeable difference in performance, but the target agent performs at least as good as an agent with no prior experience.
Alternative abstract:	Исследовано несколько алгоритмов для задачи передачи опыта при обучении на примере задачи фуражировки. Эти алгоритмы подходят для семиотической системы управления, чтобы облегчить передачу опыта между двумя обучающимися агентами с различными описаниями состояния в пространстве агента в форме предикатов. Они требуют, чтобы описание задачи целевого агента было подмножеством описания исходного агента. Алгоритмы основаны на алгоритме Q-обучения и используют специальную функцию передачи опыта, инициализирующую таблицу состояний-действий целевого агента, которая сопоставляет соответствующие предикаты. Тестовая задача, используемая для оценки алгоритмов, является задачей фуражировки, когда агент должен собирать случайно сгенерированную пищу в мире-сетке.
URI:	https://libeldoc.bsuir.by/handle/123456789/38682
Appears in Collections:	OSTIS-2020

Files in This Item:

File	Description	Size	Format
Rovbo_Agent-space.pdf		177.04 kB	Adobe PDF	View/Open

Show full item record Google Scholar