Advice Replay approach for richer knowledge transfer in Teacher Student framework
Published in International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
Year: 2019
Volume: 4
Pages: 1997-1999
Abstract
One of the major drawbacks of reinforcement learning (RL) is the low sample efficiency of its learning algorithms. In many cases, domain expertise can help mitigate this effect. The Teacher-Student framework is one such paradigm: a more experienced agent (the teacher), upon being queried, accelerates the student's learning by advising which action to take in a given state. Real-world teachers not only provide the action to take in a given state but also give a more informative signal, synthesizing knowledge they have gained with experience. With this motivation, we propose a richer advising framework in which the teacher augments the student's knowledge by also providing the expected long-term reward of following that action. The student can then use this value to steadily guide its Q-Network in the correct direction, which can lead to quicker convergence. To help the student relive the advice received throughout its learning, we introduce an additional memory called the Advice Replay Memory (ARM). Results show that a student following our approach (a) is able to exploit the environment better, and (b) has a steeper learning curve. © 2019 International Foundation for Autonomous Agents and Multiagent Systems (www.ifaamas.org). All rights reserved.
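The advising idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not specify the loss, the network, or how advice is sampled, so the sketch swaps the Q-Network for a tabular value estimate and assumes a simple update that moves the student's estimate toward the teacher's value. The names `AdviceReplayMemory`, `TabularStudent`, `replay_advice`, and `advice_lr` are hypothetical and chosen only for illustration.

```python
import random
from collections import defaultdict, deque


class AdviceReplayMemory:
    """Bounded store of (state, action, teacher_value) advice tuples.

    Hypothetical stand-in for the ARM named in the abstract.
    """

    def __init__(self, capacity=1000, seed=None):
        self._items = deque(maxlen=capacity)  # oldest advice is evicted first
        self._rng = random.Random(seed)

    def add(self, state, action, teacher_value):
        self._items.append((state, action, teacher_value))

    def sample(self, batch_size):
        # Sample without replacement, capped at the number of stored items.
        k = min(batch_size, len(self._items))
        return self._rng.sample(list(self._items), k)

    def __len__(self):
        return len(self._items)


class TabularStudent:
    """Tabular substitute for the student's Q-Network (an assumption)."""

    def __init__(self, advice_lr=0.5):
        self.q = defaultdict(float)  # maps (state, action) -> estimated value
        self.advice_lr = advice_lr   # assumed step size for advice updates

    def replay_advice(self, memory, batch_size=32):
        # Move each advised estimate a fraction of the way toward the
        # teacher's expected long-term reward for that state-action pair.
        for state, action, teacher_value in memory.sample(batch_size):
            key = (state, action)
            self.q[key] += self.advice_lr * (teacher_value - self.q[key])


# Usage: the teacher advises "left" in state "s0" with expected return 1.0.
arm = AdviceReplayMemory(capacity=100, seed=0)
arm.add("s0", "left", 1.0)

student = TabularStudent(advice_lr=0.5)
student.replay_advice(arm)  # estimate moves from 0.0 to 0.5
student.replay_advice(arm)  # estimate moves from 0.5 to 0.75
```

Because stored advice can be replayed many times, a single query to the teacher keeps pulling the student's estimate toward the advised value. In a deep RL setting, these replays would presumably be interleaved with ordinary experience replay updates.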
About the journal
Journal: Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
Publisher: International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
ISSN: 1548-8403
Open Access: No
Concepts (13)
  •  Deep learning
  •  Knowledge management
  •  Learning algorithms
  •  Multi agent systems
  •  Reinforcement learning
  •  Students
  •  Domain expertise
  •  Knowledge transfer
  •  Learning curves
  •  Real-world
  •  Student framework
  •  Transfer learning
  •  Autonomous agents