PLURELEARN (249254)

  https://cordis.europa.eu/project/id/249254

  FP7 (2007-2013)

  Plural Reinforcement Learning

  Marie Curie Action: "Reintegration Grants" (FP7-PEOPLE-2009-RG)

  reinforcement learning

  2009-11-01 Start Date (YY-MM-DD)

  2013-10-31 End Date (YY-MM-DD)

  € 100,000 Total Cost


  Description

We propose a new paradigm for learning in complex high-dimensions dynamic environments. Our goal is to develop algorithms, theory, and applications that use plurality of learning approaches and models in a synergetic way. Our paradigm considers the task of learning a control policy by combining trial and error in the style of reinforcement learning with learning from a competent teacher whose interaction with the environment can be observed. Instead of using the teacher for imitation, our paradigm is focused on learning good representations of the world-model. We consider four specific issues in the new paradigm: (i) The usage of iteration and reiteration between learning from a teacher and reinforcement learning. (ii) Learning representation and structure from the teacher. (iii) Optimizing policies based on learned representations and reasoning about model uncertainty. (iv) Learning sub-strategies from a teacher and when and how to use them. We will develop algorithms and theory pertaining to the new paradigm and will apply it in two challenging domains: a fighter jet simulator and a network operating center simulator.


  Complicit Organisations

1 Israeli organisation participates in PLURELEARN.

Country Organisation (ID) VAT Number Role Activity Type Total Cost EC Contribution Net EC Contribution
Israel TECHNION - ISRAEL INSTITUTE OF TECHNOLOGY (999907720) IL557585585 coordinator HES € 0 € 100,000 € 0