PLURELEARN (249254)
https://cordis.europa.eu/project/id/249254
FP7 (2007-2013)
Plural Reinforcement Learning
Marie Curie Action: "Reintegration Grants" (FP7-PEOPLE-2009-RG)
reinforcement learning
2009-11-01 Start Date (YYYY-MM-DD)
2013-10-31 End Date (YYYY-MM-DD)
€ 100,000 Total Cost
Description
We propose a new paradigm for learning in complex, high-dimensional dynamic environments. Our goal is to develop algorithms, theory, and applications that use a plurality of learning approaches and models in a synergistic way. Our paradigm considers the task of learning a control policy by combining trial and error, in the style of reinforcement learning, with learning from a competent teacher whose interaction with the environment can be observed. Instead of using the teacher for imitation, our paradigm focuses on learning good representations of the world model. We consider four specific issues in the new paradigm: (i) iterating between learning from a teacher and reinforcement learning; (ii) learning representation and structure from the teacher; (iii) optimizing policies based on learned representations and reasoning about model uncertainty; (iv) learning sub-strategies from a teacher, and when and how to use them. We will develop algorithms and theory pertaining to the new paradigm and will apply it in two challenging domains: a fighter jet simulator and a network operating center simulator.
Complicit Organisations
1 Israeli organisation participates in PLURELEARN.

Country | Organisation (ID) | VAT Number | Role | Activity Type | Total Cost | EC Contribution | Net EC Contribution |
---|---|---|---|---|---|---|---|
Israel | TECHNION - ISRAEL INSTITUTE OF TECHNOLOGY (999907720) | IL557585585 | coordinator | HES | € 0 | € 100,000 | € 0 |