(Part 2/4) In our introduction to meta-reinforcement learning, we presented the main concepts of meta-RL: Meta-Environments are associated with a distribution of distinct MDPs called tasks. The goal of Meta-RL is to learn to leverage prior experience to adapt quickly to new tasks. In Meta-RL, we learn an algorithm during a step called meta-training. At meta-testing, we apply this… Read more »
