Artificial agent learns representative trajectories by imitation
How does a cheetah learn to chase gazelles? By imitating their motion!
In this application:
- I introduced a method for trajectory imitation by bringing ideas from unsupervised image to image translation to the reinforcement learning realm.
- The agent is given the current position of the trajectory to imitate and, at each step, has to decide the steering direction and speed to guess the next position.
- By presenting the agent with many episodes involving a distribution of trajectories, it learns to rollout a typical motion from a single initial state.
For more details, check out the paper.