🔎By acting greedily w.r.t. sampled models, the exploration of the agent is naturally driven through uncertainty over models of the environments.
The diversity among decoded latent trajectories of different sampled models starting at the same initial state can be seen below: