Efficient Learning In Linearly Solvable Mdp Models.