Towards Generalization and Efficiency in Reinforcement Learning