Modeling The Visual World: Reconstruction And Neural Episodic Representation