Video Understanding With Minimal Human Supervision