Novel Neural Architectures & Algorithms For Efficient Inference