Deep learning acceleration on edge devices with algorithm/hardware co-design