Hardware-Software Co-Design for Optimizing MPI Programs in Data Center Network