Probing and enhancing the reliance of Transformer models on poetic information