Text this: Attention-enhanced multimodal feature fusion network for clothes-changing person re-identification