Text this: Channel-shuffled transformers for cross-modality person re-identification in video