Text this: Explainable vision transformer for automatic visual sleep staging on multimodal PSG signals