Text this: Multi-Task Supervised Alignment Pre-Training for Few-Shot Multimodal Sentiment Analysis