Text this: A multimodal differential privacy framework based on fusion representation learning