Meaningful Multimodal Emotion Recognition Based on Capsule Graph Transformer Architecture