Text this: Multi-branch convolutional neural network with cross-attention mechanism for emotion recognition