Text this: Transferable Feature Representation for Visible-to-Infrared Cross-Dataset Human Action Recognition