Initiating language engagement with multimodal learning tasks