Text this: Linguistic-visual based multimodal Yi character recognition