Text this: Dynamic Warping Network for Semantic Video Segmentation