Text this: Spatial-Temporal Sequence Attention Based Efficient Transformer for Video Snow Removal