Text this: A Dual-Channel and Frequency-Aware Approach for Lightweight Video Instance Segmentation