Three-stage cascade architecture-based siamese sliding window network algorithm for object tracking