Text this: A lightweight mechanism for vision-transformer-based object detection