Text this: An Efficient Method for Automatic Video Annotation and Retrieval in Visual Sensor Networks