Text this: E-CLIP: An Enhanced CLIP-Based Visual Language Model for Fruit Detection and Recognition