Text this: Monocular vision guided deep reinforcement learning UAV systems with representation learning perception