Text this: Multi-vessel target tracking with camera fusion for unmanned surface vehicles