Text this: A two-stage multi-scale attention-based network for weakly supervised cataract fundus image enhancement