Hierarchical deep learning framework for automated marine vegetation and fauna analysis using ROV video data