Text this: Echocardiographic video-driven multi-task learning model for coronary artery disease diagnosis and severity grading