Text this: A large-scale open image dataset for deep learning-enabled intelligent sorting and analyzing of raw coal