Text this: A framework for assessing reliability of observer annotations of aerial wildlife imagery, with insights for deep learning applications