A compound-target pairs dataset: differences between drugs, clinical candidates and other bioactive compounds