Text this: Statistical distributions of thesauri and their uses in classification of science publications