Text this: Moving LLM evaluation forward: lessons from human judgment research