A simulated dataset for proactive robot task inference from streaming natural language dialogues