Reinforcement learning and digital twin-driven optimization of production scheduling with the digital model playground