Text this: A data and knowledge-driven practice for ensuring stability in ultra-large intelligent computing clusters