A Multimodal Large Language Model Framework for Intelligent Perception and Decision-Making in Smart Manufacturing