Text this: Optimizing document management and retrieval with multimodal transformers and knowledge graphs