From KL Divergence to Wasserstein Distance: Enhancing Autoencoders with FID Analysis