Text this: Explorations into Deep Learning Text Architectures for Dense Image Captioning