Text this: Documenting Geographically and Contextually Diverse Language Data Sources