Text this: Self-adaptive spatial-temporal network based on heterogeneous data for air quality prediction