Multi-Scale Building Load Forecasting Without Relying on Weather Forecast Data: A Temporal Convolutional Network, Long Short-Term Memory Network, and Self-Attention Mechanism Approach