TDCN: A novel temporal depthwise convolutional network for short-term load forecasting

Accurate and efficient short-term load forecasting (STLF) is crucial for the reliable and economic operation of the electric grid. However, with the growing integration of renewable energy sources like wind power and photovoltaics, load data have become increasingly complex and nonlinear, making acc...

Full description

Saved in:
Bibliographic Details
Main Authors: Mingping Liu, Chenxu Xia, Yuxin Xia, Suhui Deng, Yuhao Wang
Format: Article
Language:English
Published: Elsevier 2025-04-01
Series:International Journal of Electrical Power & Energy Systems
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S0142061525000638
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Accurate and efficient short-term load forecasting (STLF) is crucial for the reliable and economic operation of the electric grid. However, with the growing integration of renewable energy sources like wind power and photovoltaics, load data have become increasingly complex and nonlinear, making accurate forecasting a challenging task in modern power systems. While numerous STLF models have been developed, many of these models are complex hybrid structures that struggle with issues such as overfitting, stacking errors, high computational costs, and suboptimal generalization. To address these challenges, this paper proposes a novel temporal depthwise convolutional network model for STLF. First, a dilated causal convolution is employed to optimize the depthwise convolution, taking advantage of its ability to exponentially increase sampling points and expand the receptive field, thereby improving the capture of temporal information within low-dimensional channels. Next, pointwise convolution networks are utilized to adjust the channel dimension. The feature map is initially expanded to higher dimension and then projected back to a low-dimensional matrix, forming an improved depthwise separable convolution model with an inverted bottleneck structure. This design minimizes information loss and leakage during the transformation of compressed feature space, leading to enhanced prediction accuracy while reducing the number of training parameters. Finally, layer normalization and Gaussian error linear unit are incorporated to further improve the model’s convergence and nonlinear representation capabilities. To evaluate the effectiveness and generalization of the proposed model, experiments are conducted using two real-world datasets. Comparative experiments with other state-of-the-art methods are also performed. The results demonstrate that the proposed model outperforms existing approaches in terms of prediction accuracy, computational efficiency, and generalization.
ISSN:0142-0615