A Study on Provincial-Level Carbon Emission Forecasting Based on Multimodal Data and Spatio-Temporal Neural Networks

Authors

  • Yimai Wang
  • Xinyi Lin
  • Xiaowei Zhu
  • Yuwei Wang
  • Xinyu Wang

DOI:

https://doi.org/10.6911/

Keywords:

Carbon emissions forecasting; multimodal data; graph convolutional recurrent networks; long short-term memory networks; provincial-level analysis; spatio-temporal dependence.

Abstract

Against the backdrop of global warming and the ‘dual carbon’ targets, accurate forecasting of provincial carbon emissions is of great significance for formulating differentiated emission reduction policies and optimising energy structures. This study addresses issues such as the inability of traditional statistical models to capture non-linear characteristics, the limited predictive accuracy of single machine learning models, and the failure of existing deep learning methods to account for the spatial spillover effects of carbon emissions. The study first constructs a provincial multi-modal dataset integrating features such as GDP, industrial output, energy prices and policy intensity, and establishes a spatial adjacency matrix based on geographical proximity; secondly, it proposes a hybrid GCRN-GCNLSTM architecture that accounts for the spatio-temporal dependencies of carbon emissions; finally, model interpretability is enhanced through feature importance analysis and residual diagnostics. Using six major Chinese provinces—Shandong, Guangdong, Jiangsu, Hebei, Zhejiang and Henan—as the empirical subjects, the study conducted a comparative evaluation of ARIMA, LSTM, GCN-LSTM and ensemble models. The results indicate that the LSTM model performed best on the test set, with a coefficient of determination (R²) of 0.8383, a mean absolute error (MAE) of 0.4191 million tonnes, and a root mean square error (RMSE) of 0.5102 million tonnes; the mean absolute percentage error (MAPE) of the ARIMA model was 22.01%, indicating a significant linear component in the carbon emissions time series; The GCN-LSTM model, which incorporates geospatial information, did not outperform the LSTM model. Analysis suggests this may be attributable to the time lag associated with spatial spillover effects, an inappropriate graph structure design, and spatial redundancy in the economic features. The contribution of this study lies in proposing a ‘multimodal features + time series modelling’ forecasting paradigm, systematically evaluating the applicability of various models in provincial carbon emissions forecasting, and providing data support and theoretical grounds for improvements in spatial modelling and the formulation of differentiated emission reduction policies.

Downloads

Download data is not yet available.

References

[1] Desai J ,Ismail M ,Prabhakaran V , et al.GRaph-based analysis for stroke prediction (GRASP): A multi-modal model for identifying first ischemic stroke in high-risk population using UK biobank [J]. Computers in Biology and Medicine, 2026, 209111674-111674.DOI:10.1016/J.COMPBIOMED.2026.111674.

[2] Manandhar S ,Dhamdhere R ,Bahadur T , et al.(1325) - Multimodal Fusion of Biopsy and Electrocardiogram Data for Rejection Risk Prediction After Heart Transplantation[J].Journal of Heart and Lung Transplantation,2026,45(5S):612-613.DOI:10.1016/J.HEALUN.2026.02.1338.

[3] Wang B ,Xu K .Contrastive-Aware Hierarchical Multimodal Fusion Network for Hyperspectral and LiDAR Joint Classification [J]. Applied Soft Computing, 2026, 198115266-115266.DOI:10.1016/J.ASOC.2026.115266.

[4] Cekderi B A ,Guo S ,Han L .AI-assisted synthetic medical report generation using radiomic feature correlation-based PCA scores of multimodal data for dementia progression[J].Information Fusion,2026,134104369-104369.DOI:10.1016/J.INFFUS.2026.104369.

[5] Zeng Y ,Zhou X ,Yang Y , et al.Application prospect of machine learning bridging rock/lithology identification and engineering rock mass characterization: A review[J].Tunnelling and Underground Space Technology incorporating Trenchless Technology Research,2026,174107693-107693.DOI:10.1016/J.TUST.2026.107693.

[6] Weeraratna C ,McGain F ,Udy A A , et al.Sustainable continuous renal replacement therapy: The influence of blood flow rates, effluent dose, autoeffluent, and citrate anticoagulation on carbon dioxide emissions [J]. Critical Care and Resuscitation, 2026, 28(2): 100176-100176. DOI:10.1016/J.CCRJ.2026.100176.

[7] Fitha J F ,Mathew M ,Burman D K P , et al.Dynamics and future projections of Indian forest carbon stocks under different emission pathways using CMIP6 and LPJ-GUESS[J].Environmental Research: Climate,2026,5(2):025019-025019.DOI:10.1088/2752-5295/AE4F15.

[8] Zhu J ,Xie J .The impact of China's carbon emission trading system on urban green development: The mediating role of green technology and green finance [J]. Journal of Asian Economics, 2026, 104102177-102177.DOI:10.1016/J.ASIECO.2026.102177.

[9] Wang J ,Wang F ,Li Z , et al.Extended carbon emission efficiency of coal resource-based cities in China: from the perspective of water-land-energy-food pressure and vegetation carbon sinks [J]. Sustainable Energy Technologies and Assessments, 2026, 89104969-104969.DOI:10.1016/J.SETA.2026.104969.

[10] Wu T ,Cheng X ,Huang Y , et al.Lithium-Ion Battery SOH Prediction Method Based on Feature Type Analysis and GAPSO-GCRN[J].Journal of The Electrochemical Society,2025,172(11):110517-110517.DOI:10.1149/1945-7111/AE1AB4.

[11] Lan C ,Chen H ,Zhang L , et al.Research on Speech Enhancement Algorithm by Fusing Improved EMD and GCRN Networks [J]. Circuits, Systems, and Signal Processing, 2024, 43(7): 4588-4604.DOI:10.1007/S00034-024-02677-3.

[12] Garg S ,Ahuja R ,Singh R .A three-tier energy efficient architecture integrating virtual machine allocation and consolidation leveraging NSGA-II and LSTM for cloud data center[J].Sustainable Computing: Informatics and Systems, 2026, 50101363-101363.DOI:10.1016/J.SUSCOM.2026.101363.

[13] Huang X ,Huang Z ,Liu B , et al.Hierarchical LSTM with synergistic fuzzy information granulation for multivariate long-term forecasting [J]. Fuzzy Sets and Systems, 2026, 538109897-109897.DOI:10.1016/J.FSS.2026.109897.

[14] Jeong J ,Goodall L J ,Quinn D J .Comparing strategies for training LSTM models for street-scale urban flood prediction in Norfolk, Virginia[J].Journal of Hydrology: Regional Studies,2026,65103405-103405.DOI:10.1016/J.EJRH.2026.103405.

[15] A M H ,Sivakumar S ,Ramesh S .Advanced LSTM-based approach for fault detection and shading pattern identification in solar photovoltaic system using the Internet of Things[J].Next Energy, 2026, 12100622-100622.DOI:10.1016/J.NXENER.2026.100622.

Downloads

Published

2026-05-14

Issue

Section

Articles

How to Cite

Wang, Y., Lin, X., Zhu, X., Wang, Y., & Wang, X. (2026). A Study on Provincial-Level Carbon Emission Forecasting Based on Multimodal Data and Spatio-Temporal Neural Networks. World Scientific Research Journal, 12(5), 103-113. https://doi.org/10.6911/