Research on Functional Data Method Based on Air Quality Index of Sichuan Basin
DOI:
https://doi.org/10.6911/WSRJ.202510_11(10).0005Keywords:
Air quality; Functional principal component analysis; Non-negative matrix factorization; Cluster analysis.Abstract
So far, there have been relatively few studies on air quality in the Sichuan Basin. This paper uses the functional data analysis method to study the air quality data of the Sichuan Basin. Firstly, the discrete air quality data of 21 prefecture-level cities in the Sichuan Basin were smoothed using the Fourier basis function to transform them into continuous functional data. Then, the principal component analysis of the smoothed air quality index was conducted. Secondly, the non-negative matrix factorization method is introduced to achieve data dimensionality reduction and retain key information through the linear combination of basis vectors, and to conduct cluster analysis on the air quality data of the Sichuan Basin. The research results show that the fluctuation of the air quality index in the Sichuan Basin has a distinct seasonal pattern. The cumulative variance contribution rate of the first three principal components reached 95.3%, which can explain most of the original data. According to the clustering results, the Sichuan Basin can be divided into three types of regions, namely the core area, the marginal area and the mountainous area in southwest China. These studies provide data support and theoretical basis for the future pollution prevention and control policies in the Sichuan Basin.
Downloads
References
[1] J.O. Ramsay: When the data are functions, Psychometrika, Vol. 47 (1982) No. 4, p.379-396.
[2] J. Suhaila, A.A. Jemain, M.F. Hamdan and W.Z.W. Zin: Comparing rainfall patterns between regions in Peninsular Malaysia via a functional data analysis technique, Journal of Hydrology, Vol. 411 (2011) No. 3-4, p.197-206.
[3] H. Sørensen, J. Goldsmith and L.M. Sangalli: An introduction with medical applications to functional data analysis, Statistics in Medicine, Vol. 32 (2013) No. 30, p.5222-5240.
[4] Y. Méneroux, A. Le Guilcher, G. Saint Pierre, M. Ghasemi Hamed, S. Mustière and O. Orfila: Traffic signal detection from in-vehicle GPS speed profiles using functional data analysis and machine learning, International Journal of Data Science and Analytics, Vol. 10 (2020) No. 1, p.101-119.
[5] B.M. Petronio, M. Pietrantonio, M. Pietroletti and N. Cardellicchio: Metal Speciation and Bio-Availability in Marine Sediments of Northern Adriatic Sea, Proc. Seventh FECS Conference on Environmental Science and Pollution Research (Rome, Italy, 2000), Vol. 1, p.320.
[6] J.H. Bai: Functional Data Analysis of Air Quality in Fen-Wei Plain (MS., Guangxi Normal University, China 2023), p.15.
[7] M. Li: Analysis of water consumption data function based on principal component analysis method (MS., Hefei University of Technology, China 2015), p.10.
[8] D.I.N.G. Hui, W.C. Xu, H.B.U. Zhu, G.W.A.N.G. Guochang, Z.H.A.N.G. Tao and Z.H.A.N.G. Riquan: Review of Regression Analysis for Functional Data, Chinese Journal of Applied Probability and Statistics, Vol. 34 (2018) No. 6, p.630-654. (In Chinese)
[9] W.P. Wang: Based on the Theory of Functional Data to Classify the Working Population (MS., Northeast Normal University, China 2010), p.10.
[10] X.J. Cheng: Functional Data Analysis of Air Quality in Yangtze River Delta (MS., Guangxi Normal University, China 2021), p.20.
[11] Y. Zhang: Research on Functional Data Analysis Method Based on Runoff Data of the Main Stream of the Yellow River (MS., Lanzhou University of Finance and Economics, China 2024), p.22.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 World Scientific Research Journal

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.