Xie Shun, Sun Xiaogong, Zhang Suping, et al. Precipitation forecast correction in South China based on SVD and machine learning. J Appl Meteor Sci, 2022, 33(3): 293-304. DOI:  10.11898/1001-7313.20220304.
Citation: Xie Shun, Sun Xiaogong, Zhang Suping, et al. Precipitation forecast correction in South China based on SVD and machine learning. J Appl Meteor Sci, 2022, 33(3): 293-304. DOI:  10.11898/1001-7313.20220304.

Precipitation Forecast Correction in South China Based on SVD and Machine Learning

DOI: 10.11898/1001-7313.20220304
  • Received Date: 2021-11-02
  • Rev Recd Date: 2022-01-24
  • Publish Date: 2022-05-31
  • Precipitation can be induced by various weather systems and a series of complex physical processes, so its prediction is relatively difficult in weather forecasting. Due to the limitation of numerical model, the prediction error is inevitable. It is a hot topic in meteorological research and operation to explore a more effective method to correct the model product, and to improve the interpretation and applicability. To explore a more effective model product error correction method, a combination of correction methods is put forward, based on singular value decomposition (SVD) and machine learning, including multiple linear regression, LASSO regression and Ridge regression. The results are compared with the traditional matrix coefficient method, and then correction models are tested in pre-flood season precipitation forecast in South China, by correcting European Centre for Medium-Range Weather Forecasts (EC) product. The result shows that the proposed correction models combining SVD and machine learning can effectively reduce the error of EC product. The maximum optimization rate root mean square error is 4.2%, and more than 69% of the stations are optimized to different degrees. These correction models have better robustness to deal with the problem of collinearity between factors, and the correction effect is better than that of the traditional matrix coefficient method. Furthermore, the weighted integration of multiple correction models is carried out by assigning different weights to different models, and the root mean square error by the integrated approach in South China is smaller than EC product and any single correction model. It shows that the weighted ensemble method can better integrate the advantages of multiple correction models and enlarge the advantages. For the weighted ensemble of multiple correction models, it is not only better than the precipitation prediction results of EC product, but also better than any one of the integrated correction models. Its optimization rate of root mean squared error can achieve 5.7%, and more than 77% of the stations are optimized to different degrees.

  • Fig. 1  Distribution of ground sites in South China

    Fig. 2  Correction process

    Fig. 3  Root mean square error of EC product

    Fig. 4  Comparison of root mean square error difference of different models, method to EC product before and after correction

    (a)model Ⅰ, (b)model Ⅱ, (c)model Ⅲ, (d)model Ⅳ, (e)weighted ensemble

    Fig. 5  Ratio of positive and negative sites to the correction amplitude

    (a)model Ⅰ, (b)model Ⅱ, (c)model Ⅲ, (d)model Ⅳ, (e)weighted ensemble

    Fig. 6  Distribution of individual representative sites

    Fig. 7  Comparison of positive and negative correction sites

    (RMSE1 is root mean square error of the EC product, RMSE2 is root mean square error of weighted ensemble)

    Fig. 8  Comparison of two precipitation cases on 12 Jun 2018 and 9 May 2019 before and after correction

    Table  1  Cumulative variance contribution of the top 10 modes

    模态序号 累计方差贡献
    1 0.5162
    2 0.6220
    3 0.6763
    4 0.7220
    5 0.7516
    6 0.7716
    7 0.7888
    8 0.8014
    9 0.8112
    10 0.8197
    DownLoad: Download CSV

    Table  2  Correction effect in different models and method

    模型 均方根误差/(mm·d-1) 优化率/%
    模型Ⅳ 13.26 4.1
    模型Ⅱ 13.26 4.1
    模型Ⅲ 13.24 4.2
    模型Ⅰ 13.24 4.2
    加权集成方法 13.01 5.7
    DownLoad: Download CSV
  • [1]
    Luo Y L, Zhang R H, Wan Q L, et al. The Southern China monsoon rainfall experiment (SCMREX). Bull Amer Meteor Soc, 2016, 98(5): 999-1013.
    [2]
    Wilks D S. Comparison of ensemble-MOS methods in the Lorenz '96 setting. Meteor Appl, 2006, 13(3): 243-256. doi:  10.1017/S1350482706002192
    [3]
    Scheuerer M, Hamill T M. Statistical postprocessing of ensemble precipitation forecasts by fitting censored, shifted Gamma distributions. Mon Wea Rev, 2015, 143(11): 4578-4596. doi:  10.1175/MWR-D-15-0061.1
    [4]
    Ye D Z, Yan Z W, Dai X G, et al. A Discussion of future system of weather and climate prediction. Meteor Mon, 2006, 32(4): 3-8. https://www.cnki.com.cn/Article/CJFDTOTAL-QXXX200604000.htm
    [5]
    He L F, Chen T, Kong Q. A review of studies on prefrontal torrential rain in South China. J Appl Meteor Sci, 2016, 27(5): 559-569. doi:  10.11898/1001-7313.20160505
    [6]
    Wu N G, Wen Z P, Deng W J, et al. Advances in warm-sector heavy rainfall during the first rainy season in South China. J Meteor Sci, 2020, 40(5): 605-616. https://www.cnki.com.cn/Article/CJFDTOTAL-QXKX202005005.htm
    [7]
    Ding Y H. The major advances and development process of the theory of heavy rainfalls in China. Torrential Rain and Disasters, 2019, 38(5): 395-406. https://www.cnki.com.cn/Article/CJFDTOTAL-HBQX201905002.htm
    [8]
    Applequist S, Gahrs G E, Pfeffer R L, et al. Comparison of methodologies for probabilistic quantitative precipitation forecasting. Wea Forecasting, 1991, 17(4): 783-799.
    [9]
    Bi B G, Dai K, Wang Y, et al. Advances in techniques of quantitative precipitation forecast. J Appl Meteor Sci, 2016, 27(5): 534-549. doi:  10.11898/1001-7313.20160503
    [10]
    Li L, Zhu Y J. The establishment and research of T213 precipitation calibration system. J Appl Meteor Sci, 2006, 17(Suppl Ⅰ): 130-134. https://www.cnki.com.cn/Article/CJFDTOTAL-YYQX2006S1018.htm
    [11]
    Sun J, Cheng G G, Zhang X L. An improved bias removed method for precipitation prediction and its application. J Appl Meteor Sci, 2015, 26(2): 173-184. doi:  10.11898/1001-7313.20150205
    [12]
    Zhu Q G, Chen X G. Objective division of natural rainfall regions in China. J Nanjing Ins Meteor, 1992, 15(4): 467-475. https://www.cnki.com.cn/Article/CJFDTOTAL-NJQX199204002.htm
    [13]
    Su H J, Wang Q G, Yang J, et al. Error correction on summer model precipitation of China based on the singular value decomposition. Acta Phys Sinica, 2013, 62(10): 494-503. https://www.cnki.com.cn/Article/CJFDTOTAL-WLXB201310076.htm
    [14]
    Qiu C J, Chou J F. Similarity of weather forecast-dynamic method. Chinese J Atmos Sci, 1989, 13(1): 22-28. https://www.cnki.com.cn/Article/CJFDTOTAL-DQXK198901002.htm
    [15]
    Ren H L, Chou J F. Analogue correction method of errors by combining both statistical and dynamical methods together. Acta Meteor Sinica, 2005, 63(6): 988-993. doi:  10.3321/j.issn:0577-6619.2005.06.015
    [16]
    Wang J X. The correlation analysis of precipitation fields in the middle and lower reaches in Changjiang River during Meiyu period and 500 hPa monthly mean height fields. Scientia Meteorologica Sinica, 1989, 32(3): 311-321. https://www.cnki.com.cn/Article/CJFDTOTAL-QXKX198903009.htm
    [17]
    Liu Z X, Lian Y, Shen B Z, et al. Seasonal variation features of 500 hPa height in North Pacific oscillation region and its effect on precipitation in Northeast China. J Appl Meteor Sci, 2003, 14(5): 553-561. doi:  10.3969/j.issn.1001-7313.2003.05.005
    [18]
    You F C, Ding Y G, Zhou Y, et al. Applicability of singular value decomposition and singular cross-spectrum to diagnose of rainfall in North China. J Appl Meteor Sci, 2003, 14(2): 176-187. doi:  10.3969/j.issn.1001-7313.2003.02.005
    [19]
    Qin Z K, Lin Z H, Chen H, et al. EOF/SVD-based short-term climate prediction error correction method and its application. Acta Meteor Sinica, 2011, 69(2): 289-296. https://www.cnki.com.cn/Article/CJFDTOTAL-QXXB201102008.htm
    [20]
    Liu J Y, Deng L J, Fu G B, et al. The applicability of two statistical down-scaling methods to the Qinling Mountains. J Appl Meteor Sci, 2018, 29(6): 737-747. doi:  10.11898/1001-7313.20180609
    [21]
    Ahijevych D, Pinto J O, Williams J K, et al. Probabilistic forecasts of mesoscale convective system initiation using the Random Forest data mining technique. Wea Forecasting, 2016, 31(2): 581-599. doi:  10.1175/WAF-D-15-0113.1
    [22]
    Lin J L, Jin L, Peng H Y. Application of numerical forecast products to regional rainfall forecasting by artificial neural network. Meteor Sci Technol, 2006, 34(1): 12-17. https://www.cnki.com.cn/Article/CJFDTOTAL-QXKJ200601002.htm
    [23]
    Krishnamurti T N, Kishtawal C M, Larow T E, et al. Improved weather and seasonal climate forecasts from multi-model super-ensemble. Science, 1999, 285(5433): 1548-1550. doi:  10.1126/science.285.5433.1548
    [24]
    Li S, Wang Y, Yuan H, et al. Ensemble mean forecast skill and applications with the T213 ensemble prediction system. Adv Atmos Sci, 2016, 33(11): 1297-1305. doi:  10.1007/s00376-016-6155-2
    [25]
    Ma Q, Gong J D, Li L, et al. Study of bias-correction and consensus in regional multi-model super-ensemble forecast. Meteor Mon, 2008, 34(3): 42-48. https://www.cnki.com.cn/Article/CJFDTOTAL-QXXX200803009.htm
    [26]
    Zhi X F, Zhao C. Heavy precipitation forecasts based on multi-model ensemble members. J Appl Meteor Sci, 2020, 31(3): 303-314. doi:  10.11898/1001-7313.20200305
    [27]
    Chen Y W, Huang X M, Li Y et al. Ensemble learning for bias correction of station temperature forecast based on ECMWF products. J Appl Meteor Sci, 2020, 31(4): 494-503. doi:  10.11898/1001-7313.20200411
    [28]
    Wei F Y. Modern Statistical Technnology in Climatological Diagnoses and Prediction. Beijing: China Meteorological Press, 1999.
    [29]
    Huang J Y. Statistic Analysis and Forecast Methods in Meteorology. Beijing: China Meteorological Press, 2000.
    [30]
    Shi N. Multi-analysis in Meteorological Research and Prediction. Beijing: China Meteorological Press, 2002.
    [31]
    Abramson N, Braverman D J, Sebestyen G S. Pattern recognition and machine learning. Pub ASA, 2006, 103(4): 886-887.
    [32]
    Hart P E. The condensed nearest neighbor rule. IEEE Tran Inf Theory, 1968, 14(3): 515-516. doi:  10.1109/TIT.1968.1054155
    [33]
    Xi C, Ishwaran H. Random forests for genomic data analysis. Genomics, 2012, 99(6): 323-329. doi:  10.1016/j.ygeno.2012.04.003
    [34]
    Hu H Y, Lee Y C, Yen T M, et al. Using BPNN and DEMATEL to modify importance-performance analysis model-A study of the computer industry. Exp Sys Appl, 2009, 36(6): 9969-9979. doi:  10.1016/j.eswa.2009.01.062
    [35]
    Aheto J M K, Duah H O, Agbadi P, et al. A predictive model, and predictors of under-five child malaria prevalence in Ghana: How do LASSO, Ridge and Elastic net regression approaches compare?. Prev Med Rep, 2021, 25: 101475.
    [36]
    Melkumova L E, Shatskikh S Y. Comparing Rridge and LASSO estimators for data analysis. Procedia Engineering, 2017, 201: 746-755. doi:  10.1016/j.proeng.2017.09.615
  • 加载中
  • -->

Catalog

    Figures(8)  / Tables(2)

    Article views (885) PDF downloads(252) Cited by()
    • Received : 2021-11-02
    • Accepted : 2022-01-24
    • Published : 2022-05-31

    /

    DownLoad:  Full-Size Img  PowerPoint