基于聚类天气分型的KNN方法在风预报中的应用

陈豫英; 刘还珠; 陈楠; 曾晓青; 马金仁; 刘迁迁; 马筛艳

基于聚类天气分型的KNN方法在风预报中的应用

Application of KNN to Wind Forecast Based on Clustering Synoptic Patterns

摘要

摘要: 以模式识别和相似预报思想为基础, 建立基于自组织神经网络 (SOM) 的聚类天气分型和交叉验证的K最近邻域非参数估计仿真模型 (KNN)。该模型首先以自组织神经网络技术对西北地区的高空流场和高度场进行聚类分型, 针对不同天气形势下的历史样本, 通过交叉检验, 分别寻求各类天气型下的最佳K组合。为了验证聚类天气分型对KNN方法的影响, 使用2003—2006年冬半年T213数值预报产品和宁夏日最大风速资料, 同时建立了宁夏冬半年日最大风速≥6 m/s天气分型和未分型的KNN预报模型, 并对2007年1—5月进行了预报试验, 预报评估结果表明:天气分型后的预报模型总体上降低了预报空报率, 提高了预报准确率, 特别是某些类天气型, 提高幅度更大, 为分类相似预报开拓了思路。

Abstract: Based on the model identification and an analogue forecasting, a new approach based on Self-Organizing feature Map (SOM) and cross validation is constructed, which is called K-nearest neighbor nonparametric estimation bootstrap model (KNN). 500 hPa geopotential height and 700 hPa u, v wind field over Northwest China are analyzed by the model clusterings at first, then the optimal K combination is sought using cross validation aiming at past samples under different weather patterns. Forecasting identification value of each synoptic pattern is determined by K-data, according to historical record. When forecasting in real time, what kind of synoptic pattern is to be known first, then K-data of different time is used to compute the nearest neighbor of real forecasting predictor to historical material predictor. Finally forecasting conclusion is obtained by using the standard of forecasting identification value. In order to validate the effect on cluster synoptic pattern to KNN, T213 NWP products from 2003 to 2006 in winter half year and the data of daily maximum velocity in Ningxia are used to construct prediction models of daily maximum velocity≥6 m/s pattern in Ningxia under synoptic and non-synoptic patterns at one time, data from Jan to May in 2007 is used for forecast experiments. The forecast evaluation results show that although the probability of original sample is reduced when adding the Self-Organizing feature Map of KNN, more false alarms in forecasting are avoided, so that the effect of forecasting is improved in general, especially the forecasting effects of some synoptic patterns compared with those that aren't patterned. The result is that the forecasting information of Ningxia high wind can be reflected by improved KNN. What's worth pointing out is that, the number of synoptic patterns is reduced when patterned, so the forecasting will be effected to some extent. It has a good effect for meteorological observing station which has more original samples, but it is not good for the ones that have less original samples. Therefore if there are more historical data which can reflect the wide range of system changing, the forecast accuracy will be improved significantly and it has a great value for operational usage. Classification analogue prediction thinking can be expanded by these results.

HTML全文

参考文献(18)

施引文献

资源附件(0)