中国地面月气候资料质量控制方法的研究
The Quality Control of Surface Monthly Climate Data in China
-
摘要: 由于气象台站信息变化以及观测规范的频繁变动, 使中国地面气象资料序列比较复杂。该文在分析中国地面历史气象资料中可能存在的非均一性和错误性、研究月气候资料序列存在的可能分布状态的基础上, 提出了中国地面月气候资料质量控制技术。通过分析认为, 中国地面月气候资料的质量控制, 应把下列3种方式结合起来, 进行综合质量检测:①把12个月的气候资料序列联合起来, 进行长时间连续性错误资料检测; ②把时间序列变换为接近正态分布的均一序列后, 进行时间域和空间域的奇异值检测; ③对检测出的连续性可疑资料和单个数据点的奇异值进行人工分析辨别。利用上述质量控制方法, 对1971—2000年中国地面700多基准基本站月统计气候资料进行了质量检查, 最后介绍了质量检查结果。Abstract: It is generally agreed that outliers detection as well as outliers identification is of primary importance to quality control (QC) of observational data. Using some traditional quality techniques such as high-low extreme check, confidence limit control, internal consistency check etc, China historical surface meteorological data has been examined over and over, but there is a wide variety of erroneous values that are not been detected yet. If the continuity and distribution state of a data series and the outliers existence are not been understood well beforehand, some special erroneous values cannot be detected. The surface climate data series become more complex and inhomogeneous as a result of station moves, changes in the environment surrounding a station, and frequent changes in observational criterion in China. Therefore, the distribution of the data series from a great number of stations in China is not a normal distribution. Though discontinuities and inhomogeneities in time series are not of the field of QC, they have an effect upon traditional QC result. On the other hand, there would be problems in the sequential monthly climate data even the data among years as a result of measurement instrument errors, that of instrument calibration, a gradual shift in the physical characteristics of the instrument apparatus, or misoperation by observers etc if the above problems could not be solved in time. Perhaps the kind of data are not of great difference to the normal data, but they have certain impact on climate analysis. After analyzing the inhomogeneities, distribution state of the series and erroneous data in existence in China historical surface climate data, the QC method of surface monthly climate data in China has been developed, which is a breakthrough to traditional QC techniques of monthly climate data. It turns out that the quality control of China surface monthly climate data should include the following three steps: The check of continuous erroneous data after integrating the 12 monthly time series into a new individual series; the temporal check and spatial check of outliers after time series converted from likely inhomogeneous distribution to homogeneous one; manual advanced identification of continuous suspicious data and outliers. With the above QC method, about 250000 surface monthly climate data of base stations in China from 1971 to 2000 is examined. The climate data contains more than 10 monthly variables: Surface air temperature, surface air relative humidity, wind speed, skin surface temperature, eight layers of soil temperature, sunshine duration, pan evaporation, frozen earth depth and snow depth etc. 136 erroneous monthly climate data referring to various variables are detected in total. The causes of erroneous monthly data according with original data such as hourly data is the following: Use other station data or monthly data to substitute true data; miss-recording such as enlarging or reducing 10 times in original data; the original data should not be equal to 0, but the record data is 0; the measurement instrument is mal functioning.
-
Key words:
- quality control;
- surface monthly climate data;
- China
-
图 1 月气候资料序列存在的4种时间变化曲线图
(图中与各序列曲线相交的水平直线均为相应序列的平均值线。曲线1为均一序列, 曲线2为仅存在个别奇异值的序列, 曲线3为无奇异值却有类似非均一性存在的序列, 曲线4为既有奇异值又有类似非均一性存在的序列)
Fig. 1 Four types of monthly climatic data series
(The horizontal lines show respective averages of the series, line 1 is homogeneous time series, line 2 is time series with some outliers, line 3 is probable inhomogeneous time series without any outliers, line 4 is probable inhomogeneous time series with some outliers)
表 1 1971—2000年中国基准基本站地面信息化资料错情
Table 1 Information about surface meteorological data in error of base stations in China from 1971 to 2000
-
[1] Eischeid Jon C, Bruce Baker, Tom Karl, et al.The quality control of long-term climatological data using objective data analysis.J Appl Met, 1995, 34:2787-2795. doi: 10.1175/1520-0450(1995)034<2787:TQCOLT>2.0.CO;2 [2] Grant Eugene L, Richard S Leavenworth. Statistical Quality Control. New York: McGraw-Hill Book Company, 1972: 1-694. [3] Lev S Gandin. Complex quality control of meteorological observations. Mon Wea Rev, 1988, 116(5): 1137-1156. doi: 10.1175/1520-0493(1988)116<1137:CQCOMO>2.0.CO;2 [4] 任芝花, 刘小宁, 杨文霞.极端异常气象资料的综合性质量控制与分析.气象学报, 2005, 63(4): 526-533. http://www.cnki.com.cn/Article/CJFDTOTAL-QXXB200504013.htm [5] Igor Zahumensk. Guidelines on Quality Control Procedures for Data from Automatic Weather Stations. Expert Team on Requirements for Data from Automatic Weather Stations, Third Session, WMO, 2004. https://www.wmo.int/pages/prog/www/OSY/Meetings/ET-AWS3/Doc4(1).pdf [6] Song Feng, Qi Hu, Qian Weihong. Quality control of daily meteorological data in China, 1951-2000: A new dataset. Int J Climatol, 2004, 24: 853-870. doi: 10.1002/(ISSN)1097-0088 [7] Lanzante J R.Resistant, robust and nonparametric techniques for the analysis of climate data: Theory and examples, including applications to historical radiosonde station data. Int J Climatol, 1996, 16: 1197-1226. doi: 10.1002/(ISSN)1097-0088 [8] 刘黎平, 张沛源, 梁海河, 等.双多普勒雷达风场反演误差和资料的质量控制.应用气象学报, 2003, 14(1): 17-29. http://qikan.camscma.cn/jams/ch/reader/view_abstract.aspx?file_no=20030103&flag=1 [9] 周尚河.全国高空资料质量控制和建库方法的研究.应用气象学报, 2000, 11(3): 364-370. http://qikan.camscma.cn/jams/ch/reader/view_abstract.aspx?file_no=20000353&flag=1 [10] 熊安元.北欧气象观测资料的质量控制.气象科技, 2003, 31 (5): 314-320. http://www.cnki.com.cn/Article/CJFDTOTAL-QXKJ200305012.htm [11] 刘小宁, 任芝花.地面气象资料质量控制方法研究概述.气象科技, 2005, 33(3): 199-203. http://www.cnki.com.cn/Article/CJFDTOTAL-QXKJ200503001.htm [12] 王伯民.基本气象资料质量控制综合判别法的研究.应用气象学报, 2004, 15(增刊): 50-59. http://www.cnki.com.cn/Article/CJFDTOTAL-YYQX2004S1008.htm [13] Peterson T C, Vose R S, Schmoyer R, et al. Global historical climatology network (GHCN) quality control of monthly temperature data. Int J Climatol, 1998, 18: 1169-1179. doi: 10.1002/(ISSN)1097-0088 [14] 屠其璞, 王俊德, 丁裕国, 等.气象应用概率统计学.北京:气象出版社, 1984: 38-41. [15] 吴增祥.气象台站历史沿革信息及其对观测资料序列均一性影响的初步分析.应用气象学报, 2005, 16(4): 461-467. http://qikan.camscma.cn/jams/ch/reader/view_abstract.aspx?file_no=20050458&flag=1 [16] Easterling D R, Peterson T C, Karl T R. On the development and use of homogenized climate datasets. J Climate, 1996, 9: 1429-1434. doi: 10.1175/1520-0442(1996)009<1429:OTDAUO>2.0.CO;2 [17] WMO. Guidelines on Climate Metadata and Homogenization. WMO/TD, 2003, 1186: 1-27.