Design and Establishment of a Nationwide Meteorological Computational Grid
-
Abstract: Weather forecasting is a major application area of high-performance computing (HPC). Running meteorological numerical models requires substantial HPC resources to keep numerical weather prediction systems timely. However, HPC resources and support capabilities are unevenly distributed geographically within the China Meteorological Administration (CMA). Local meteorological bureaus lag well behind national institutions, both in the HPC resources they hold and in their application development capabilities. HPC in meteorology has features that match the strengths of grid computing, such as computational intensity and distributed, cooperative access to massive data. To meet the needs of local and national institutions in CMA for resource integration, sharing and management, a design scheme for a nationwide meteorological computational Grid is proposed. Grid technology is used to present the heterogeneous computing resources of the meteorological department as abstract virtual resources, shielding users from the heterogeneity of the underlying physical systems. Through orderly management and collaborative computing, the service platform aggregates resources effectively and puts them to comprehensive use. The design scheme employs a three-level layout of national, regional and provincial nodes, constituting a distributed, tightly coupled network computing sharing system. The nodes are interconnected by the wide-area meteorological broadband network.
On top of the integrated resource platform, functional-layer modules are set up for resource management, application services and user interfaces. The software modules of the Grid are developed and implemented with key technologies such as UNICORE. Six geographically distributed nodes have been established. UNICORE gateway services are deployed on the meteorological broadband network and interact with one another via grid communication protocols. Nine heterogeneous high-performance computers at different sites have been integrated to form a meteorological computing resource pool. The nationwide meteorological computational Grid provides two types of resource-sharing service: grid middleware and customized operations. Three model application systems have been set up through customized operations. Since entering operational running, the Grid has disseminated numerical weather prediction products to users in remote or resource-poor areas, supporting disastrous weather prediction services and meteorological support for major events, and contributing to local disaster prevention and mitigation efforts.
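The three-level node layout and the pooled computing resources described in the abstract can be pictured with a small data model. The Python sketch below is illustrative only. The class names, node names, machine labels and core counts are hypothetical placeholders rather than the actual CMA configuration, and the selection rule is a simple smallest-fit choice assumed for the example. It does not use or reproduce UNICORE interfaces.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional, Tuple


class Tier(Enum):
    """The three node levels named in the design scheme."""
    NATIONAL = 1
    REGIONAL = 2
    PROVINCIAL = 3


@dataclass
class Computer:
    name: str    # hypothetical machine label
    cores: int   # hypothetical capacity figure


@dataclass
class GridNode:
    name: str    # hypothetical node label
    tier: Tier
    computers: List[Computer] = field(default_factory=list)


def build_pool(nodes: List[GridNode]) -> List[Tuple[str, Computer]]:
    """Flatten every node's machines into one virtual resource pool."""
    return [(node.name, c) for node in nodes for c in node.computers]


def pick_computer(pool: List[Tuple[str, Computer]],
                  cores_needed: int) -> Optional[Tuple[str, Computer]]:
    """Return the smallest machine that still fits the request, or None."""
    candidates = [(n, c) for n, c in pool if c.cores >= cores_needed]
    return min(candidates, key=lambda nc: nc[1].cores, default=None)


# Placeholder layout: one node per tier, with made-up machines.
nodes = [
    GridNode("national-node", Tier.NATIONAL,
             [Computer("hpc-a", 1024), Computer("hpc-b", 512)]),
    GridNode("regional-node", Tier.REGIONAL, [Computer("hpc-c", 256)]),
    GridNode("provincial-node", Tier.PROVINCIAL, []),
]
pool = build_pool(nodes)
choice = pick_computer(pool, cores_needed=300)
```

A provincial node with no machines of its own can still submit work here, because the request is matched against the whole pool rather than against local hardware. This is the resource-sharing idea the abstract describes, reduced to its simplest form.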
-
Table 1 High-performance computer systems aggregated on the nationwide meteorological computational Grid
Table 2 Resource-sharing service modes