Ren Disheng, Shen Xueshun, Xue Jishan, et al. The optimized design of stack for GRAPES's adjoint mode. J Appl Meteor Sci, 2011, 22(3): 362-366.
Citation: Ren Disheng, Shen Xueshun, Xue Jishan, et al. The optimized design of stack for GRAPES's adjoint mode. J Appl Meteor Sci, 2011, 22(3): 362-366.

The Optimized Design of Stack for GRAPES's Adjoint Mode

  • Received Date: 2010-04-27
  • Rev Recd Date: 2011-04-01
  • Publish Date: 2011-06-30
  • The four dimensional variational data assimilation system (4DVAR) of GRAPES (Global/Region Assimilation and Prediction System) can use different meteorological data from different areas of different times obtained to optimize the quality of forecast based on an initialization background. As the core of the 4DVAR, tangent mode and adjoint mode can adjust the initialization background through using the deviation of the estimate of 3DVAR and observation.When a segment of the adjoint mode is run, the initial state of corresponding nonlinear mode might be needed as input. In order to balance the disadvantage of whole storage and whole computation, a double chained stack is used to store an interim data's snap for implementing the adjoint mode. Adopting the whole storage can speed up the adjoint mode prominently, but this may lead to the relation of first in and first out (FIFO) among some data blocks, which conflicts with the configuration of the double chained stack. A nested and double chained stack is proposed based on original double chained stack, using a kid chained stack to separate the data blocks that have FIFO relations. Data block pops first must be pushes in kid chained stack, and then can be popped at any time as needed. The nested and double chained stack can meet these requirements of different data blocks, FIFO or FILO, and satisfy the requirement of adjoint mode better. The result of experiment shows these approaches can double the operational speed with 30% extra memory.
  • Fig. 1  The FIFO (First In First Out) relationship of checking-point

    Fig. 2  The schematic diagram of nested and double chained stack

    Fig. 3  An example of 6-hour 850 hPa geo-potential height forecast of GRAPES (unit: gpm)

    (a) the result before optimization, (b) the result after optimization

    Fig. 4  The dynamic memory consumption of GRAPES adjiont mode

    (a) total amount of temporary data storage before and after optimization, (b) total memory consumption of adjoint mode before and after optimization

    Table  1  The result before and after optimization

    时间 堆栈最大值/MB 程序占用总内存/MB ad_integrat执行时间/s
    优化前 约55 约200 49.79
    优化后 约110 约255 24.10
    DownLoad: Download CSV
  • [1]
    薛纪善.新世纪初我国数值天气预报的科技创新研究.应用气象学报, 2006, 17(5):601-610.
    Chen Dehui, Xue Jishan. GRAPES-CMA's New Generation of Weather and Climate Model: Scientific Design and Development Progresses. Proceedings of the 2004 Workshop on the Solution of Partial Differential Equations on the Sphere, 2004.
    Xue Jishan. Progresses of researches on numerical weather prediction in China: 1999—2002. Adv Atmos Sci, 2005, 21(3):467-474.
    张华, 薛纪善, 庄世宇. GRAPES三维变分同化系统得理想试验.气象学报, 2004, 62(1):31-41. doi:  10.11676/qxxb2004.004
    庄世宇, 薛纪善, 朱国富, 等. GRAPES全球三维变分同化系统——基本设计方案与理想试验.大气科学, 2005, 29(6):872-884.
    Xue Jishan. Development of 3DVAR for Operational Application in CMA. Proceedings of 4th WMO International Symposium on Assimilation of Observations in Meteorology and Oceanography, WMO/TD-No.1316, Geneva: WMO, 2005.
    陈德辉, 杨学胜, 胡江林, 等.多尺度通用动力模式框架的设计策略.应用气象学报, 2003, 14(4): 452-461.
    薛纪善, 陈德辉.数值预报系统GRAPES的科学设计与应用.北京:科学出版社, 2008:54-60.
    张林, 朱宗申. GRAPES模式切线性垂直扩散方案的误差分析和改进.应用气象学报, 2008, 19(2):194-200.
    陈德辉, 沈学顺.新一代数值预报系统GRAPES的研究进展.应用气象学报, 2006, 17(6):773-777.
    伍湘君, 金之雁, 陈德辉, 等.新一代数值预报模式GRAPES的并行计算方案设计与实现.计算机研究与发展, 2007, 44(3):510-515.
    伍湘君, 金之雁, 黄丽萍, 等. GRAPES模式软件框架与实现.应用气象学报, 2005, 16(4):539-546.
    黄丽萍, 伍湘君, 金之雁. GRAPES模式标准初始化方案设计与实现.应用气象学报, 2005, 16(3):374-384.
    Laurent Hascoet, Valerie Pascual. TAPENADE 2.1 User's Guide. 2004. hhtp://
    陈峰峰, 王光辉, 沈学顺, 等. Cascade插值方法在GRAPES模式中的应用.应用气象学报, 2009, 20(2):164-170.
  • 加载中
  • -->


    Figures(4)  / Tables(1)

    Article views (4047) PDF downloads(1138) Cited by()
    • Received : 2010-04-27
    • Accepted : 2011-04-01
    • Published : 2011-06-30


    DownLoad:  Full-Size Img  PowerPoint