Luo Jingning, Liu Liwei. Research and implementation of remote sensing big data distributed technology. J Appl Meteor Sci, 2017, 28(5): 621-631. DOI:  10.11898/1001-7313.20170510.
Citation: Luo Jingning, Liu Liwei. Research and implementation of remote sensing big data distributed technology. J Appl Meteor Sci, 2017, 28(5): 621-631. DOI:  10.11898/1001-7313.20170510.

Research and Implementation of Remote Sensing Big Data Distributed Technology

DOI: 10.11898/1001-7313.20170510
  • Received Date: 2017-04-06
  • Rev Recd Date: 2017-07-27
  • Publish Date: 2017-09-30
  • In the past ten years, the global various digital information grows explosively, and a big data era with massive data production, sharing and application is opened. In this decade, with the development of information technology, distributed storage and computing technology get great development to deal with the explosive growth of information, and the knowledge system and technical reserves are established gradually. In China, research on big data and distributed computing is being carried out widely. For satellite remote sensing data of large volume and rapid growth, the traditional archive-callback-application cannot meet demands of data analysis and data mining in the era of big data.The traditional file-based way has many limitations, especially when used for cloud computing and in-telligent services, and it is very difficult to use. The big data grid model and distributed model is the key to solve the bottleneck, enabling real-time computing and on-demand services, and therefore it has important reference significance. It overcomes the temporal and spatial fragmentation problem, making the remote sensing data possible to be stored, calculated and applied as a whole. Based on the Hilbert curve grid hash algorithm, a distributed system containing fundamental structure of grid, time slice and physical layer is established, demonstrating excellent parallel read-write performance. Hilbert hash algorithm has stable discrete degree, which is the key for the grid model to maintain spatial correlation and to map two-dimensional space to one-dimensional sequence.Using the distributed system, instead of traditional way of data file organization and management, properties flexible and intuitive data acquisition are realized. Users can truly experience a new way of what you see is what you get and what you get is what you need to get. The future system which is based on the data model, will greatly increase the work efficiency, make the focus from the data itself to data applications. Internet-based cloud computing grid cell calculation can be realized, and the extension ability of the whole system can achieve linear growth, based on the general hardware and software platform. The implementation of this system will greatly improve the work efficiency, completing high-speed parallel data reading and writing, making on-demand data application more smoothly.
  • Fig. 1  The basic structure of big data grid model

    Fig. 2  Hilbert curve fractal(from reference [22])

    Fig. 3  Dispersion analysis of three subspace

    (a)row order curve, (b)Z-order curve, (c)Hilbert curve

    Fig. 4  Satellite remote sensing distributed data system architecture

    Fig. 5  Data and control flow for grid written

    Fig. 6  Different amount of data read performance test

    Fig. 7  Read performance test of node

    Fig. 8  Interactive interface of data retrieval and processing system based on spatial correlation

    Fig. 9  Interactive interface of satellite remote sensing database and data service system in Inner Mongolia

    Table  1  Results of dispersion calculation

    散列曲线子空间 行序曲线 Z曲线 希尔伯特曲线
    网格数量(NGrid) 9 8 8 9 8 8 9 8 8
    序列段数量(Msep) 3 8 1 3 8 1 2 4 4
    离散度(D) 0.33 1 0.125 0.33 1 0.125 0.22 0.5 0.5
    DownLoad: Download CSV
  • [1]
    Viktor M S, Kenneth C.Big Data:A Revolution That Will Transform How We Live.Work and Think.Houghton Mifflin Harcourt Publishing Company, 2013:9-23.
    [2]
    周峥嵘, 王琤, 何文春.分布式气象元数据同步系统的探索研究.应用气象学报, 2010, 21(1):121-128. doi:  10.11898/1001-7313.20100117
    [3]
    何文春, 高峰, 许艳, 等.气候监测业务专题数据服务系统的设计与实现.应用气象学报, 2012, 23(5):624-630. doi:  10.11898/1001-7313.20120514
    [4]
    Ian F, Yong Z, Ioan R.Cloud Computing and Grid Computing 360-Degree Compared.Grid Computing Environments Workshop, 2008.2008:1-10. http://www.academia.edu/2330905/Cloud_Computing_and_Grid_Computing_360_Degree_Compared
    [5]
    Michael A, Armando F, Rean G.Above the clouds:A berkeley view of cloud computing.Communications of the ACM, 2010, 53(4):50-58. doi:  10.1145/1721654
    [6]
    Sanjay G, Howard G, Leung S T.The Google File System.SOSP 03 Proceedings of the Nineteenth ACM symposium on Operating Systems Principles, 2003:29-43. https://www.cs.umd.edu/class/spring2011/cmsc818k/Lectures/gfs-hdfs.pdf
    [7]
    Fay C, Jeffrey D, Sanjay G.Bigtable:A distributed storage system for structured data.ACM Transactions on Computer Systems, 2008, 26(2):205-218. doi:  10.1145/1365815.1365816
    [8]
    Jeffrey D, Sanjay G.MapReduce:Simplied data processing on large clusters.Communications of the ACM-50th Anniversary Issue(1958-2008), 2008, 51(1):107-113.
    [9]
    覃雄派, 王会举, 杜小勇, 等.大数据分析——RDBMS与MapReduce的竞争与共生.软件学报, 2010, 23(1):32-45. http://www.cnki.com.cn/Article/CJFDTOTAL-GPRJ201307036.htm
    [10]
    陈康, 郑纬民.云计算:系统实例与研究现状.软件学报, 2009, 20(5):1337-1348. http://www.cnki.com.cn/Article/CJFDTOTAL-RJXB200905028.htm
    [11]
    王意洁, 孙伟东, 周松, 等.云计算环境下的分布存储关键技术.软件学报, 2012, 23(4):962-986. http://www.cnki.com.cn/Article/CJFDTOTAL-DZRU201618163.htm
    [12]
    孙勇, 林菲, 王宝军.面向云计算的键值型分布式存储系统研究.电子学报, 2013, 41(7):1406-1411. http://www.cnki.com.cn/Article/CJFDTOTAL-DZXU201307026.htm
    [13]
    吴吉义, 傅建庆, 平玲娣, 等.一种对等结构的云存储系统研究.电子学报, 2011, 38(5):1100-1107. http://www.cnki.com.cn/Article/CJFDTOTAL-DZXU201105020.htm
    [14]
    李永生, 曾沁, 徐美红, 等.基于Hadoop的数值预报产品服务平台设计与实现.应用气象学报, 2015, 26(1):122-128. http://qikan.camscma.cn/jams/ch/reader/view_abstract.aspx?file_no=20150113&flag=1
    [15]
    高峰, 王国复, 孙超, 等.后台管理模式在数据共享平台中的应用.应用气象学报, 2011, 22(3):367-374. doi:  10.11898/1001-7313.20110314
    [16]
    金之雁, 王鼎兴.一种在异构系统中实现负载平衡的方法.应用气象学报, 2003, 14(4):410-418. http://qikan.camscma.cn/jams/ch/reader/view_abstract.aspx?file_no=20030451&flag=1
    [17]
    陈钻, 李海胜.新型台风海洋网络气象信息系统的设计与实现.应用气象学报, 2012, 23(2):245-250. doi:  10.11898/1001-7313.20120214
    [18]
    赵立成, 王素娟, 施进明.国家卫星气象中心信息共享体制研究与技术实现.应用气象学报, 2002, 13(5):627-632. http://qikan.camscma.cn/jams/ch/reader/view_abstract.aspx?file_no=20020580&flag=1
    [19]
    钱建梅, 郑旭东.国家卫星气象中心气象卫星资料存档系统.应用气象学报, 2003, 14(6):756-762. http://qikan.camscma.cn/jams/ch/reader/view_abstract.aspx?file_no=20030695&flag=1
    [20]
    李德仁.论广义空间信息网格和狭义空间信息网格.遥感学报, 2005, 9(5):513-519. http://www.cnki.com.cn/Article/CJFDTOTAL-YGXB200505000.htm
    [21]
    Reinhold B.Linear Algebra and Projective Geometry.New York:Academic Press, 1952:39-70.
    [22]
    David H.Ueber die Reellen Züge Algebraischer Curven.Mathematische Annalen, 1891, 38(1):115-138. doi:  10.1007/BF01212696
    [23]
    Duvall P, Keesling J, Vince A.The Hausdorff dimension of the boundary of a self-similar tile.Journal of the London Mathematical Society, 2000, 61(3):748-760. doi:  10.1112/jlms.2000.61.issue-3
    [24]
    Mandelbrot B B.How long is the coast of Britain? Statistical self-similarity and fractional dimension.Science, 1967, 156:636-638. doi:  10.1126/science.156.3775.636
    [25]
    刘润涛, 陈琳琳, 田广悦.Z曲线网格划分的最近邻查询.计算机工程与应用, 2013, 49(22):123-126. doi:  10.3778/j.issn.1002-8331.1208-0249
  • 加载中
  • -->

Catalog

    Figures(9)  / Tables(1)

    Article views (3073) PDF downloads(554) Cited by()
    • Received : 2017-04-06
    • Accepted : 2017-07-27
    • Published : 2017-09-30

    /

    DownLoad:  Full-Size Img  PowerPoint