You are not currently logged in.
Access JSTOR through your library or other institution:
If You Use a Screen ReaderThis content is available through Read Online (Free) program, which relies on page scans. Since scans are not currently available to screen readers, please contact JSTOR User Support for access. We'll provide a PDF copy for your screen reader.
Sector and Sphere: The Design and Implementation of a High-Performance Data Cloud
Yunhong Gu and Robert L. Grossman
Philosophical Transactions: Mathematical, Physical and Engineering Sciences
Vol. 367, No. 1897, Crossing Boundaries: Computational Science, e-Science and Global e-Infrastructure I. Selected Papers from the UK e-Science All Hands Meeting 2008 (Jun. 28, 2009), pp. 2429-2445
Published by: Royal Society
Stable URL: http://www.jstor.org/stable/40485591
Page Count: 17
You can always find the topics here!Topics: Slavery, Cloud computing, Datasets, Data processing, Network servers, Experimentation, Application programming interfaces, Image files, Data files, Web pages
Were these topics helpful?See something inaccurate? Let us know!
Select the topics that are inaccurate.
Since scans are not currently available to screen readers, please contact JSTOR User Support for access. We'll provide a PDF copy for your screen reader.
Preview not available
Cloud computing has demonstrated that processing very large datasets over commodity clusters can be done simply, given the right programming model and infrastructure. In this paper, we describe the design and implementation of the Sector storage cloud and the Sphere compute cloud. By contrast with the existing storage and compute clouds, Sector can manage data not only within a data centre, but also across geographically distributed data centres. Similarly, the Sphere compute cloud supports user-defined functions (UDFs) over data both within and across data centres. As a special case, MapReduce-style programming can be implemented in Sphere by using a Map UDF followed by a Reduce UDF. We describe some experimental studies comparing Sector/Sphere and Hadoop using the Terasort benchmark. In these studies, Sector is approximately twice as fast as Hadoop. Sector/Sphere is open source.
Philosophical Transactions: Mathematical, Physical and Engineering Sciences © 2009 Royal Society