Web Search For A Planet: The Google Cluster Architecture

E-Book Overview

Few Web services require as much computation per request as search engines. On average, a single query on Google reads hundreds of megabytes of data and consumes tens of billions of CPU cycles. Supporting a peak request stream of thousands of queries per second requires an infrastructure comparable in size to that of the largest supercomputer installations. Combining more than 15,000 commodity-class PCs with fault-tolerant software creates a solution that is more cost-effective than a comparable system built out of a smaller number of high-end servers.Here we present the architecture of the Google cluster, and discuss the most important factors that influence its design: energy efficiency and price-performance ratio. Energy efficiency is key at our scale of operation, as power consumption and cooling issues become significant operational factors, taxing the limits of available data center power densities.Our application affords easy parallelization: Different queries can run on different processors, and the overall index is partitioned so that a single query can use multiple processors. Consequently, peak processor performance is less important than its price/performance. As such, Google is an example of a throughput-oriented workload, and should benefit from processor architectures that offer more on-chip parallelism, such as simultaneous multithreading or on-chip multiprocessors.

E-Book Content

WEB SEARCH FOR A PLANET: THE GOOGLE CLUSTER ARCHITECTURE AMENABLE TO EXTENSIVE PARALLELIZATION, GOOGLE’S WEB SEARCH APPLICATION LETS DIFFERENT QUERIES RUN ON DIFFERENT PROCESSORS AND, BY PARTITIONING THE OVERALL INDEX, ALSO LETS A SINGLE QUERY USE MULTIPLE PROCESSORS. TO HANDLE THIS WORKLOAD, GOOGLE’S ARCHITECTURE FEATURES CLUSTERS OF MORE THAN 15,000 COMMODITYCLASS PCS WITH FAULT-TOLERANT SOFTWARE. THIS ARCHITECTURE ACHIEVES SUPERIOR PERFORMANCE AT A FRACTION OF THE COST OF A SYST
You might also like

Calculs Et Visualisation En Nombres Complexes
Authors: Testard L.    114    0


Computer Graphics And Geometric Modeling. Mathematics
Authors: Max K. Agoston    176    0


Scientific Visualization: The Visual Extraction Of Knowledge From Data
Authors: Georges-Pierre Bonneau , Thomas Ertl , Gregory M. Nielson    152    0


Beginning Python
Authors: Peter C. Norton , Alex Samuel , Dave Aitel , Eric Foster-Johnson , Leonard Richardson , Jason Diamond , Aleatha Parker , Michael Roberts    197    0


Linear Programming: Introduction
Authors: Dantzig G. , Thapa M.    203    0


Combinatorial Optimization. Theory And Algorithms
Authors: Bernhard Korte , Jens Vygen    218    0


Tex By Topic: A Texnician's Reference
Authors: Victor Eijkhout    127    0


A Service Creation Environment Based On End To End Composition Of Web Services
Authors: Agarwal V. , Dasgupta K. , Karnik N.    152    0



Oracle 9i. Backup And Recovery Concepts
Authors: Ashdown L.    94    0