Google’s Redundant, Fault-Tolerant System Worked with Cheap, Low-Quality, Failure-Prone Equipment

(p. 183) Google was a tough client for Exodus; no company had ever jammed so many servers into so small an area. The typical practice was to put between five and ten servers on a rack; Google managed to get eighty servers on each of its racks. The racks were so closely arranged that it was difficult for a human being to squeeze into the aisle between them. To get an extra rack in, Google had to get Exodus to temporarily remove the side wall of the cage. “The data centers had never worried about how much power and AC went into each cage, because it was never close to being maxed out,” says Reese. “Well, we completely maxed out. It was on an order of magnitude of a small suburban neighborhood.” Exodus had to scramble to install heavier circuitry. Its air-conditioning was also overwhelmed, and the colo bought a portable AC truck. They drove the eighteen-wheeler up to the colo, punched three holes in the wall, and pumped cold air into Google’s cage through PVC pipes.
. . .
The key to Google’s efficiency was buying low-quality equipment dirt cheap and applying brainpower to work around the inevitably high failure rate. It was an outgrowth of Google’s earliest days, when Page and Brin had built a server housed by Lego blocks. “Larry and Sergey proposed that we design and build our own servers as cheaply as we can– massive numbers of servers connected to a high-speed network,” says Reese. The conventional wisdom was that an equipment failure should be regarded as, well, a failure. Generally the server failure rate was between 4 and 10 percent. To keep the failures at the lower end of the range, technology companies paid for high-end equipment from Sun Microsystems or EMC. “Our idea was completely opposite,” says Reese. “We’re going to build hundreds and thousands of cheap servers knowing from the get-go that a certain percentage, maybe 10 percent, are going to fail.” Google’s first CIO, Douglas Merrill, once noted that the disk drives Google purchased were “poorer quality than you would put into your kid’s computer at home.”
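A back-of-the-envelope calculation makes the bet concrete. The numbers below are illustrative assumptions built around the 4 and 10 percent figures quoted above, not data from the book: if any one cheap machine is unavailable about 10 percent of the time but every piece of data lives on three machines, the chance that all copies are down at once becomes far smaller than the chance of losing a single premium server.

```python
# Illustrative arithmetic only -- the failure probabilities and replica count
# are assumptions for the example, not figures from the book.
cheap_failure_rate = 0.10    # assume ~10% of cheap servers are down at any moment
premium_failure_rate = 0.04  # assume the low end quoted for high-end gear
replicas = 3                 # assume each piece of data is kept on three machines

# Treating failures as independent, data is unreachable only if every copy is down.
print(f"one premium server, no copies : {premium_failure_rate:.4f}")          # 0.0400
print(f"three cheap copies, all down  : {cheap_failure_rate**replicas:.4f}")  # 0.0010
```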
(p. 184) But Google designed around the flaws. “We built capabilities into the software, the hardware, and the network– the way we hook them up, the load balancing, and so on– to build in redundancy, to make the system fault-tolerant,” says Reese. The Google File System, written by Jeff Dean and Sanjay Ghemawat, was invaluable in this process: it was designed to manage failure by “sharding” data, distributing it to multiple servers. If Google search called for certain information at one server and didn’t get a reply after a couple of milliseconds, there were two other Google servers that could fulfill the request.
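The retry-on-another-replica behavior described here can be sketched in a few lines. The snippet below is a rough illustration under stated assumptions, not the Google File System’s actual code: the server names, the two-millisecond reply budget, and the fetch_from_replicas and lookup helpers are all hypothetical.

```python
# Minimal sketch of "ask another copy if the first server is slow or dead".
# Everything here (names, the 2 ms budget, the toy lookup) is an assumption
# made for illustration, not Google's implementation.
import time
from concurrent.futures import ThreadPoolExecutor, TimeoutError as FutureTimeout

REPLY_BUDGET_S = 0.002  # give each replica roughly 2 ms before moving on


def fetch_from_replicas(shard_id, replicas, lookup):
    """Return the first reply for shard_id, trying each replica in turn."""
    pool = ThreadPoolExecutor(max_workers=len(replicas))
    try:
        for address in replicas:
            pending = pool.submit(lookup, address, shard_id)
            try:
                return pending.result(timeout=REPLY_BUDGET_S)
            except FutureTimeout:
                continue  # this copy is slow or dead -- try another server
        raise RuntimeError(f"no replica of {shard_id} answered in time")
    finally:
        pool.shutdown(wait=False, cancel_futures=True)


# Toy demonstration: the first "server" hangs, the others answer instantly.
def lookup(address, shard_id):
    if address == "rack1-server07":
        time.sleep(1.0)  # simulate an overloaded or failed machine
    return f"{shard_id} served from {address}"


print(fetch_from_replicas(
    "index-shard-42",
    ["rack1-server07", "rack2-server31", "rack5-server02"],
    lookup,
))
```

The point of the pattern is that no single machine has to be reliable: a slow or dead server simply means the request moves on to another copy of the shard.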

Source:
Levy, Steven. In the Plex: How Google Thinks, Works, and Shapes Our Lives. New York: Simon & Schuster, 2011.
(Note: ellipsis added.)
