<?xml version="1.0" encoding="utf-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: How Google processes 20 petabytes of data each day</title>
	<atom:link href="http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/feed/" rel="self" type="application/rss+xml" />
	<link>http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/</link>
	<description>EBB is the ebiquity research group\\\'s blog at the University of Maryland, Baltimore County (UMBC).  We focus on technologies that facilitate the design, implementation and control of distributed, intelligent information systems -- mobile and pervasive computing, ad hoc networking, multiagent systems, knowledge representation and reasoning, and the semantic web.  As the tides of technology ebb and flow, we hope the good ideas wash up on our beach and the bad ones drift back out to sea.</description>
	<lastBuildDate>Thu, 19 Nov 2009 00:39:45 -0500</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Smart Grid Data About to Swamp Utilities</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/comment-page-1/#comment-31152</link>
		<dc:creator>Smart Grid Data About to Swamp Utilities</dc:creator>
		<pubDate>Mon, 12 Oct 2009 22:14:41 +0000</pubDate>
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/#comment-31152</guid>
		<description>[...] updates energy info every 15 minutes can deliver 400 MB per smart meter per year. For perspective, Google processes about 20 PB of data per day, and 1 PB is equivalent to the amount of data contained in 20 million [...]</description>
		<content:encoded><![CDATA[<p>[...] updates energy info every 15 minutes can deliver 400 MB per smart meter per year. For perspective, Google processes about 20 PB of data per day, and 1 PB is equivalent to the amount of data contained in 20 million [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kevin P. Rice</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/comment-page-1/#comment-29223</link>
		<dc:creator>Kevin P. Rice</dc:creator>
		<pubDate>Fri, 24 Apr 2009 01:15:22 +0000</pubDate>
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/#comment-29223</guid>
		<description>correction: bits</description>
		<content:encoded><![CDATA[<p>correction: bits</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kevin P. Rice</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/comment-page-1/#comment-29222</link>
		<dc:creator>Kevin P. Rice</dc:creator>
		<pubDate>Fri, 24 Apr 2009 01:14:56 +0000</pubDate>
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/#comment-29222</guid>
		<description>By the way... a byte is not necessarily eight bites (IEEE 1541-2002). We should be talking about octets! LOL!</description>
		<content:encoded><![CDATA[<p>By the way&#8230; a byte is not necessarily eight bites (IEEE 1541-2002). We should be talking about octets! LOL!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kevin P. Rice</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/comment-page-1/#comment-29221</link>
		<dc:creator>Kevin P. Rice</dc:creator>
		<pubDate>Fri, 24 Apr 2009 01:12:58 +0000</pubDate>
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/#comment-29221</guid>
		<description>ISO/IEC 80000-13:2008 has been adopted by law in some countries. Therefore,

20 PB (petabytes) = 20,000,000,000,000,000 bytes
20 PiB (pebibytes) = 22,517,998,136,852,480 bytes

If you feel the difference is not worth discussing, maybe you&#039;d like to loan the excess out of your bandwidth?</description>
		<content:encoded><![CDATA[<p>ISO/IEC 80000-13:2008 has been adopted by law in some countries. Therefore,</p>
<p>20 PB (petabytes) = 20,000,000,000,000,000 bytes<br />
20 PiB (pebibytes) = 22,517,998,136,852,480 bytes</p>
<p>If you feel the difference is not worth discussing, maybe you&#8217;d like to loan the excess out of your bandwidth?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: RpgFan</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/comment-page-1/#comment-28397</link>
		<dc:creator>RpgFan</dc:creator>
		<pubDate>Thu, 26 Feb 2009 23:54:17 +0000</pubDate>
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/#comment-28397</guid>
		<description>4GB is actually correct. 4GB = 4000MB = 4000000KB.
However, Windows (and other operating systems) use base 2 since computers use base 2.  Why HDD manufacturers use the SI system while everybody else uses the binary system is beyond me.  Perhaps it is due to how HDDs are designed?

I&#039;m not big on hardware though, so I&#039;m not sure if pendrives, SD cards, flash cards and other such media as well as RAM use the SI system or the binary system.

Still, the part of ISO/IEC 80000-13 dealing with binary prefixes may be standardized, so I wonder if we should start using the binary prefixes now...  After all, IPv6 hasn&#039;t really caught on, despite the fact that it is such a wonderful solution.  Maybe binary prefixes will go the same way.  Plenty of people who implement the technology, but few people who actually adopt the usage of it...  :(

As for the number &quot;20 petabytes&quot;, you really must wonder whether it was 20 &quot;petabytes&quot; or 20 &quot;pebibytes&quot;.  :P</description>
		<content:encoded><![CDATA[<p>4GB is actually correct. 4GB = 4000MB = 4000000KB.<br />
However, Windows (and other operating systems) use base 2 since computers use base 2.  Why HDD manufacturers use the SI system while everybody else uses the binary system is beyond me.  Perhaps it is due to how HDDs are designed?</p>
<p>I&#8217;m not big on hardware though, so I&#8217;m not sure if pendrives, SD cards, flash cards and other such media as well as RAM use the SI system or the binary system.</p>
<p>Still, the part of ISO/IEC 80000-13 dealing with binary prefixes may be standardized, so I wonder if we should start using the binary prefixes now&#8230;  After all, IPv6 hasn&#8217;t really caught on, despite the fact that it is such a wonderful solution.  Maybe binary prefixes will go the same way.  Plenty of people who implement the technology, but few people who actually adopt the usage of it&#8230;  <img src='http://ebiquity.umbc.edu/blogger/wp-includes/images/smilies/icon_sad.gif' alt=':(' class='wp-smiley' /> </p>
<p>As for the number &#8220;20 petabytes&#8221;, you really must wonder whether it was 20 &#8220;petabytes&#8221; or 20 &#8220;pebibytes&#8221;.  <img src='http://ebiquity.umbc.edu/blogger/wp-includes/images/smilies/icon_razz.gif' alt=':P' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brandon Hansen</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/comment-page-1/#comment-28380</link>
		<dc:creator>Brandon Hansen</dc:creator>
		<pubDate>Thu, 26 Feb 2009 17:51:59 +0000</pubDate>
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/#comment-28380</guid>
		<description>I just read the other day that Google is starting to keep their indexing in RAM, which is stored across 1,000 servers.  That is insane.

They are definitely doing some crazy stuff over there.</description>
		<content:encoded><![CDATA[<p>I just read the other day that Google is starting to keep their indexing in RAM, which is stored across 1,000 servers.  That is insane.</p>
<p>They are definitely doing some crazy stuff over there.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: omar fidel</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/comment-page-1/#comment-18127</link>
		<dc:creator>omar fidel</dc:creator>
		<pubDate>Sun, 02 Mar 2008 21:55:52 +0000</pubDate>
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/#comment-18127</guid>
		<description>the wasted electricity to run these amusements must cost millions, when will people come to terms with efficiency. If the internet becomes the new TV, will it be as mindless, how can it not.  Consumption of mass media includes infotainment, music and visual popular culture. Use sparingly, consume less and do more. Afterall everything is an abstraction from something, find the source.</description>
		<content:encoded><![CDATA[<p>the wasted electricity to run these amusements must cost millions, when will people come to terms with efficiency. If the internet becomes the new TV, will it be as mindless, how can it not.  Consumption of mass media includes infotainment, music and visual popular culture. Use sparingly, consume less and do more. Afterall everything is an abstraction from something, find the source.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nick B</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/comment-page-1/#comment-17824</link>
		<dc:creator>Nick B</dc:creator>
		<pubDate>Mon, 18 Feb 2008 23:52:22 +0000</pubDate>
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/#comment-17824</guid>
		<description>@Simon: A petabyte can be described as 10^15 bytes or 2^50 bytes, depending on your standard... so obviously 10*2^51 = 20 * 2^50 = 20 petabytes in that particular notation.

Some light reading on the topic:
http://en.wikipedia.org/wiki/Binary_prefix
http://en.wikipedia.org/wiki/Petabyte

And this is how I finally realised why my new 4GB iPod only holds 3.73GB according to Windows... Apple describes it in base 10 (so 4 * 10^9 bytes) and Windows describes it in base 2 (about 3.73 * 2^30).  I feel cheated :(</description>
		<content:encoded><![CDATA[<p>@Simon: A petabyte can be described as 10^15 bytes or 2^50 bytes, depending on your standard&#8230; so obviously 10*2^51 = 20 * 2^50 = 20 petabytes in that particular notation.</p>
<p>Some light reading on the topic:<br />
<a href="http://en.wikipedia.org/wiki/Binary_prefix" rel="nofollow">http://en.wikipedia.org/wiki/Binary_prefix</a><br />
<a href="http://en.wikipedia.org/wiki/Petabyte" rel="nofollow">http://en.wikipedia.org/wiki/Petabyte</a></p>
<p>And this is how I finally realised why my new 4GB iPod only holds 3.73GB according to Windows&#8230; Apple describes it in base 10 (so 4 * 10^9 bytes) and Windows describes it in base 2 (about 3.73 * 2^30).  I feel cheated <img src='http://ebiquity.umbc.edu/blogger/wp-includes/images/smilies/icon_sad.gif' alt=':(' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Akshay Java</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/comment-page-1/#comment-16500</link>
		<dc:creator>Akshay Java</dc:creator>
		<pubDate>Wed, 09 Jan 2008 23:13:20 +0000</pubDate>
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/#comment-16500</guid>
		<description>I have been looking into MapReduce lately and I found Hadoop, which is an open source implementation  lucene.apache.org/hadoop/
There is also an interesting article on how to run MapReduce using Amazon EC2
http://wiki.apache.org/lucene-hadoop/AmazonEC2</description>
		<content:encoded><![CDATA[<p>I have been looking into MapReduce lately and I found Hadoop, which is an open source implementation  lucene.apache.org/hadoop/<br />
There is also an interesting article on how to run MapReduce using Amazon EC2<br />
<a href="http://wiki.apache.org/lucene-hadoop/AmazonEC2" rel="nofollow">http://wiki.apache.org/lucene-hadoop/AmazonEC2</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Simon Reinhardt</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/comment-page-1/#comment-16491</link>
		<dc:creator>Simon Reinhardt</dc:creator>
		<pubDate>Wed, 09 Jan 2008 17:21:53 +0000</pubDate>
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/2008/01/09/how-google-processes-20-petabytes-of-data-each-day/#comment-16491</guid>
		<description>&gt;&gt; If big numbers numb your mind, 20 petabytes is 20,000,000,000,000,000 bytes (or 22,517,998,136,852,480 for the obsessive-compulsives among us)

Why would it be 2^51 * 10? Peta means 10^15 or 1 000 000 000 000 000 so 20 000 000 000 000 000 is correct.

Nonetheless, very impressive. :-)</description>
		<content:encoded><![CDATA[<p>&gt;&gt; If big numbers numb your mind, 20 petabytes is 20,000,000,000,000,000 bytes (or 22,517,998,136,852,480 for the obsessive-compulsives among us)</p>
<p>Why would it be 2^51 * 10? Peta means 10^15 or 1 000 000 000 000 000 so 20 000 000 000 000 000 is correct.</p>
<p>Nonetheless, very impressive. <img src='http://ebiquity.umbc.edu/blogger/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
</channel>
</rss>
