<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>UMBC ebiquity &#187; Database</title>
	<atom:link href="http://ebiquity.umbc.edu/blogger/category/database/feed/" rel="self" type="application/rss+xml" />
	<link>http://ebiquity.umbc.edu/blogger</link>
	<description>EBB is the ebiquity research group's blog at the University of Maryland, Baltimore County (UMBC).  We focus on technologies that facilitate the design, implementation and control of distributed, intelligent information systems -- mobile and pervasive computing, ad hoc networking, multiagent systems, knowledge representation and reasoning, and the semantic web.  As the tides of technology ebb and flow, we hope the good ideas wash up on our beach and the bad ones drift back out to sea.</description>
	<lastBuildDate>Mon, 30 Jan 2012 02:42:30 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Free online courses on AI, databases and machine learning</title>
		<link>http://ebiquity.umbc.edu/blogger/2011/08/16/free-online-courses-on-ai-databases-and-machine-learning/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2011/08/16/free-online-courses-on-ai-databases-and-machine-learning/#comments</comments>
		<pubDate>Tue, 16 Aug 2011 05:32:13 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[AI]]></category>
		<category><![CDATA[CS]]></category>
		<category><![CDATA[Database]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Social media]]></category>
		<category><![CDATA[Web]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=4116</guid>
		<description><![CDATA[Stanford is experimenting with an interesting idea &#8212; offering some of their most popular undergraduate computer science courses online for free and simultaneously with their regular offerings. An AI course was announced several weeks ago and now there are similar offerings for databases and machine learning. These are taught by first rate instructors (who are [...]]]></description>
			<content:encoded><![CDATA[<p>Stanford is experimenting with an interesting <a href="http://www.nytimes.com/2011/08/16/science/16stanford.html">idea</a> &#8212; offering some of their most popular undergraduate computer science courses online for free and simultaneously with their regular offerings.  An AI course was <a href="http://my.umbc.edu/groups/csee/media/1200">announced</a> several weeks ago and now there are similar offerings for databases and machine learning.  These are taught by first rate instructors (who are also top researchers!) and are the same courses that Stanford students take.</p>
<ul>
<li>&#8220;A bold experiment in distributed education, <a href="http://www.ai-class.com/">&#8220;Introduction to Artificial Intelligence&#8221;</a> will be offered free and online to students worldwide during the fall of 2011. The course will include feedback on progress and a statement of accomplishment. Taught by Sebastian Thrun and Peter Norvig, the curriculum draws from that used in Stanford&#8217;s introductory Artificial Intelligence course. The instructors will offer similar materials, assignments, and exams.&#8221;</li>
<li>&#8220;A bold experiment in distributed education, <a href="http://www.db-class.org/">&#8220;Introduction to Databases&#8221;</a> will be offered free and online to students worldwide during the fall of 2011. Students will have access to lecture videos, receive regular feedback on progress, and receive answers to questions. When you successfully complete this class, you will also receive a statement of accomplishment. Taught by Professor Jennifer Widom, the curriculum draws from Stanford&#8217;s popular Introduction to Databases course.&#8221;</li>
<li>&#8220;A bold experiment in distributed education, <a href="http://www.ml-class.org/">&#8220;Machine Learning&#8221;</a> will be offered free and online to students worldwide during the fall of 2011. Students will have access to lecture videos, lecture notes, receive regular feedback on progress, and receive answers to questions. When you successfully complete the class, you will also receive a statement of accomplishment. Taught by Professor Andrew Ng, the curriculum draws from Stanford&#8217;s popular Machine Learning course.&#8221;</li>
</ul>
<p>If successful, this might be a game changer.  Two weeks after the online AI course was announced, 56,000 students had signed up!  The approach might work for many disciplines, not just CS. The <a href="http://www.khanacademy.org/">Khan Academy</a> is a related effort.</p>
<p>Universities should keep an eye on these efforts and think about how to adapt if they are successful.  Most of our students will probably benefit from taking our traditional courses.  If so, we should be able to explain the benefits of taking them (and make sure we deliver those benefits).  At the same time, we may want to leverage the online material from these courses in a synergistic way.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2011/08/16/free-online-courses-on-ai-databases-and-machine-learning/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Google acquires Metaweb and Freebase</title>
		<link>http://ebiquity.umbc.edu/blogger/2010/07/16/google-acquires-metaweb-and-freebase/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2010/07/16/google-acquires-metaweb-and-freebase/#comments</comments>
		<pubDate>Fri, 16 Jul 2010 19:30:34 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[Database]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[sEARCH]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Social media]]></category>
		<category><![CDATA[Web]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=3058</guid>
		<description><![CDATA[Google announced today that it has acquired Metaweb, the company behind Freebase &#8212; a free, semantic database of &#8220;over 12 million people, places, and things in the world.&#8221; This is from their announcement on the Official Google blog: &#8220;Over time we’ve improved search by deepening our understanding of queries and web pages. The web isn’t [...]]]></description>
			<content:encoded><![CDATA[<p>Google <a href="http://googleblog.blogspot.com/2010/07/deeper-understanding-with-metaweb.html">announced</a> today that it has acquired <a href="http://www.metaweb.com/">Metaweb</a>, the company behind <a href="http://www.freebase.com/">Freebase</a> &#8212; a free, semantic database of &#8220;over 12 million people, places, and things in the world.&#8221; This is from their <a href="http://googleblog.blogspot.com/2010/07/deeper-understanding-with-metaweb.html">announcement</a> on the Official Google blog:</p>
<blockquote><p> &#8220;Over time we’ve improved search by deepening our understanding of queries and web pages. The web isn’t merely words — it’s information about things in the real world, and understanding the relationships between real-world entities can help us deliver relevant information more quickly. &#8230;  With efforts like <a href="http://googleblog.blogspot.com/2009/05/more-search-options-and-other-updates.html">rich snippets</a> and the <a href="http://googleblog.blogspot.com/2010/05/understanding-web-to-find-short-answers.html">search answers feature</a>, we’re just beginning to apply our understanding of the web to make search better. Type [barack obama birthday] in the search box and see the answer right at the top of the page. Or search for [events in San Jose] and see a list of specific events and dates. We can offer this kind of experience because we understand facts about real people and real events out in the world. But what about [colleges on the west coast with tuition under $30,000] or [actors over 40 who have won at least one oscar]?  These are hard questions, and we’ve acquired Metaweb because we believe working together we’ll be able to provide better answers.&#8221;
</p></blockquote>
<p>In their announcement, Google promises to continue to maintain Freebase &#8220;as a free and open database for the world&#8221; and invites other web companies to use and contribute to it.</p>
<p>Freebase is a system very much in the linked open data spirit, even though RDF is not its native representation.  Its content is available as RDF and there are many links that bind it to the LOD cloud.  Moreover, Freebase has a very good wiki-like interface allowing people to upload, extend and edit both its schema and data.</p>
<p>Here&#8217;s a video on the concepts behind Metaweb which are, of course, also those underlying the Semantic Web.  What&#8217;s the difference?  I&#8217;d say a combination of representational details and a centralized (Metaweb) vs. distributed (Semantic Web) architecture.</p>
<p><center><object width="500" height="301"><param name="movie" value="http://www.youtube.com/v/TJfrNo3Z-DU&amp;hl=en_US&amp;fs=1"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/TJfrNo3Z-DU&amp;hl=en_US&amp;fs=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="500" height="301"></embed></object></center></p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2010/07/16/google-acquires-metaweb-and-freebase/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>NOSQL: distributed key-value data stores</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/07/02/nosql-distributed-key-value-data-stores/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/07/02/nosql-distributed-key-value-data-stores/#comments</comments>
		<pubDate>Thu, 02 Jul 2009 14:17:10 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[Database]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Web]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=2081</guid>
		<description><![CDATA[ComputerWorld has an article on the &#8220;nosql&#8221; movement and a recent nosql meetup held in San Francisco, No to SQL? Anti-database movement gains steam. Nosql systems are distributed, non-relational data stores that typically use a simple key-value approach to indexing and retrieving data and use a simple procedural query API rather than a sophisticated declarative [...]]]></description>
			<content:encoded><![CDATA[<p>ComputerWorld has an article on the &#8220;nosql&#8221; movement and a recent <a href="http://nosql.eventbrite.com/">nosql meetup</a> held in San Francisco, <a href="http://www.computerworld.com/action/article.do?command=viewArticleBasic&#038;articleId=9135086">No to SQL? Anti-database movement gains steam</a>.  Nosql systems are distributed, non-relational data stores that typically use a simple <a href="http://en.wikipedia.org/wiki/Associative_array">key-value</a> approach to indexing and retrieving data and use a simple procedural query API rather than a sophisticated declarative query language.</p>
<blockquote><p>
&#8220;The inaugural get-together of the burgeoning NoSQL community crammed 150 attendees into a meeting room at CBS Interactive. Like the Patriots, who rebelled against Britain&#8217;s heavy taxes, NoSQLers came to share how they had overthrown the tyranny of slow, expensive relational databases in favor of more efficient and cheaper ways of managing data.</p>
<p>&#8220;Relational databases give you too much. They force you to twist your object data to fit a RDBMS [relational database management system],&#8221; said Jon Travis, principal engineer at Java toolmaker SpringSource, one of the 10 presenters at the NoSQL confab (PDF). NoSQL-based alternatives &#8220;just give you what you need,&#8221; Travis said.&#8221;
</p></blockquote>
<p>There were presentations on nine different &#8216;nosql&#8217; databases: Voldemort, Cassandra, Dynomite, HBase, Hypertable, CouchDB, VPork, MongoDB, as well as general presentations by Google&#8217;s Jonas Karlsson and Cloudera&#8217;s Todd Lipcon.</p>
<p>Johan Oskarsson of Last.fm wrote a <a href="http://blog.oskarsson.nu/2009/06/nosql-debrief.html">debriefing post</a> on his blog.</p>
<blockquote><p>
&#8220;The relatively young but rapidly growing &#8220;nosql&#8221; community met last Thursday in San Francisco. The idea was to give attendees a solid introduction to how distributed, non relational databases work as well as an overview of the various projects out there.&#8221;
</p></blockquote>
<p>and provides <a href="http://blog.oskarsson.nu/2009/06/nosql-debrief.html">links</a> to the presentation slides and videos.  You can also <a href="http://vimeo.com/videos/search:nosql">search for NOSQL on Vimeo</a> to get the videos.</p>
<p>I learned of this meeting on Hacker News, where you can find some interesting <a href="http://news.ycombinator.com/item?id=683807">comments</a>.</p>
<p>Of course there are many popular key-value stores that are not designed to support the highly-scalable distributed needs of many Web applications.  I found, for example, that as a persistent RDF store for rdflib, Sleepycat outperformed MySQL.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/07/02/nosql-distributed-key-value-data-stores/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Price Waterhouse Coopers bullish on the Semantic Web</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/05/29/price-waterhouse-coopers-bullish-on-the-semantic-web/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/05/29/price-waterhouse-coopers-bullish-on-the-semantic-web/#comments</comments>
		<pubDate>Fri, 29 May 2009 13:57:34 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[AI]]></category>
		<category><![CDATA[Database]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[ontology]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1913</guid>
		<description><![CDATA[Price Waterhouse Coopers is one of the largest &#8220;professional services&#8221; organizations and has always been strong on technology consulting and advice. The Spring issue of their quarterly Technology Forecast journal focuses on the Semantic Web. This is from the table of contents 04 Spinning a data Web. Semantic Web technologies could revolutionize enterprise decision making [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://en.wikipedia.org/wiki/PricewaterhouseCoopers">Price Waterhouse Coopers</a> is one of the largest &#8220;professional services&#8221; organizations and has always been strong on technology consulting and advice.  The Spring issue of their quarterly Technology Forecast journal focuses on the Semantic Web.  This is from the table of contents:</p>
<p><img align="right" src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2009/05/pwc-tech-forecast-spring-2009.jpg" alt="pwc-tech-forecast-spring-2009" title="pwc-tech-forecast-spring-2009" width="215" height="278" class="alignnone size-full wp-image-1919" /></p>
<ul>
<li><b>04 Spinning a data Web</b>. Semantic Web technologies could revolutionize enterprise decision making and information sharing. Here’s why.</li>
<li><b>20 Making Semantic Web connections</b>. Linked Data technology can change the business of enterprise data management.</li>
<li><b>16 Traversing the Giant Global Graph</b>. Tom Scott of BBC Earth describes how everyone benefits from interoperable data.</li>
<li><b>28 From folksonomies to ontologies</b>. Uche Ogbuji of Zepheira discusses how early adopters are introducing Semantic Web to the enterprise.</li>
<li><b>40 How the Semantic Web might improve cancer treatment</b>. M. D. Anderson’s Lynn Vogel explores new techniques for combining clinical and research data.</li>
<li><b>46 Semantic technologies at the ecosystem level</b>. Frank Chum of Chevron talks about the need for shared ontologies in the oil and gas industry.</li>
</ul>
<p>You can download the free 58-page report <a href="http://www.pwc.com/extweb/onlineforms.nsf/weblookup/USENGALLSTechnologyforecast:Downloadvalidatedspring2009">here</a>. You can also read a note on the issue in <a href="http://www.readwriteweb.com/archives/semantic_web_enterprise_pwc.php">ReadWriteWeb</a>, which focuses on linked data and interoperability.</p>
<blockquote><p> &#8220;A new PricewaterhouseCoopers Technology report explains how the Semantic Web and Linked Data can help enterprises manage their large scale data better. The PwC Center for Technology and Innovation team spent several months researching and analyzing the problem of data silos in enterprises &#8211; and what solutions are being developed to help with that problem. The answer, according to PwC, is Semantic Web techniques. PwC believes that the Semantic Web offers a practical way to address the problem of large-scale data integration. &#8230; &#8220;</p></blockquote>
<p>(Spotted on <a href="http://lists.w3.org/Archives/Public/public-lod/2009May/0317.html">public-lod@w3.org</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/05/29/price-waterhouse-coopers-bullish-on-the-semantic-web/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Hadoop user group for the Baltimore-DC region</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/02/08/hadoop-user-group-for-the-baltimoredc-region/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/02/08/hadoop-user-group-for-the-baltimoredc-region/#comments</comments>
		<pubDate>Sun, 08 Feb 2009 15:10:17 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[Database]]></category>
		<category><![CDATA[High performance computing]]></category>
		<category><![CDATA[MC2]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1764</guid>
		<description><![CDATA[A Hadoop User Group (HUG) has formed for the Washington DC area via meetup.com. &#8220;We&#8217;re a group of Hadoop &#038; Cloud Computing technologists / enthusiasts / curious people who discuss emerging technologies, Hadoop &#038; related software development (HBase, Hypertable, PIG, etc). Come learn from each other, meet nice people, have some food/drink.&#8221; The group defines [...]]]></description>
			<content:encoded><![CDATA[<p>A <a href="http://www.meetup.com/Hadoop-DC/">Hadoop User Group</a> (HUG) has formed for the Washington DC area via meetup.com.</p>
<blockquote><p> &#8220;We&#8217;re a group of <a href="http://en.wikipedia.org/wiki/Hadoop">Hadoop</a> &#038; <a href="http://en.wikipedia.org/wiki/Cloud_computing">Cloud Computing</a> technologists / enthusiasts / curious people who discuss emerging technologies, Hadoop &#038; related software development (<a href="http://hadoop.apache.org/hbase/">HBase</a>, <a href="http://hypertable.org/">Hypertable</a>, <a href="http://hadoop.apache.org/pig/">PIG</a>, etc). Come learn from each other, meet nice people, have some food/drink.&#8221; </p></blockquote>
<p>The group defines its geographic location as Columbia MD and its first <a href="http://www.meetup.com/Hadoop-DC/messages/boards/thread/6218422">HUG meetup</a> was held last Wednesday at the BWI Hampton Inn.  In addition to informal social interactions, it featured two presentations:</p>
<ul>
<li> Amir Youssefi from Yahoo! presented an overview of Hadoop. Amir is a member of the Cloud Computing and Data Infrastructure group at Yahoo!, and will be discussing Multi-Dataset Processing (Joins) using Hadoop and Hadoop Table.</li>
<li> Introduction to complex, fault tolerant data processing workflows using Cascading and Hadoop by Scott Godwin &#038; Bill Oley</li>
</ul>
<p>If you&#8217;re in Maryland and interested, you can join the group at <a href="http://www.meetup.com/Hadoop-DC/">meetup.com</a> and get announcements for future meetings.  It might be a good way to learn more about new software for exploiting computing clusters and cloud computing.</p>
<p>(Thanks to <a href="http://www.cpdiehl.org/">Chris Diehl</a> for alerting me to this)</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/02/08/hadoop-user-group-for-the-baltimoredc-region/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Database researchers identify hot research topics</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/08/25/database-researchers-assess-hot-research-topics/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/08/25/database-researchers-assess-hot-research-topics/#comments</comments>
		<pubDate>Mon, 25 Aug 2008 08:30:56 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[Computing Research]]></category>
		<category><![CDATA[Database]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Social media]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1601</guid>
		<description><![CDATA[Databases are a fundamental technology for most information systems and especially those based on the web. A group of senior database researchers met recently to assess the state of database research, as documented on this site. So, where did the Semantic Web fit into their vision? &#8220;In late May, 2008, a group of database researchers, architects, [...]]]></description>
			<content:encoded><![CDATA[<p>Databases are a fundamental technology for most information systems and especially those based on the web.  A group of senior database researchers met recently to assess the state of database research, as documented on this <a href="http://db.cs.berkeley.edu/claremont/">site</a>.  So, where did the Semantic Web fit into their vision?</p>
<blockquote><p> &#8220;In late May, 2008, a group of database researchers, architects, users and pundits met at the Claremont Resort in Berkeley, California to discuss the state of the research field and its impacts on practice. This was the seventh meeting of this sort in twenty years, and was distinguished by a broad consensus that we are at a turning point in the history of the field, due both to an explosion of data and usage scenarios, and to major shifts in computing hardware and platforms. Given these forces, we are at a time of opportunity for research impact, with an unusually large potential for influential results across computing, the sciences and society. This report details that discussion, and highlights the group&#8217;s consensus view of new focus areas, including new database engine architectures, declarative programming languages, the interplay of structured and unstructured data, cloud data services, and mobile and virtual worlds.&#8221;  </p></blockquote>
<p>On the site you can read the post-meeting <a href="http://db.cs.berkeley.edu/claremont/claremontreport08.pdf">report</a>, view the participants&#8217; <a href="">presentations on DB research directions</a> and talks, and <a href="http://groups.google.com/group/claremontreport">discuss</a> the report on a Google group.</p>
<p>It&#8217;s a good report with lots of interesting things in it and definitely worth reading, but I was disappointed to find that it makes <b>no mention</b> of the Semantic Web, RDF, OWL, ontologies, AI, knowledge bases, or reasoning.  Here&#8217;s a word cloud of the report (made with <a href="http://wordle.net/">wordle</a>), which provides a 10,000-foot view of its content.</p>
<p><center><br />
<a href='http://ebiquity.umbc.edu/blogger/wp-content/uploads/2008/08/claremontwordcloud1.png'><img src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2008/08/claremontwordcloud1.png" alt="word cloud generated from The Claremont Database Research Self-Assessment Meeting report" title="claremont-word-cloud" width="450" height="294" /></a><br />
</center></p>
<p>The report says that it was &#8220;surprisingly easy for the group to reach consensus on a set of research topics to highlight for investigation in coming years&#8221;.  Those topics are:</p>
<ul>
<li>Revisiting Database Engines</li>
<li>Declarative Programming for Emerging Platforms</li>
<li>The Interplay of Structured and Unstructured Data</li>
<li>Cloud Data Services</li>
<li>Mobile Applications and Virtual Worlds</li>
</ul>
<p>There is clearly overlap between the database and semantic web communities in the first three topics.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/08/25/database-researchers-assess-hot-research-topics/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Hypertable 0.9 alpha</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/02/08/hypertable-09-alpha/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/02/08/hypertable-09-alpha/#comments</comments>
		<pubDate>Sat, 09 Feb 2008 02:36:52 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[Database]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Web]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/2008/02/08/hypertable-09-alpha/</guid>
		<description><![CDATA[Hypertable 0.9 alpha is out. &#8220;Hypertable is a high performance distributed data storage system designed to support applications requiring maximum performance, scalability, and reliability. Hypertable will be particularly invaluable to any organization that needs to manage rapidly evolving data to support demanding real-time applications. Modeled after Google&#8217;s well known Bigtable project, Hypertable is designed to [...]]]></description>
			<content:encoded><![CDATA[<p><img src='http://ebiquity.umbc.edu/blogger/wp-content/uploads/2008/02/gra-tesseract.gif' alt='hypertable' align="right" /><a href="http://hypertable.org/">Hypertable</a> 0.9 alpha is out.</p>
<blockquote><p>
&#8220;Hypertable is a high performance distributed data storage system designed to support applications requiring maximum performance, scalability, and reliability. Hypertable will be particularly invaluable to any organization that needs to manage rapidly evolving data to support demanding real-time applications. Modeled after Google&#8217;s well known <a href="http://en.wikipedia.org/wiki/BigTable">Bigtable</a> project, Hypertable is designed to manage the storage and processing of information on a large cluster of commodity servers, providing resilience to machine and component failures. Hypertable seeks to set the open source standard for highly available, petabyte scale, database systems.&#8221; (<a href="http://hypertable.org/about.html">link</a>)
</p></blockquote>
<p><strong>Update:</strong> LinuxWorld has an article on the release, <a href="http://www.linuxworld.com/news/2008/020608-hypertable.html">Zvents releases open-source cluster database</a>, along with a <a href="http://www.linuxworld.com/podcasts/linux/2008/013008-linuxcast.html">podcast</a> featuring Doug Judd, principal search architect for Zvents.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/02/08/hypertable-09-alpha/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How YouTube scales MySQL for its large databases</title>
		<link>http://ebiquity.umbc.edu/blogger/2007/12/28/how-youtube-scales-mysql-for-its-large-databases/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2007/12/28/how-youtube-scales-mysql-for-its-large-databases/#comments</comments>
		<pubDate>Fri, 28 Dec 2007 15:04:29 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[Database]]></category>
		<category><![CDATA[Semantic Web]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/2007/12/28/how-youtube-scales-mysql-for-its-large-databases/</guid>
		<description><![CDATA[Like most research labs, we rely on MySQL whenever we need a database. And like most (I&#8217;m guessing, here), it&#8217;s common to overhear something like the following in our lab &#8212; &#8220;We really need to replace MySQL with Oracle or DB2 in X so it can handle the load.&#8221; But we never get around to [...]]]></description>
			<content:encoded><![CDATA[<p>Like most research labs, we rely on MySQL whenever we need a database. And like most (I&#8217;m guessing, here), it&#8217;s common to overhear something like the following in our lab &#8212; &#8220;We really need to replace MySQL with Oracle or DB2 in X so it can handle the load.&#8221; But we never get around to it.</p>
<p>Maybe we don&#8217;t have to. Check out <a href="http://itc.conversationsnetwork.org/shows/detail3299.html">Scaling MySQL at YouTube</a>, a keynote talk by YouTube DBA Paul Tuckfield at the <a href="http://conferences.oreillynet.com/mysqluc2007/">2007 MySQL Conference</a>, put online by ConversationsNetwork.org.</p>
<blockquote><p>
&#8220;In mid 2006, YouTube served approximately 100 million videos in a single day. To maintain a website of that scale, one would imagine YouTube has hundreds of DBAs. But in fact, there are just three people that make it all work. Paul Tuckfield, the MySQL DBA at YouTube shares horror stories about scalability at YouTube and how he coped with them to keep the show going everyday, while learning important lessons along the way. &#8230;  According to him, the three important reasons for YouTube&#8217;s scalability are Python, <a href="http://en.wikipedia.org/wiki/Memcached">Memcache</a> and <a href="http://dev.mysql.com/doc/refman/5.0/en/replication.html">MySQL replication</a>, the last having the most impact. Most people think that the answer to scalability is in upgrading hardware and CPU power. Adding CPUs doesn&#8217;t work on its own; wisdom is in getting the maximum amount of RAM for the CPU and then fine tuning.&#8221; (<a href="http://itc.conversationsnetwork.org/shows/detail3299.html">src</a>)
</p></blockquote>
<p><a href="http://feeds.conversationsnetwork.org/~r/gigavox/channel/itconversations/~5/207402349/ITC.mySQL-PaulTuckfield-2007.04.26.mp3">Listen to the talk (MP3)</a></p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2007/12/28/how-youtube-scales-mysql-for-its-large-databases/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
	</channel>
</rss>

