<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>UMBC ebiquity &#187; Database</title>
	<atom:link href="http://ebiquity.umbc.edu/blogger/category/database/feed/" rel="self" type="application/rss+xml" />
	<link>http://ebiquity.umbc.edu/blogger</link>
	<description>EBB is the ebiquity research group\\\'s blog at the University of Maryland, Baltimore County (UMBC).  We focus on technologies that facilitate the design, implementation and control of distributed, intelligent information systems -- mobile and pervasive computing, ad hoc networking, multiagent systems, knowledge representation and reasoning, and the semantic web.  As the tides of technology ebb and flow, we hope the good ideas wash up on our beach and the bad ones drift back out to sea.</description>
	<lastBuildDate>Fri, 20 Nov 2009 13:50:39 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>NOSQL: distributed key-value data stores</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/07/02/nosql-distributed-key-value-data-stores/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/07/02/nosql-distributed-key-value-data-stores/#comments</comments>
		<pubDate>Thu, 02 Jul 2009 14:17:10 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[Database]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Web]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=2081</guid>
		<description><![CDATA[ComputerWorld has an article on the &#8220;nosql&#8221; movement and a recent nosql meetup held in San Francisco, No to SQL? Anti-database movement gains steam.  Nosql systems are distributed, non-relational data stores that typically use a simple key-value approach to indexing and retrieving data and use a simple procedural query API rather than a sophisticated [...]]]></description>
			<content:encoded><![CDATA[<p>ComputerWorld has an article on the &#8220;nosql&#8221; movement and a recent <a href="http://nosql.eventbrite.com/">nosql meetup</a> held in San Francisco, <a href="http://www.computerworld.com/action/article.do?command=viewArticleBasic&#038;articleId=9135086">No to SQL? Anti-database movement gains steam</a>.  Nosql systems are distributed, non-relational data stores that typically use a simple <a href="http://en.wikipedia.org/wiki/Associative_array">key-value</a> approach to indexing and retrieving data and use a simple procedural query API rather than a sophisticated declarative query language.</p>
<blockquote><p>
&#8220;The inaugural get-together of the burgeoning NoSQL community crammed 150 attendees into a meeting room at CBS Interactive. Like the Patriots, who rebelled against Britain&#8217;s heavy taxes, NoSQLers came to share how they had overthrown the tyranny of slow, expensive relational databases in favor of more efficient and cheaper ways of managing data.</p>
<p>&#8220;Relational databases give you too much. They force you to twist your object data to fit a RDBMS [relational database management system],&#8221; said Jon Travis, principal engineer at Java toolmaker SpringSource, one of the 10 presenters at the NoSQL confab (PDF). NoSQL-based alternatives &#8220;just give you what you need,&#8221; Travis said.&#8221;
</p></blockquote>
<p>There were presentation on nine different &#8216;nosql&#8217; databases: Voldemort, Cassandra, Dynomite, HBase, Hypertable, CouchDB, VPork, MongoDb as well as general presentations by Google&#8217;s Jonas Karlsson, and Cloudera&#8217;s Todd Lipcon.</p>
<p>Johan Oskarsson of Last.fm wrote a <a href="http://blog.oskarsson.nu/2009/06/nosql-debrief.html">debriefing post</a> on his blog.</p>
<blockquote><p>
&#8220;The relatively young but rapidly growing &#8220;nosql&#8221; community met last Thursday in San Francisco. The idea was to give attendees a solid introduction to how distributed, non relational databases work as well as an overview of the various projects out there.&#8221;
</p></blockquote>
<p>and provides <a href="http://blog.oskarsson.nu/2009/06/nosql-debrief.html">links</a> to the presentation slides and videos.  You can also <a href="http://vimeo.com/videos/search:nosql">search for NOSQL on Vimeo</a> to get the videos.</p>
<p>I learned of this meeting on Hacker News, where you can find some interesting <a href="http://news.ycombinator.com/item?id=683807">comments</a>.</p>
<p>Of course their are many popular key-value stores that are not designed to support the highly-scalable distributed needs of many Web applications.  I found, for example, that as a persistent RDF store for rdflib, Sleepycat out performed MySQL.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/07/02/nosql-distributed-key-value-data-stores/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Price Waterhouse Coopers bullish on the Semantic Web</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/05/29/price-waterhouse-coopers-bullish-on-the-semantic-web/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/05/29/price-waterhouse-coopers-bullish-on-the-semantic-web/#comments</comments>
		<pubDate>Fri, 29 May 2009 13:57:34 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[AI]]></category>
		<category><![CDATA[Database]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[ontology]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1913</guid>
		<description><![CDATA[Price Waterhouse Coopers is one of the largest &#8220;professional services&#8221; organization and has always been strong on technology consulting and advice.  The Spring issue of their quarterly Technology Forecast journal focuses on the Semantic Web.  This is from the table of contents


04 Spinning a data Web. Semantic Web technologies could revolutionize enterprise decision [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://en.wikipedia.org/wiki/PricewaterhouseCoopers">Price Waterhouse Coopers</a> is one of the largest &#8220;professional services&#8221; organization and has always been strong on technology consulting and advice.  The Spring issue of their quarterly Technology Forecast journal focuses on the Semantic Web.  This is from the table of contents</p>
<p><img align="right" src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2009/05/pwc-tech-forecast-spring-2009.jpg" alt="pwc-tech-forecast-spring-2009" title="pwc-tech-forecast-spring-2009" width="215" height="278" class="alignnone size-full wp-image-1919" /></p>
<ul>
<li><b>04 Spinning a data Web</b>. Semantic Web technologies could revolutionize enterprise decision making and information sharing. Here’s why.</li>
<li><b>20 Making Semantic Web connections</b>. Linked Data technology can change the business of enterprise data management.</li>
<li><b> 16 Traversing the Giant Global Graph</b>. Tom Scott of BBC Earth describes how everyone benefits from interoperable data.</li>
<li><b> 28 From folksonomies to ontologies</b>. Uche Ogbuji of Zepheira discusses how early adopters are introducing Semantic Web to the enterprise.</li>
<li><b> 40 How the Semantic Web might improve cancer treatment</b>. M. D. Anderson’s Lynn Vogel explores new techniques for combining clinical and research data.</li>
<li><b> 46 Semantic technologies at the ecosystem level</b>. Frank Chum of Chevron talks about the need for shared ontologies in the oil and gas industry.</li>
</ul>
<p>You can download the free 58 report <a href="http://www.pwc.com/extweb/onlineforms.nsf/weblookup/USENGALLSTechnologyforecast:Downloadvalidatedspring2009">here</a>. You can also read a note on the issue in <a href="http://www.readwriteweb.com/archives/semantic_web_enterprise_pwc.php">ReadWriteWeb</a>, which focuses on linked data and interoperability.</p>
<blockquote><p> &#8220;A new PricewaterhouseCoopersTechnology report explains how the Semantic Web and Linked Data can help enterprises manage their large scale data better. The PwC Center for Technology and Innovation team spent several months researching and analyzing the problem of data silos in enterprises &#8211; and what solutions are being developed to help with that problem. The answer, according to PwC, is Semantic Web techniques. PwC believes that the Semantic Web offers a practical way to address the problem of large-scale data integration. &#8230; &#8220;</p></blockquote>
<p>(Spotted on <a href="http://lists.w3.org/Archives/Public/public-lod/2009May/0317.html">publi-lod@w3.org</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/05/29/price-waterhouse-coopers-bullish-on-the-semantic-web/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Hadoop user group for the Baltimore-DC region</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/02/08/hadoop-user-group-for-the-baltimoredc-region/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/02/08/hadoop-user-group-for-the-baltimoredc-region/#comments</comments>
		<pubDate>Sun, 08 Feb 2009 15:10:17 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[Database]]></category>
		<category><![CDATA[High performance computing]]></category>
		<category><![CDATA[MC2]]></category>
		<category><![CDATA[cloud computing]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1764</guid>
		<description><![CDATA[A Hadoop User Group (HUG) has formed for the Washington DC area via meetup.com.
 &#8220;We&#8217;re a group of Hadoop &#038; Cloud Computing technologists / enthusiasts / curious people who discuss emerging technologies, Hadoop &#038; related software development (HBase, Hypertable, PIG, etc). Come learn from each other, meet nice people, have some food/drink.&#8221; 
The group defines [...]]]></description>
			<content:encoded><![CDATA[<p>A <a href="http://www.meetup.com/Hadoop-DC/">Hadoop User Group</a> (HUG) has formed for the Washington DC area via meetup.com.</p>
<blockquote><p> &#8220;We&#8217;re a group of <a href="http://en.wikipedia.org/wiki/Hadoop">Hadoop</a> &#038; <a href="http://en.wikipedia.org/wiki/Cloud_computing">Cloud Computing</a> technologists / enthusiasts / curious people who discuss emerging technologies, Hadoop &#038; related software development (<a href="http://hadoop.apache.org/hbase/">HBase</a>, <a href="http://hypertable.org/">Hypertable</a>, <a href="http://hadoop.apache.org/pig/">PIG</a>, etc). Come learn from each other, meet nice people, have some food/drink.&#8221; </p></blockquote>
<p>The group defines it&#8217;s geographic location as Columbia MD and their first <a href="http://www.meetup.com/Hadoop-DC/messages/boards/thread/6218422">HUG meetup</a> was held last Wednesday at the BWI Hampton Inn.  In addition to informal social interactions, it featured two presentations:</p>
<ul>
<li> Amir Youssefi from Yahoo! presented an overview of Hadoop. Amir is a member of the Cloud Computing and Data Infrastructure group at Yahoo!, and will be discussing Multi-Dataset Processing (Joins) using Hadoop and Hadoop Table.</li>
<li> Introduction to complex, fault tolerant data processing workflows using Cascading and Hadoop by Scott Godwin &#038; Bill Oley</li>
</ul>
<p>If you&#8217;re in Maryland and interested you can join the group at <a href="http://www.meetup.com/Hadoop-DC/">meetup.com</a> and get announcements for future meetings.  It might provide a good way to learn more about new software to exploit computing clusters and cloud computing.</p>
<p>(Thanks to <a href="http://www.cpdiehl.org/">Chris Diehl</a> for alerting me to this)</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/02/08/hadoop-user-group-for-the-baltimoredc-region/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Database researchers identify hot research topics</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/08/25/database-researchers-assess-hot-research-topics/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/08/25/database-researchers-assess-hot-research-topics/#comments</comments>
		<pubDate>Mon, 25 Aug 2008 08:30:56 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[Computing Research]]></category>
		<category><![CDATA[Database]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Social media]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1601</guid>
		<description><![CDATA[Databases are a fundamental technology for most information systems and especially those based on the web.  A group of senior database researchers met recently to assess the state of database research, as documented in site.  So, where did the Semantic Web fit into their vision?
 &#8220;In late May, 2008, a group of database [...]]]></description>
			<content:encoded><![CDATA[<p>Databases are a fundamental technology for most information systems and especially those based on the web.  A group of senior database researchers met recently to assess the state of database research, as documented in <a href="http://db.cs.berkeley.edu/claremont/">site</a>.  So, where did the Semantic Web fit into their vision?</p>
<blockquote><p> &#8220;In late May, 2008, a group of database researchers, architects, users and pundits met at the Claremont Resort in Berkeley, California to discuss the state of the research field and its impacts on practice. This was the seventh meeting of this sort in twenty years, and was distinguished by a broad consensus that we are at a turning point in the history of the field, due both to an explosion of data and usage scenarios, and to major shifts in computing hardware and platforms. Given these forces, we are at a time of opportunity for research impact, with an unusually large potential for influential results across computing, the sciences and society. This report details that discussion, and highlights the group&#8217;s consensus view of new focus areas, including new database engine architectures, declarative programming languages, the interplay of structured and unstructured data, cloud data services, and mobile and virtual worlds.&#8221;  </p></blockquote>
<p> On the site you can read the post-meeting <a href="http://db.cs.berkeley.edu/claremont/claremontreport08.pdf">report</a>, view the participants <a href="">presentations on DB research directions</a> and talks and <a href="http://groups.google.com/group/claremontreport">discuss</a> the report on a Google group.</p>
<p>It&#8217;s a good report with lots of interesting things in it and definitely worth reading, but I was disappointed to find that it makes <b>no mention</b> of the Semantic Web, RDF, OWL, ontologies, AI, knowledge bases, or reasoning.  Here&#8217;s a word cloud (generated with <a href="http://wordle.net/">wordle</a>) generated from the report, which provides a 10,000 foot view of it&#8217;s content.</p>
<p><center><br />
<a href='http://ebiquity.umbc.edu/blogger/wp-content/uploads/2008/08/claremontwordcloud1.png'><img src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2008/08/claremontwordcloud1.png" alt="word cloud generated from The Claremont Database Research Self-Assessment Meeting report" title="claremont-word-cloud" width="450" height="294" /></a><br />
</center></p>
<p>The reports says that it was &#8220;surprisingly easy for the group to reach consensus on a set of research topics to highlight for investigation in coming years&#8221;.  Those topics are:</p>
<ul>
<li>Revisiting Database Engines</li>
<li>Declarative Programming for Emerging Platforms</li>
<li>The Interplay of Structured and Unstructured Data</li>
<li>Cloud Data Services</li>
<li>Mobile Applications and Virtual Worlds</li>
</ul>
<p>There is clearly overlap between the database and semantic web communities in the first three topics.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/08/25/database-researchers-assess-hot-research-topics/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Hypertable 0.9 alpha</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/02/08/hypertable-09-alpha/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/02/08/hypertable-09-alpha/#comments</comments>
		<pubDate>Sat, 09 Feb 2008 02:36:52 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[Database]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Web]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/2008/02/08/hypertable-09-alpha/</guid>
		<description><![CDATA[Hypertable 0.9 alpha is out.

&#8220;Hypertable is a high performance distributed data storage system designed to support applications requiring maximum performance, scalability, and reliability.  Hypertable will be particularly invaluable to any organization that needs to manage rapidly evolving data to support demanding real-time applications. Modeled after Google&#8217;s well known Bigtable project, Hypertable is designed to [...]]]></description>
			<content:encoded><![CDATA[<p><img src='http://ebiquity.umbc.edu/blogger/wp-content/uploads/2008/02/gra-tesseract.gif' alt='hypertable' align="right" /><a href="http://hypertable.org/">Hypertable</a> 0.9 alpha is out.</p>
<blockquote><p>
&#8220;Hypertable is a high performance distributed data storage system designed to support applications requiring maximum performance, scalability, and reliability.  Hypertable will be particularly invaluable to any organization that needs to manage rapidly evolving data to support demanding real-time applications. Modeled after Google&#8217;s well known <a href="http://en.wikipedia.org/wiki/BigTable">Bigtable</a> project, Hypertable is designed to manage the storage and processing of information on a large cluster of commodity servers, providing resilience to machine and component failures. Hypertable seeks to set the open source standard for highly available, petabyte scale, database systems. &#8221; (<a href="http://hypertable.org/about.html">link</a>)
</p></blockquote>
<p><strong>Update:</strong> LinuxWorld has an article, <a href="http://www.linuxworld.com/news/2008/020608-hypertable.html">Zvents releases open-source cluster database</a>, on the release along with a <a href="http://www.linuxworld.com/podcasts/linux/2008/013008-linuxcast.html">podcast</a> with Doug Judd, principal search architect for Zvents. </p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/02/08/hypertable-09-alpha/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How YouTube scales MySQL for its large databases</title>
		<link>http://ebiquity.umbc.edu/blogger/2007/12/28/how-youtube-scales-mysql-for-its-large-databases/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2007/12/28/how-youtube-scales-mysql-for-its-large-databases/#comments</comments>
		<pubDate>Fri, 28 Dec 2007 15:04:29 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[Database]]></category>
		<category><![CDATA[Semantic Web]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/2007/12/28/how-youtube-scales-mysql-for-its-large-databases/</guid>
		<description><![CDATA[Like most research labs, we rely on MySQL whenever we need a database. And like most (I&#8217;m guessing, here), it&#8217;s common to overhear something like the following in our lab &#8212; &#8220;We really need to replace MySQL with Oracle or DB2 in X so it can handle the load.&#8221; But we never get around to [...]]]></description>
			<content:encoded><![CDATA[<p>Like most research labs, we rely on MySQL whenever we need a database. And like most (I&#8217;m guessing, here), it&#8217;s common to overhear something like the following in our lab &#8212; &#8220;We really need to replace MySQL with Oracle or DB2 in X so it can handle the load.&#8221; But we never get around to it.</p>
<p>Maybe we don&#8217;t have to.  Check out <a href="http://itc.conversationsnetwork.org/shows/detail3299.html">Scaling MySQL at YouTube</a>, a keynote talk by YouTube DBA Paul Tuckfield at the <a href="http://conferences.oreillynet.com/mysqluc2007/">2007 MySQL Conference</a> put online by Conversationnetwork.org.</p>
<blockquote><p>
&#8220;In mid 2006, YouTube served approximately 100 million videos in a single day. To maintain a website of that scale, one would imagine YouTube has hundreds of DBAs. But in fact, there are just three people that make it all work. Paul Tuckfield, the MySQL DBA at YouTube shares horror stories about scalability at YouTube and how he coped with them to keep the show going everyday, while learning important lessons along the way. &#8230;  According to him, the three important reasons for YouTube&#8217;s scalability are Python, <a href="http://en.wikipedia.org/wiki/Memcached">Memcache</a> and <a href="http://dev.mysql.com/doc/refman/5.0/en/replication.html">MySQL replication</a>, the last having the most impact. Most people think that the answer to scalability is in upgrading hardware and CPU power. Adding CPUs doesn&#8217;t work on its own; wisdom is in getting the maximum amount of RAM for the CPU and then fine tuning.&#8221; (<a href="http://itc.conversationsnetwork.org/shows/detail3299.html">src</a>)
</p></blockquote>
<p><center></p>
<div class="audio-player-container player"><script type="text/javascript"> function FlashRequest(command, args) {}</script></p>
<div class="audio-player-placeholder"><object width="400" height="27" align="middle" codebase="http://fpdownload.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=7,0,0,0" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000"><param value="best" name="quality"/><param value="never" name="allowScriptAccess"/><param value="window" name="wmode"/><param value="http://www.google.com/reader/ui/3247397568-audio-player.swf?audioUrl=http%3A%2F%2Ffeeds.conversationsnetwork.org%2F~r%2Fgigavox%2Fchannel%2Fitconversations%2F~5%2F207402349%2FITC.mySQL-PaulTuckfield-2007.04.26.mp3" name="movie"/><embed width="400" height="27" pluginspage="http://www.macromedia.com/go/getflashplayer" flashvars="playerMode=embedded" wmode="window" bgcolor="#ffffff" quality="best" allowscriptaccess="never" src="http://www.google.com/reader/ui/3247397568-audio-player.swf?audioUrl=http%3A%2F%2Ffeeds.conversationsnetwork.org%2F~r%2Fgigavox%2Fchannel%2Fitconversations%2F~5%2F207402349%2FITC.mySQL-PaulTuckfield-2007.04.26.mp3" type="application/x-shockwave-flash" classname="audio-player-embed"/></object></div>
</div>
<p></center></p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2007/12/28/how-youtube-scales-mysql-for-its-large-databases/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
	</channel>
</rss>
