<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>UMBC ebiquity &#187; NLP</title>
	<atom:link href="http://ebiquity.umbc.edu/blogger/category/ai/nlp/feed/" rel="self" type="application/rss+xml" />
	<link>http://ebiquity.umbc.edu/blogger</link>
	<description>EBB is the ebiquity research group\\\'s blog at the University of Maryland, Baltimore County (UMBC).  We focus on technologies that facilitate the design, implementation and control of distributed, intelligent information systems -- mobile and pervasive computing, ad hoc networking, multiagent systems, knowledge representation and reasoning, and the semantic web.  As the tides of technology ebb and flow, we hope the good ideas wash up on our beach and the bad ones drift back out to sea.</description>
	<lastBuildDate>Fri, 20 Nov 2009 13:50:39 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Google VP on semantic search and the Semantic Web</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/11/11/google-vp-on-semantic-search-and-the-semantic-web/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/11/11/google-vp-on-semantic-search-and-the-semantic-web/#comments</comments>
		<pubDate>Wed, 11 Nov 2009 14:00:23 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[AI]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[NLP]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[sEARCH]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=2660</guid>
		<description><![CDATA[PCWorld has a story, Google VP Mayer Describes the Perfect Search Engine, with some interesting comments on semantic search from Marissa Mayer, Google&#8217;s vice president of Search Products &#038; User Experience.

&#8220;IDGNS: What&#8217;s the status of semantic search at Google? You have said in the past that through &#8220;brute force&#8221; &#8212; analyzing massive amounts of queries [...]]]></description>
			<content:encoded><![CDATA[<p>PCWorld has a story, <a href="http://www.pcworld.com/businesscenter/article/181874/google_vp_mayer_describes_the_perfect_search_engine.html">Google VP Mayer Describes the Perfect Search Engine</a>, with some interesting comments on <i>semantic search</i> from Marissa Mayer, Google&#8217;s vice president of Search Products &#038; User Experience.</p>
<blockquote><p>
&#8220;IDGNS: What&#8217;s the status of semantic search at Google? You have said in the past that through &#8220;brute force&#8221; &#8212; analyzing massive amounts of queries and Web content &#8212; Google&#8217;s engine can deliver results that make it seem as if it understood things semantically, when it really functions using other algorithmic approaches. Is that still the preferred approach?</p>
<p>Mayer: We believe in building intelligent systems that learn off of data in an automated way, [and then] tuning and refining them. When people talk about semantic search and the semantic Web, they usually mean something that is very manual, with maps of various associations between words and things like that. We think you can get to a much better level of understanding through pattern-matching data, building large-scale systems. That&#8217;s how the brain works. That&#8217;s why you have all these fuzzy connections, because the brain is constantly processing lots and lots of data all the time.</p>
<p>IDGNS: A couple of years ago or so, some experts were predicting that semantic technology would revolutionize search and blindside Google, but that hasn&#8217;t happened. It seems that semantic search efforts have hit a wall, especially because semantic engines are hard to scale.</p>
<p>Mayer: The problem is that language changes. Web pages change. How people express themselves changes. And all those things matter in terms of how well semantic search applies. That&#8217;s why it&#8217;s better to have an approach that&#8217;s based on machine learning and that changes, iterates and responds to the data. That&#8217;s a more robust approach. That&#8217;s not to say that semantic search has no part in search. It&#8217;s just that for us, we really prefer to focus on things that can scale. If we could come up with a semantic search solution that could scale, we would be very excited about that. For now, what we&#8217;re seeing is that a lot of our methods approximate the intelligence of semantic search but do it through other means.&#8221;  </p></blockquote>
<p>I interpret these comments to mean that Google&#8217;s management still views the concept of semantic search (and the Semantic Web) as involving better understanding of the intended meaning of text in documents and queries.  The W3C&#8217;s <i>web of data</i> model is still not on their radar.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/11/11/google-vp-on-semantic-search-and-the-semantic-web/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>WolframAlpha releases API</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/10/16/wolframalpha-releases-api/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/10/16/wolframalpha-releases-api/#comments</comments>
		<pubDate>Fri, 16 Oct 2009 05:16:04 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[AI]]></category>
		<category><![CDATA[KR]]></category>
		<category><![CDATA[NLP]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Semantic Web]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=2589</guid>
		<description><![CDATA[Wolfram&#124;Alpha is an interesting query answering system developed by Wolfram Research that is a blend of a question answering system and a Semantic Web alternative.  It tries to interpret and answer queries expressed as a sequence of words from a large collection of interlinked tables.  Oh, and Mathematica is in thrown in for [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://en.wikipedia.org/wiki/Wolfram_Alpha">Wolfram|Alpha</a> is an interesting query answering system developed by Wolfram Research that is a blend of a <a href="http://en.wikipedia.org/wiki/Question_answering">question answering</a> system and a Semantic Web alternative.  It tries to interpret and answer queries expressed as a sequence of words from a large collection of interlinked tables.  Oh, and <a href="http://en.wikipedia.org/wiki/Mathematica">Mathematica</a> is in thrown in for free.  A free Web version was released last Spring.</p>
<p>The news today is that Wolfram|Alpha has released an API, as noted in their <a href="http://blog.wolframalpha.com/2009/10/15/the-wolframalpha-api-has-arrived/">blog</a>:</p>
<blockquote><p> &#8220;The API allows your application to interact with Wolfram|Alpha much like you do on the web—you send a web request with the same query string you would type into Wolfram|Alpha’s query box and you get back the same computed results. It’s just that both are in a form your application can understand. There are plenty of ways to tweak and control the results, as well.&#8221;  </p></blockquote>
<p>The <a href="http://products.wolframalpha.com/api/pricing.html">pricing plan</a> runs from $60/month for 1000 (6 cents a query) queries to $220K for up to 10M queries/month (2.2 cents a query).  programming <a href="http://products.wolframalpha.com/api/languagebindings.html">language bindings</a> are available for Java, PHP, Perl, Python, Ruby and .NET.</p>
<p>Their original web interface remains free, but the TOS specifies that it <o>&#8220;may be used only by a human being using a conventional web browser to manually enter queries one at a time.&#8221;<br />
</o></p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/10/16/wolframalpha-releases-api/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>HealthBase semantic search is very positive about the Semantic Web</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/09/03/healthbase-semantic-search-is-very-positive-about-the-semantic-web/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/09/03/healthbase-semantic-search-is-very-positive-about-the-semantic-web/#comments</comments>
		<pubDate>Fri, 04 Sep 2009 04:26:59 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[NLP]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[sEARCH]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=2413</guid>
		<description><![CDATA[HealthBase is a &#8217;semantic search engine&#8217; for healthcare information that is driven by content mined from &#8220;millions of authoritative health sources&#8221; including WebMD, Wikipedia, PubMed, and Mayo Clinic’s health site.  Techcrunch first described it as the ultimate medical content search engine but then had a follow up article reporting that HealthBase thinks you can [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://healthbase.netbase.com/">HealthBase</a> is a &#8217;semantic search engine&#8217; for healthcare information that is driven by content mined from &#8220;millions of authoritative health sources&#8221; including WebMD, Wikipedia, PubMed, and Mayo Clinic’s health site.  Techcrunch first described it as the <a href="http://www.techcrunch.com/2009/09/02/healthbase-is-the-ultimate-medical-content-search-engine/">ultimate medical content search engine</a> but then had a follow up article reporting that HealthBase <a href="http://www.techcrunch.com/2009/09/02/netbase-thinks-you-can-get-rid-of-jews-with-alcohol-and-salt/">thinks you can get rid of jews with alcohol and salt</a>.  Language Log had some more <a href="http://languagelog.ldc.upenn.edu/nll/?p=1715">fun</a> exploring HealthBase.</p>
<p>We thought we&#8217;d see what HealthBase thought of the Semantic Web and it turns out that if you are experiencing the Semantic Web as a <i>condition</i> there are several recommended <a href="http://healthbase.netbase.com/#Semantic%20Web&#038;Treatments">treatments</a>.</p>
<p><img src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2009/09/Picture-4.png" alt="healthbase1" title="Treatments for the Semantic Web" width="327" height="256" /></p>
<p>and as a <i>treatment</i> itself, HealthBase is pretty <a href="http://healthbase.netbase.com/#Semantic%20Web&#038;Pros">positive</a>.</p>
<p><img src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2009/09/Picture-5.png" alt="healthbase2" title="Pros and cons of the Semantic Web according to HealthBase" width="260" height="338"  /></p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/09/03/healthbase-semantic-search-is-very-positive-about-the-semantic-web/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>AAAI study examines long-term AI futures and impact on society</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/07/25/aaai-study-examines-long-term-ai-futures-and-impact-on-society/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/07/25/aaai-study-examines-long-term-ai-futures-and-impact-on-society/#comments</comments>
		<pubDate>Sat, 25 Jul 2009 20:10:48 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[AI]]></category>
		<category><![CDATA[NLP]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[aaai]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=2234</guid>
		<description><![CDATA[John Markoff has an article for tomorrow&#8217;s New York Times, Scientists Worry Machines May Outsmart Man on a recent AAAI study on the future of AI.
 &#8220;A robot that can open doors and find electrical outlets to recharge itself. Computer viruses that no one can stop. Predator drones, which, though still controlled remotely by humans, [...]]]></description>
			<content:encoded><![CDATA[<p>John Markoff has an article for tomorrow&#8217;s New York Times, <a href="http://www.nytimes.com/2009/07/26/science/26robot.html">Scientists Worry Machines May Outsmart Man</a> on a recent AAAI study on the future of AI.</p>
<blockquote><p> &#8220;A robot that can open doors and find electrical outlets to recharge itself. Computer viruses that no one can stop. Predator drones, which, though still controlled remotely by humans, come close to a machine that can kill autonomously.  Impressed and alarmed by advances in artificial intelligence, a group of computer scientists is debating whether there should be limits on research that might lead to loss of human control over computer-based systems that carry a growing share of society’s workload, from waging war to chatting with customers on the phone.&#8221;  </p></blockquote>
<p>The <a href="http://research.microsoft.com/en-us/um/people/horvitz/AAAI_Presidential_Panel_2008-2009.htm">study</a> was commissioned by AAAI to &#8220;to explore and address potential long-term societal influences of AI research and development&#8221;.  Look for a report published by AAAI later this year.  The study involved twenty-five participants who were divided into three subgroups: on concerns, control and guidelines, the nature and timing of disruptive advances, and ethical and legal issues.</p>
<p>There was a <a href="http://ijcai-09.org/invited_panel.html">panel session</a> earlier this month at IJCAI where some of the study participants discussed highlights from the study.  Hopefully this was filmed and the results will be added to the <a href="http://videolectures.net/ijcai09_pasadena/">videolectures.net IJCAI09 collection</a>.</p>
<p>While I am generally skeptical of an impending <a href="http://en.wikipedia.org/wiki/Technological_singularity">technological singularity</a>, which seems to sum up many of the concerns some have, there are aspects of the future that I do wonder about.  At the top of my list is what will happen when virtually all of human knowledge is published on the Web (as it nearly is now) in a for that machines can understand.  I&#8217;m pretty sure that this will happen in the next decade or two, either through the current <a href="http://en.wikipedia.org/wiki/Semantic_Web">Semantic Web</a> approach (as a web of data) or by gradually improving techniques for machine understanding of human languages and images.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/07/25/aaai-study-examines-long-term-ai-futures-and-impact-on-society/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Nano-content: 1st 2 words</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/04/06/nano-content-1st-2-words/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/04/06/nano-content-1st-2-words/#comments</comments>
		<pubDate>Mon, 06 Apr 2009 18:46:21 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[NLP]]></category>
		<category><![CDATA[Web]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1819</guid>
		<description><![CDATA[Not only do you have to choose title of your papers, posts and web pages well, their first two words should be chosen to carry the message. Jakob Nielsen reports on UI research showing that the first 11 characters of links and headlines are important in forming some idea of what the item is about.

First [...]]]></description>
			<content:encoded><![CDATA[<p>Not only do you have to choose title of your papers, posts and web pages well, their first two words should be chosen to carry the message. <a href="http://en.wikipedia.org/wiki/Jakob_Nielsen_(usability_consultant)">Jakob Nielsen</a> reports on UI research showing that the first 11 characters of links and headlines are important in forming some idea of what the item is about.</p>
<blockquote><p>
<a href="http://www.useit.com/alertbox/nanocontent.html">First 2 Words: A Signal for the Scanning Eye</a><br />
&#8220;Our newest usability study &#8230; tests how well users understand the first 11 characters of a website&#8217;s links and headlines. For example, we&#8217;d represent this article by the &#8220;First 2 Wor&#8221; string. &#8230; Why test text that&#8217;s so severely truncated? Because online reading is often dominated by the F-pattern. That is, people read the first few listed items somewhat thoroughly — thus the cross-bars of the &#8220;F&#8221; — but read less and less as they continue down the list, eventually passing their eyes down the text&#8217;s left side in a fairly straight line. At this point, users see only the very beginning of the items in a list. &#8230;&#8221;
</p></blockquote>
<p>Nielsen calls the initial few words in a title &#8220;nano-content&#8221;.  While it&#8217;s hard to pack some ideas into 11 characters, it sounds like a good goal.</p>
<p>Choosing the words for a link or title carefully is a key to influencing search engines &#8212; these words are given higher weight when indexing the associated content.  But search engines don&#8217;t scan like humans, so putting the most relevant early in the string helps when a person is shown a list of results.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/04/06/nano-content-1st-2-words/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Wolfram Alpha: an alternative to Google, the Semantic Web and Cyc?</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/03/11/wolfram-alpha-google-cyc-semantic-web/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/03/11/wolfram-alpha-google-cyc-semantic-web/#comments</comments>
		<pubDate>Wed, 11 Mar 2009 13:46:20 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[AI]]></category>
		<category><![CDATA[Datamining]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[NLP]]></category>
		<category><![CDATA[Semantic Web]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1804</guid>
		<description><![CDATA[There&#8217;s been a lot of interest in Wolfram Alpha in the past week, starting with a blog post from Steve Wolfram, Wolfram&#124;Alpha Is Coming!, in which he described his approach to building a system that integrates vast amounts of knowledge and then tries to answer free form questions posed to it by people.  His [...]]]></description>
			<content:encoded><![CDATA[<p>There&#8217;s been a lot of interest in <a href="http://www.wolframalpha.com/">Wolfram Alpha</a> in the past week, starting with a blog post from Steve Wolfram, <a href="http://blog.wolfram.com/2009/03/05/wolframalpha-is-coming/">Wolfram|Alpha Is Coming!</a>, in which he described his approach to building a system that integrates vast amounts of knowledge and then tries to answer free form questions posed to it by people.  His post lays out his approach, which does not involve extracting data from online text.</p>
<blockquote><p>
&#8220;A lot of it is now on the web—in billions of pages of text. And with search engines, we can very efficiently search for specific terms and phrases in that text.  But we can’t compute from that. And in effect, we can only answer questions that have been literally asked before. We can look things up, but we can’t figure anything new out.</p>
<p>So how can we deal with that? Well, some people have thought the way forward must be to somehow automatically understand the natural language that exists on the web. Perhaps getting the web semantically tagged to make that easier.</p>
<p>But armed with <a href="http://www.wolfram.com/products/mathematica/">Mathematica</a> and <a href="http://www.wolframscience.com/">NKS</a> I realized there’s another way: explicitly implement methods and models, as algorithms, and explicitly curate all data so that it is immediately computable.&#8221;
</p></blockquote>
<p>Nova Spivack took a look at Wolfram Alpha last week and thought that it could be <a href="http://www.twine.com/item/122mz8lz9-4c/wolfram-alpha-is-coming-and-it-could-be-as-important-as-google">&#8220;as important as Google&#8221;</a>.</p>
<blockquote><p>
In a nutshell, Wolfram and his team have built what he calls a &#8220;computational knowledge engine&#8221; for the Web. OK, so what does that really mean? Basically it means that you can ask it factual questions and it computes answers for you.</p>
<p>It doesn&#8217;t simply return documents that (might) contain the answers, like Google does, and it isn&#8217;t just a giant database of knowledge, like the Wikipedia. It doesn&#8217;t simply parse natural language and then use that to retrieve documents, like Powerset, for example.</p>
<p>Instead, Wolfram Alpha actually computes the answers to a wide range of questions &#8212; like questions that have factual answers such as &#8220;What is the location of Timbuktu?&#8221; or &#8220;How many protons are in a hydrogen atom?,&#8221; &#8220;What was the average rainfall in Boston last year?,&#8221; &#8220;What is the 307th digit of Pi?,&#8221; &#8220;where is the ISS?&#8221; or &#8220;When was GOOG worth more than $300?&#8221;
</p></blockquote>
<p>Doug Lenat, also had a chance to <a href="http://www.semanticuniverse.com/blogs-i-was-positively-impressed-wolfram-alpha.html">preview Wolfram Alpha</a> and came away impressed:</p>
<blockquote><p>
&#8220;Stephen Wolfram generously gave me a two-hour demo of Wolfram Alpha last evening, and I was quite positively impressed.  As he said, it&#8217;s not AI, and not aiming to be, so it shouldn&#8217;t be measured by contrasting it with HAL or Cyc but with Google or Yahoo.&#8221;
</p></blockquote>
<p>Doug&#8217;s review does a good job of sketching the differences he ses between Wolfram Alpha and systems like Google and Cyc.</p>
<p>Lenat&#8217;s description makes Wolfram Alpha sound like a variation on the Semantic Web vision, but one that more like a giant closed database than a distributed Web of data.  The system is set to launch in May 2009 and I&#8217;m anxious to give it a try.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/03/11/wolfram-alpha-google-cyc-semantic-web/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Twitter-Calais mashup tracks IL-5 election buzz</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/02/24/twitter-calais-mashup-tracks-il-5-election-buzz/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/02/24/twitter-calais-mashup-tracks-il-5-election-buzz/#comments</comments>
		<pubDate>Tue, 24 Feb 2009 21:02:42 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[Mobile Computing]]></category>
		<category><![CDATA[NLP]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Social media]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1791</guid>
		<description><![CDATA[WindyCitizen.com is &#8220;a crowd-powered front page for the Windy City&#8221; that &#8220;brings Chicagoans the best of the local web by letting them share, rate and discuss their favorite local news, photos, videos and more.&#8221;



Their Windy City Twitter Tracker mashup uses Open Calais as a named entity recognizer to track Tweets about candidates in the special [...]]]></description>
			<content:encoded><![CDATA[<p>WindyCitizen.com is &#8220;a crowd-powered front page for the Windy City&#8221; that &#8220;brings Chicagoans the best of the local web by letting them share, rate and discuss their favorite local news, photos, videos and more.&#8221;</p>
<p><center><br />
<a href="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2009/02/picture-11.png"><img src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2009/02/picture-11-300x113.png" title="Twitter-Calais mashup tracks IL-5 election buzz" /></a><br />
</center><br />
Their <a href="http://election.windycitizen.com/candidates/all">Windy City Twitter Tracker</a> mashup uses <a href="http://opencalais.com/">Open Calais</a> as a named entity recognizer to track Tweets about candidates in the special election to fill the US House seat for Chicago&#8217;s 5th district that that Rahm Emanuel vacated.  Calais might be overkill for this, since there is a small set of known candidates, but it&#8217;s an impressive semantic mashup nonetheless.</p>
<blockquote><p> &#8220;We&#8217;re searching Twitter constantly to keep you up to date with the conversation about the IL-5 special election. The graph above lets you track buzz about the candidates over the last two weeks.&#8221;  </p></blockquote>
<p>The <a href="http://election.windycitizen.com/candidates/all">Windy City Twitter Tracker</a> is probably written to be easily repurposed, judging from the Web site, which describe it as currently tracking the &#8220;Race for the 5th&#8221;.  The mashup is credited to <a href="http://whattechnology.com/">Whattech</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/02/24/twitter-calais-mashup-tracks-il-5-election-buzz/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Martin Kay: When is a Translation not a Translation, 4:30 Tue 2/3, JHU</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/01/30/martin-kay-when-is-a-translation-not-a-translation-430-tue-23-jhu/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/01/30/martin-kay-when-is-a-translation-not-a-translation-430-tue-23-jhu/#comments</comments>
		<pubDate>Fri, 30 Jan 2009 16:08:44 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[NLP]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1749</guid>
		<description><![CDATA[Next week the JHU Center for Language and Speech Processing will host a talk by Martin Kay of  Stanford University, When is a Translation not a Translation? at 4:30pm Tuesday, 3 February 2009.  From the announcement:
 &#8220;A translation is generally taken to be a text that expresses the same meaning as another text [...]]]></description>
			<content:encoded><![CDATA[<p>Next week the JHU Center for Language and Speech Processing will host a talk by <a href="http://www.stanford.edu/~mjkay/">Martin Kay</a> of  Stanford University, <a href="http://www.clsp.jhu.edu/news-events/abstract.php?sid=20090203">When is a Translation not a Translation?</a> at 4:30pm Tuesday, 3 February 2009.  From the announcement:</p>
<blockquote><p> &#8220;A translation is generally taken to be a text that expresses the same meaning as another text in a different language. But the products of the best translators reflects a different, if more illusive, goal. I will seek a somewhat more adequate characterization of translation as it is actually practiced and discuss its consequences for machine translation.</p>
<p>Martin Kay is a professor of linguistics and computer science at Stanford University. For many years, he was also a research fellow at the Xerox Palo Alto Research Center. He made a number of fundamental contributions to computational linguistics, including chart parsing, unification grammar, and applications of finite-state technology, notably in phonology. He has been an intermittent worker on, and skeptical observer of, machine translation since 1958.&#8221;</p></blockquote>
<p>For a preview of what he will probably talk about, you might look at a paper on Professor Kay&#8217;s web site that he describes as &#8220;<a href="http://www.stanford.edu/~mjkay/CurrentState.pdf">some unfinished musings on the nature of translation</a>&#8220;.</p>
<p>This a chance to hear someone who has made many important contributions to several areas of computational linguistics and computer science over a long career.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/01/30/martin-kay-when-is-a-translation-not-a-translation-430-tue-23-jhu/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Extracting Wikipedia infobox values from text</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/01/27/extracting-wikipedia-infoboxes-values-from-text/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/01/27/extracting-wikipedia-infoboxes-values-from-text/#comments</comments>
		<pubDate>Tue, 27 Jan 2009 20:39:28 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[NLP]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Social media]]></category>
		<category><![CDATA[Wikipedia]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1747</guid>
		<description><![CDATA[ This year&#8217;s Text Analysis Conference (TAC) has an interesting track focused on processing text to populate Wikipedia infoboxes, both for existing entities with missing values as well as newly discovered entities.  
TAC has been run by the US National Institute of Standards and Technology (NIST) to to encourage research in natural language processing [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2009/01/tac-logo002-150x150.jpg" alt="Text Analysis Conference" title="tac-logo002" width="150" height="150" align="right" /> This year&#8217;s <a href="http://www.nist.gov/tac/">Text Analysis Conference</a> (TAC) has an interesting track focused on processing text to <a href="http://apl.jhu.edu/~paulmac/kbp.html">populate Wikipedia infoboxes</a>, both for existing entities with missing values as well as newly discovered entities.  </p>
<p>TAC has been run by the US National Institute of Standards and Technology (NIST) to to encourage research in natural language processing and related applications.  As in the NIST sponsored MUC, TREC and ACE workshops, this is done by by providing a large test collection, common evaluation procedures, and a forum for organizations to share their results.  The first TAC was held this year and included 65 teams from 20 countries who participated in three tracks: question answering, summarization and recognizing textual entailments.</p>
<p>TAC 2009 will include a new track on <a href="http://apl.jhu.edu/~paulmac/kbp.html">Knowledge Base Population</a> coordinated by <a href="mailto:paul.mcnamee@jhuapl.edu">Paul McNamee</a> of the Johns Hopkins University <a href="http://web.jhu.edu/hltcoe">Human Language Technology Center of Excellence</a>.</p>
<blockquote><p> &#8220;The goal of the new Knowledge Base Population track is to augment an existing knowledge representation with information about entities that is discovered from a collection of documents. A snapshot of Wikipedia infoboxes will be used as the original knowledge source, and participants will be expected to fill in empty slots for entities that do exist, add missing entities and their learnable attributes, and provide links between entities and references to text supporting extracted information. The KBP task lies at the intersection of Question Answering and Information Extraction and is expected to be of particular interest to groups that have participated in ACE or TREC QA.&#8221;</p></blockquote>
<p>This is an exciting task and doing well in it will require a a mixture of language processing, knowledge-based processing and (probably) machine learning.</p>
<p>The TAC 2009 workshop will be co-located with TREC and held 16-17 November in Gaithersburg, MD.  If you are interested in participating, you should register by March 3.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/01/27/extracting-wikipedia-infoboxes-values-from-text/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>N. American Computational Linguistics Olympiad at UMBC</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/01/07/n-american-computational-linguistics-olympiad-at-umbc/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/01/07/n-american-computational-linguistics-olympiad-at-umbc/#comments</comments>
		<pubDate>Wed, 07 Jan 2009 22:14:43 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
				<category><![CDATA[AI]]></category>
		<category><![CDATA[NLP]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[UMBC]]></category>
		<category><![CDATA[computational linguistics]]></category>
		<category><![CDATA[naclo]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1729</guid>
		<description><![CDATA[If you are a high school or middle school student who is interested in
computers and also in languages, you should consider participating in the 2009 North American Computational Linguistics Olympiad (NACLO).  This might be the first step on a path that could lead to your helping to create the next Google!
NACLO is a competition [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2009/01/naclo-poster.jpg"><img src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2009/01/naclo-poster-150x150.jpg" align="right"/></a>If you are a high school or middle school student who is interested in<br />
computers and also in languages, you should consider participating in the <a href="http://www.naclo.cs.cmu.edu/">2009 North American Computational Linguistics Olympiad</a> (NACLO).  This might be the first step on a path that could lead to your helping to create the next Google!</p>
<p><a href="http://en.wikipedia.org/wiki/North_American_Computational_Linguistics_Olympiad">NACLO</a> is a competition for middle-school and high-school students focused on solving problems involving linguistics and computational linguistics.  WOrking the problems only requires keen analytical ability and good problem-solving skills &#8212; no prior background in linguistics, foreign languages or computer science is required.</p>
<p>NACLO consists of two rounds &#8212; an initial round on February 4 open to all students and a subsequent invitational round on March 11 for contestants who have advanced from the first. Winners of the second round will be invited to participate in  the <a href="http://en.wikipedia.org/wiki/International_Linguistics_Olympiad">International Linguistics Olympiad</a>.  Last year, two US teams went to Bulgaria to compete in the sixth International Linguistics Olympiad and <a href="http://www.nsf.gov/news/news_summ.jsp?cntn_id=112073">gold medals</a> in  individual and team events.</p>
<p>Support for NACLO is provided by Google, the <a href="http://www.aclweb.org/">Associaton for Computational Linguistics</a>, and the National Science Foundation, which said in an August press release :</p>
<blockquote><p>
&#8220;Aside from being a fun intellectual challenge, the Olympiad mimics the skills used by researchers and scholars in the field of computational linguistics, which is increasingly important for the United States and other countries. Using computational linguistics, these experts can develop automated technologies such as translation software that cut down on the time and training needed to work with other languages, or software that automatically produces informative English summaries of documents in other languages or answer questions about information in these documents. In an increasingly global economy where businesses operate across borders and languages, having a strong pool of computational linguists is a competitive advantage. With threats emerging from different parts of the world, developing computational linguistics skills has also been identified as vital to national defense in the 21st century.&#8221; (<a href="http://www.nsf.gov/news/news_summ.jsp?cntn_id=112073">src</a>)
</p></blockquote>
<p>Students can participate at the NACLO site at UMBC, which is sponsored by the <a href="http://ilit.umbc.edu/">UMBC Institute for Language in Information Technology</a>.  Check out their <a href="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2009/01/naclo-poster.jpg">poster</a> and <a href="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2009/01/sample-problem.pdf">sample problem</a> If you like this kind of puzzle and <a href="http://www.naclo.cs.cmu.edu/practice.php">others like it</a>, sign up to be part this exciting competition.</p>
<p>Students should <a href="http://www.naclo.cs.cmu.edu/reg_student.php">register online</a> by January 20. Late registrations may be accepted up to February 3 if space is available. The UMBC NACLO event will take place on Wednesday February 4 in room 312 of the University Center.  For more information, contact one of the local organizers: Professors <a href="http://ilit.umbc.edu/PeoplePage.htm">Marjorie McShane</a> (marge@umbc.edu), <a href="http://ilit.umbc.edu/PeoplePage.htm">Sergei Nirenburg</a> (sergei@umbc.edu) and Margaret A. Russell (margaret.a.russell@gmail.com).</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/01/07/n-american-computational-linguistics-olympiad-at-umbc/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
