 | 2007 August 
Archive for August, 2007
August 13th, 2007, by Tim Finin, posted in Uncategorized
Sigh. The Semantic Web is for sale. Literally. Sort of.
Subject: semanticweb.com and semantic-web.com on sale Date: Mon, 13 Aug 2007 15:20:25 -0000 From: flashjack_jumping <flashjack_jumping @yahoo.de> To: semanticweb@yahoogroups.com
*semanticweb.com *and *semantic-web.com* on sale
please submit offers to <info at semanticweb.com>
they should roughly start at 10k USD
pierre __._,_.___
We know only too well how this scheme works, as we’ve been burned several times ourselves. You register a nifty domain and for a pittance. Then, a year or two later you are warned that your domain is about to expire. Several times. Maybe you tell yourself that you’ve got to remember to take care of that (repeatedly) or perhaps the messages end up in your spam bucket (that’s our excuse and we’re sticking to it). Or maybe you are busy actually trying to create the Semantic Web.
In the mean time, useless, low-life Internet bottom-feeders are running scripts that identify domains with good pagerank and/or hot names that are due to expire soon. They use any of several services or scripts to try to grab the registration the moment it expires (for a pittance) and then try to sell it back to the original owner or a competitor or even some spammer who would like to have a page with a good initial pagerank.
Doing this is not illegal and it’s probably part of the brilliance that is capitalism. But it is very damaging to one’s Karma. And I pity the poor immigrant who’s strength is spent in vain.
I hope that the bottom dweller who did this doesn’t get his 10K USD for this.
Edit | Bookmark@del.icio.us | Trackback | No Comments »
August 10th, 2007, by Tim Finin, posted in Uncategorized
Edit | Bookmark@del.icio.us | Trackback | No Comments »
August 10th, 2007, by Tim Finin, posted in Uncategorized
The LSDIS lab at the University of Georgia has released a new version of the SwetoDblp dataset. This has about 11M triples that capture the data in DBLP enriched with other datasets adding relationships to other entities including publishers, companies and universities. Rather than being a simple mapping of DBLP’s flat XML rendering, it’s based on a good ontology with lots of classes and individuals.
B. Aleman-Meza, F. Hakimpour, I.B. Arpinar, A.P. Sheth: SwetoDblp Ontology of Computer Science Publications, Web Semantics: Science, Services and Agents on the World Wide Web, 2007 (in Press)
It’s a great resource that we have used in collaboration with LSDIS with support from a joint NSF ITR project (ITR-IIS-0325464, ITR-IIS-0325172).
Edit | Bookmark@del.icio.us | Trackback | No Comments »
August 9th, 2007, by Tim Finin, posted in Uncategorized
The DARPA Trauma Pod project was featured at the 2007 DARPATech conference. We worked on a small part of the project, which was led by SRI and involved a large team of contractors and universities. The popular Mechanics article described this way.
“A single human will operate the robot remotely during surgery, but Trauma Pod will be able to perform a number of functions, such as fluid administration and surgical assistance, autonomously. The goal is to stabilize injured soldiers as quickly as possible, and previous Trauma Pod designs have included related systems that evacuate the patient. Giroir said that a prototype will be delivered to troops within two years.”
I found the Trauma Pod system to be a very impressive distributed system that comprised multiple autonomous robotic systems, multiple sensing systems, speech understanding, plan recognition and much more.
Edit | Bookmark@del.icio.us | Trackback | No Comments »
August 9th, 2007, by Tim Finin, posted in Uncategorized
Google’s splog classifier came up with an embarrassing false positive this week. It mistakenly thought that its own Google Custom Search Blog on blogspot.com was a splog and disabled it.
Robert McMillan of IDG News Service explained what happened next.
“Blogger’s spam classifier misidentified the Custom Search Blog as spam,” he said via e-mail on Wednesday. Typically Google notifies blog owners when it has spotted content associated with spam on their Web sites to give them a chance to clear up any misunderstandings. However, that didn’t work out in this case. “The Custom Search Blog bloggers overlooked their notification, and after a period of time passed, the blog was disabled.” When blogs are disabled like this, their URL becomes available to the general public. That’s when Srikanth swooped in and wrote the joke post. “It was a case of “URL squatting” and not a security issue or any kind of hack,” Carlson said. Google quickly realized its mistake and the Custom Search Blog is now back in action.
This was first noticed by Blogscoped, which has saved an image of the hack for posterity.
Edit | Bookmark@del.icio.us | Trackback | No Comments »
August 8th, 2007, by Tim Finin, posted in Uncategorized
Technology Review reports on Money Trouble in Second Life.
“There’s a long line of avatars waiting to use the automatic-teller machines for Ginko Financial, a virtual bank in the online game Second Life. For more than a week, account holders have been demanding their money back in what some folks are calling a bank run. Set off by high interest rates and a recent ban on in-game gambling, the bank run could ultimately have a major effect on the game’s economy. The theft of approximately $12,000 from the Second Life World Stock Exchange doesn’t help matters either.”
Unlike many MMOGs, Second Life has an active economy based on the Linden Dollar (L$) and encourages users to buy and sell game goods and real estate to one another. While the currency is fictional, TR reports
“But those dollars do have real-world value: players can buy or sell Linden dollars at a rate of about L$270 to $1 on the Lindex market. Second Life’s website even boasts that “thousands of residents are making part or all of their real life income from their Second Life businesses.”
Edit | Bookmark@del.icio.us | Trackback | No Comments »
August 5th, 2007, by Tim Finin, posted in Uncategorized
NSF is sponsoring a workshop on next generation data mining and cyber-enabled discovery for innovation 10-12 October 2007 in Baltimore’s Inner Harbor.
“This National Science Foundation workshop on Next Generation Data Mining and Cyber Enabled Discovery for Innovation (NGDM’07) will bring together data mining researchers, scientists, and engineers from a diverse background along with domain experts for various emerging problems that are relevant to Cyber Enabled Discovery for Innovation (CDI). The objective is to enhance the understanding of the research problems and facilitate creating an environment for better understanding the challenges in front of the data mining and the CDI community.”
The workshop will focus on five areas
- Data mining in e-science and engineering
- Media, pervasive computing, and ubiquitous data mining
- Data mining in security surveillance, and privacy protection
- Social science, finance, digital humanities, and data mining
- The Web, semantics, and data mining
and seeks submissions of relevant five page extended abstract from data mining researchers and practitioners by 3 September 2007.
Edit | Bookmark@del.icio.us | Trackback | No Comments »
August 2nd, 2007, by Tim Finin, posted in Uncategorized
Dan Karran blogs about a talk by Michael Jones (CTO of Google Earth) at the State of the Map conference, in which Jones describes how they are capturing new data for Google Earth via crowdsourcing.
“Now, everything you see here was created by people in Hyderabad. We have a pilot program running in India. We’ve done about 50 cities now, in their completeness, with driving directions and everything – completely done by having locals use some software we haven’t released publicly to draw their city on top of our photo imagery.”
A podcast of the talk is available.
O’reilly radar described it this way
“Google has been sending GPS kits to India that enable locals to make more detailed maps of their area. After the data has been uploaded and then verified against other participant’s data it becomes a part of the map. The process is very reminiscent of what Open Street Map, the community map-building project, has been doing. The biggest difference is that the data (to my knowledge) is owned by Google and is not freely available back to the community like it is with OSM.”
Edit | Bookmark@del.icio.us | Trackback | No Comments »
August 2nd, 2007, by Tim Finin, posted in Uncategorized
Ebiquity PhD student Akshay Java has been collecting Twitter data since March 2007 and has just written a paper that analyzes the subset from April and May of this year.
Why We Twitter: Understanding Microblogging Usage and Communities, Akshay Java, Xiaodan Song, Tim Finin, and Belle Tseng, Joint 9th WEBKDD and 1st SNA-KDD Workshop, August 2007.
His dataset included about 1350K posts from over 75K users. The paper covers a lot of the standard statistics you would expect — usage trends, basic network properties, top hubs and authorities, community structure, and geographic distribution. Akshay’s title pays homage to the early paper that asked why we blog, but the title also reflects the paper’s key contribution — an attempt to tease out the user’s intention in writing a tweet, i.e. to analyze why people are using Twitter.
Edit | Bookmark@del.icio.us | Trackback | 1 Comment »
August 1st, 2007, by Tim Finin, posted in Uncategorized
An article in CIO Insight, Google wary of behavioral targeting in online ads, quotes Google VP Susan Wojcicki as saying that Google does not plan to do behavioral targeting — building up a profile of a user’s interests over time to better select advertisements for them to see.
“Google Inc. is looking to find more links between the searches its users do in order to better target advertising, but the company is reluctant to go much further than that in tracking their behavior.”
Google is working on improving relevance by looking at what might be thought of as search two grams.
“Google has been testing for several weeks a new advertising feature that delivers ads based not simply on a specific search term, but also on the immediately previous search, she said. A user who types “Italy vacation” into a Google search box might see ads about Tuscany or cheap flights to Europe. Were the same user to subsequently search for “weather,” Google will assume there is a link between “Italy vacation” and “weather” and deliver ads tied to local weather conditions in Italy.”
This is a straightforward idea that can be done without building user models or profiles.
Edit | Bookmark@del.icio.us | Trackback | No Comments »
|  |
|  |