One billion spam comments
March 23rd, 2007, by Tim Finin, posted in UncategorizedAkismet is closing in on identifying a billion spam comments. I hope they capture the one that puts them over the top for posterity.
| Building intelligent systems in open, heterogeneous, dynamic, distributed environments | 06 July 2008, 22:17:53 EDT ![]() |
|||
2007 March Archive for March, 2007One billion spam commentsMarch 23rd, 2007, by Tim Finin, posted in UncategorizedAkismet is closing in on identifying a billion spam comments. I hope they capture the one that puts them over the top for posterity. links for 2007-03-22March 22nd, 2007, by Tim Finin, posted in Uncategorized
Oracle 11g to support some OWL inferencingMarch 22nd, 2007, by Tim Finin, posted in UncategorizedI noticed in Seth Ladd’s Semergence blog that the next version of Oracle’s RDF Database (11g) is expected to have native inferencing for a subset of OWL. This is in addition to faster querying and bulk-loading and “new SQL operators for enhancing a relational query using an ontology”. See this thread in Oracle’s Semantic Technologies Forum. According to a recent presentation on 10g, the native OWL inferencing will include:
We’ve not yet tried 10g, but it’s on our short list of things to do. I guess this task just moved up in the list. links for 2007-03-21March 21st, 2007, by Tim Finin, posted in Uncategorized
Why the Semantic Web will fail, NOT.March 21st, 2007, by Tim Finin, posted in UncategorizedSlashdot has a post today titled Why the Semantic Web Will Fail that points to a post by Stephen Downes with the same name. His argument is based on the belief that “The Semantic Web will never work because it depends on businesses working together, on them cooperating.” He says:
While the argument Stephen makes is grounded in his distrust of corporations, his second point above is off the mark, at least for RDF. One of the features of the W3C’s model (based on RDF) is that it doesn’t push the idea that everyone should adopt the same vocabulary (or ontology) for a topic or domain. Instead it offers a way to publish vocabularies with some semantics, including how terms in one vocabulary relate to terms in another. In addition, the framework makes it trivial to publish data in which you mix vocabularies, making statements about a person, for example, using terms drawn from FOAF, Dublin Core and others. The RDF approach was designed with interoperability and extensibility in mind, unlike many other approaches. RDF is showing increasing adoption, showing up in products by Oracle, Adobe and Microsoft, for example. If this approach doesn’t continue to flourish and help realize the envisioned “web of data”, and it might not after all, it will have left some key concepts, tested and explored, on the table for the next push. IMHO, the ’semantic web’ vision — a web of data for machines and their users John Backus passes awayMarch 20th, 2007, by Tim Finin, posted in UncategorizedJohn Backus, inventor of FORTRAN and BNF, functional programming advocate and winner of the 1977 Turing Award, has passed away. He was 82. The New York Times published his obituary today. links for 2007-03-20March 20th, 2007, by Tim Finin, posted in Uncategorized
Who created the first blog?March 20th, 2007, by Tim Finin, posted in UncategorizedDeclan McCullagh and Anne Broache have an article on cnet that explores the question who created the first blog?.
The article mentions some who come immediately to mind:
and also explores some earlier roots, like finger and .plan files. Those last two are interesting connections and makes sense. Kind of. In my experience, the vast majority of people who used .plan files used them to document their generic schedules and availability, rather than to contemporaneously document their activities. It’s a good article, overall. Warsaw University wins 2007 ACM programming contestMarch 19th, 2007, by Tim Finin, posted in Uncategorized
The contest has been running since the 1970s has is generally recognized as the oldest, largest and most prestigious programming contest in the world. This year over 6000 teams began the multi-tiered competition with88 teams in the finals at Maihama Japan. See the final problems that the teams had to solve and the final team standings. How Google separates the blogs from the splogsMarch 19th, 2007, by Tim Finin, posted in UncategorizedGoogle’s patent application (filed 13 September 2005) for Ranking blog documents is being discussed around the web.
The Google Operating System blog has a nice summary of the features Google mentions as useful in separating the blogs from the splogs. No surprises here.
Spotted on Micro Persuasion. MSM Citations in Republican, Democrat BlogsMarch 19th, 2007, by Akshay Java, posted in UncategorizedA number of qualitative and quantitative analysis of Main Stream Media (MSM) sources have So, the question is — which MSM sources are going to play an important role on the Blogosphere during the election year? To analyze this we first look at the most cited MSM sources from the ICWSM dataset shown on the right (the complete list is here). Next, we use a list of 113 Republican and 144 Democrat blogs. This list was compiled using a data set provided by Dr. Lada Adamic and by querying Technorati. We count the number of citations for MSM in each of the sets. The MSM sources most frequently cited by democrats and Republican blogs is as follows:
Limitations
Conclusions MSM is influential and there are selective preferences of each community towards different sources. Some of the sources that are categorized under MSM in the dataset almost have a blog like quality. As people rely on blogs for information and opinions, the indirect influence that MSM sources (and perhaps, its biases) can not be ignored. While blogs and MSM seem to almost have a symbiotic relation, (IMHO) this election season might see a fierce competition between the two. [Acknowledgment: Buzzmetrics for the dataset, Dr. Lada Adamic for the Republican/Democrat labels] Honeyblogs lure suckers to known spam domainsMarch 19th, 2007, by Tim Finin, posted in UncategorizedOne interesting aspect of web spam is that the suckers include both web searchers and advertisers. The goal of spammers is to get the two together and watch the clicking. A new article in the NYT, Researchers Track Down a Plague of Fake Web Pages, discusses results by a team of researchers from Microsoft and UC Davis.
The researchers will present their paper on redirection spamming WWW-2007:
A related paper on spamming in forums (ppt presentation) also uses a “context-based” approach that focuses on link structure and redirection. | You are currently browsing the UMBC ebiquity weblog archives for March, 2007.   Home | Archive | Login | Feedpersonal mathematics
self help
social networks
RDF
Lori Drew
Privacy
success
life skills
networking
systems
Faviki
dbpedia
wikitology
Wikipedia
Business Intelligence
leaderless resistance
Estonia
Internet War
Policy
workshop
iswc
AI
Games
mobile computing
Twitter
advertising
Google
social networking
Semantic Web
Social media |
| About Us | Contact Us | Site Map | Legal | Privacy Copyright © 1999 - 2008 UMBC ebiquity research group. Copyright © 2003 - 2008 Site design and RGB engine code by Filip Perich. XG Page generated in 0.151 seconds. |