UMBC ebiquity

memeta

Status: Past project

Project Description:

Weblogs, or blogs, have become an important new way to publish information, engage in discussions, and form communities. The memeta project is developing a framework for representing and studying the structure and content of communities of blogs. We are particularly interested in how metadata about blogs can be extracted, discovered, and computed, and in how that metadata can be used to analyze blogs and to provide new blog-related services.

Concrete problems we hope to solve include distinguishing blogs from non-blogs; recognizing spam blogs (splogs); recognizing comment spam and trackbacks; categorizing and clustering blogs; recommending blogs to people; modeling trust relationships in blog communities; and spotting trends in blog communities.

memeta's blog database is driven by a custom blog crawler that collects information on over six million blogs.

Start Date: March 2005

End Date: December 2008

Principal Investigator:
Anupam Joshi

Students:
Akshay Java
Pranam Kolari

Collaborators:
James Mayfield
Tim Oates

Tags: blog, splog, learning, text mining

 

There are 5 associated publications:

5 Refereed Publications

2006

1. Pranam Kolari et al., "Blog Track Open Task: Spam Blog Classification", InCollection, TREC 2006 Blog Track Notebook, November 2006, 6928 downloads.

2. Pranam Kolari et al., "Detecting Spam Blogs: A Machine Learning Approach", InProceedings, Proceedings of the 21st National Conference on Artificial Intelligence (AAAI 2006), July 2006, 9718 downloads.

3. Pranam Kolari et al., "Characterizing the Splogosphere", InProceedings, Proceedings of the 3rd Annual Workshop on Weblogging Ecosystem: Aggregation, Analysis and Dynamics, 15th World Wide Web Conference, May 2006, 6343 downloads.

4. Pranam Kolari et al., "SVMs for the Blogosphere: Blog Identification and Splog Detection", InProceedings, AAAI Spring Symposium on Computational Approaches to Analysing Weblogs, March 2006, 11306 downloads.

5. Pranam Kolari et al., "Memeta: A Framework for Multi-Relational Analytics on the Blogosphere", InProceedings, AAAI 2006 Student Abstract Program, February 2006, 3827 downloads.

 

There are 0 associated resources.

 

Research Areas:
 Language technology
 Security, Trust and Privacy
 Semantic Web
 Web based information systems
 Web services