Blogvox2: A Modular Domain Independent Sentiment Analysis System
Thursday, June 7, 2007, 13:00pm - Thursday, June 7, 2007, 15:00pm
325b ITE
In this paper, I propose Blogvox2: an information retrieval based modular domain independent sentiment analysis framework that uses customized pattern matching techniques, nave Bayesian filter, bag of words and part of speech tagging techniques for opinion extraction in blogs. We also developed prototype two-panel and four-panel search applications of the query results. Also, trends on the hot and top topics on the opinionated sentences are analyzed.
By this framework, the benefits of Blogvox2 we created a modular approach provides a platform where new modules for different domains can be easily plugged in. The framework provides the date of publishing, permanent link and the URLs of the sentences that expresses opinions based on the analysis. Additionally, tools for trend analysis for obtaining the hot and top topic identification graphics based on the obtained opinionated sentences for presented.
Based on the analysis of the blogvox2 on political domain, our system performs well with Unigram approach. We investigated our framework with pattern matching techniques, bigram techniques, and incorporating parts of speech tagging, which haven't fared as well as unigram techniques, although combining the unigram and bigram techniques performed similar to the unigram approach. We also investigated the reasons for the performance degradation or enhancements on each approach. Based on our analysis, we also developed different applications the ease of using the framework.