<?xml version="1.0"?>

<!DOCTYPE owl [
  <!ENTITY rdf "http://www.w3.org/1999/02/22-rdf-syntax-ns#">
  <!ENTITY rdfs "http://www.w3.org/2000/01/rdf-schema#">
  <!ENTITY xsd "http://www.w3.org/2001/XMLSchema#">
  <!ENTITY owl "http://www.w3.org/2002/07/owl#">
  <!ENTITY cc "http://web.resource.org/cc/#">
  <!ENTITY event "http://ebiquity.umbc.edu/ontology/event.owl#">
  <!ENTITY person "http://ebiquity.umbc.edu/ontology/person.owl#">
  <!ENTITY assert "http://ebiquity.umbc.edu/ontology/assertion.owl#">]>

<!--
  This ontology document is licensed under the Creative Commons
  Attribution License. To view a copy of this license, visit
  http://creativecommons.org/licenses/by/2.0/ or send a letter to
  Creative Commons, 559 Nathan Abbott Way, Stanford, California
  94305, USA.
-->

<rdf:RDF 
  xmlns:rdf = "&rdf;"
  xmlns:rdfs = "&rdfs;"
  xmlns:xsd = "&xsd;"
  xmlns:owl = "&owl;"
  xmlns:cc = "&cc;"
  xmlns:event = "&event;"
  xmlns:person = "&person;"
  xmlns:assert = "&assert;">
  <event:Event rdf:about="http://ebiquity.umbc.edu/event/html/id/355/Clustering-short-status-messages-a-topic-model-based-approach">
    <rdfs:label><![CDATA[Clustering short status messages: a topic model based approach]]></rdfs:label>
    <event:title><![CDATA[Clustering short status messages: a topic model based approach]]></event:title>
    <event:speaker><person:MSAlumnus rdf:about="http://ebiquity.umbc.edu/person/html/Anand/Karandikar/"><person:name><![CDATA[Anand  Karandikar]]></person:name><rdfs:label><![CDATA[Anand  Karandikar]]></rdfs:label></person:MSAlumnus></event:speaker>
    <event:startDate rdf:datatype="&xsd;dateTime">2010-07-26T09:00:00-05:00</event:startDate>
    <event:endDate rdf:datatype="&xsd;dateTime">2010-07-26T11:00:00-05:00</event:endDate>
    <event:location><![CDATA[ITE 325b, UMBC]]></event:location>
    <event:abstract><![CDATA[<p>Recently, there has been an exponential rise in the use of online social media systems like Twitter and Facebook. Even more usage has been observed during events related to natural disasters, political turmoil or other such crises. Tweets or status messages are short and may not carry enough contextual clues. Hence, applying traditional natural language processing algorithms on such data is challenging. Topic model is a popular method for modeling term frequency occurrences for documents in a given corpus. A topic basically consists of set of words that co-occur frequently. Unsupervised nature allows topic models to be trained easily on datasets meant for specific domains.</p>

<p>We use the topic modeling feature of the MALLET machine learning tool kit to generate topic models from unlabelled data. We propose a way to cluster tweets by using the topic distributions in each tweet. We address the problem of determining which topic model is optimal for clustering tweets based on its clustering performances. We also demonstrate a use case wherein we cluster twitter users based on the content they tweet. We back our research with experimental results and evaluations.</p>

Committee Members: <ul> <li>Dr. Tim Finin (Chair)</li> <li>Dr. Anupam Joshi</li> <li>Dr. Charles Nicholas</li> </ul>
]]></event:abstract>
    <event:tag><![CDATA[topic model]]></event:tag>
    <event:tag><![CDATA[social media]]></event:tag>
    <event:tag><![CDATA[twitter]]></event:tag>
    <event:host><person:PrincipalFaculty rdf:about="http://ebiquity.umbc.edu/person/html/Tim/Finin/"><person:name><![CDATA[Tim  Finin]]></person:name><rdfs:label><![CDATA[Tim  Finin]]></rdfs:label></person:PrincipalFaculty></event:host>
  </event:Event>

  <rdf:Description rdf:about="">
    <cc:License rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
  </rdf:Description>

</rdf:RDF>

