Representing Probabilistic Relations in RDF

Talk at UMBC

2005-03-07

Yoshio Fukushige

W3C Fellow / Matsushita Electric Industrial Co., Ltd.(Panasonic)

fuku@w3.org / fukushige.yoshio@jp.panasonic.com

http://www.w3.org/2005/03/07-yoshio-UMBC/

Overview of this talk

Today's talk consists of:

Self Introduction

my picture
Name Yoshio Fukushige 福重 貴雄
Affiliate Network Systems Development Center,
Matsushita Electric Industrial Co., Ltd.
fukushige.yoshio@jp.panasonic.com
W3C fellow, based at Keio Univ.
fuku@w3.org
Research domain Semantic Web (cf. RDF Data Access Working Group), Text classification/clustering, Application of Bayesian Network

Representing Probabilistic Relations in RDF

The aim of my ongoing work is to provide

and to provide them as a strawman for standard / best practice in processing uncertain information in RDF.

Overall Framework

  1. Represent probabilistic relations in a RDF graph
  2. Transform it into a Bayesian Network(BN) and store in a BN reasoner
  3. Transform the observation graph (when provided) and feed it to the reasoner
  4. Transform the query and feed it to the reasoner
  5. Get the reasoning result, transform and answer back

(Provide as a Web Service?)

Overall Framework

An Example for Discussion

Bayesian Network for the Metastatic Cancer problem

MetastaticCancer
TRUEFALSE
0.200.80
SerumCalcium
TRUE FALSE
Metastatic
Cancer
TRUE0.800.20
FALSE0.200.80
BrainTumor
TRUEFALSE
Metastatic
Cancer
TRUE0.200.80
FALSE0.050.95
BrainTumor Coma
TRUEFALSE
Serum
Calcium
TRUETRUE0.800.20
FALSE0.800.20
FALSETRUE0.800.20
FALSE0.050.95
HeadAche
TRUEFALSE
Brain
Tumor
TRUE0.800.20
FALSE0.600.40

Vocabulary for Representing Probabilistic Relations

A node in BN is represented as a node of type prob:ProbabilisticStatements.

RDF graph for ProbabilisticRelations

Events as N-ary relations

The targets of description are probabilistic relations between events. An event is represented as a prob:Clause node. (prob:Event, prob:State or prob:Proposition are better namings?) A prob:Clause node has a predicate node and role nodes (all are instances).

e.g. Clause node with * represents a High Serum Calcium event

RDF graph for High Serum Calcium event

Representing P(SerumCalcium | MetastaticCancer)

RDF graph for P(SerumCalcium|MetastaticCancer)
SerumCalcium
TRUE FALSE
Metastatic
Cancer
TRUE 0.80 0.20
FALSE 0.20 0.80

Click here to see a representation in N3 format

Probabilistic Observation

e.g. Mary has a headache with probability of 70%.

RDF graph for Mary's headache

Resultant Beliefs

e.g. Mary has a metastatic cancer with probability of 9.7%.

RDF graph for Mary's metastatic cancer

Current Status and Issues

I have examined the vocabulary needed for representing probabilistic relations in RDF, and now am writing code for transformation to BN. Query language and protocol are yet to be examined.

Remaining issues are among others,

Comparison with BayesOWL

  My approach BayeseOWL
main target Causal relationship (RDF) Ontological relationship (OWL)
basic proposition prob:Clause Variable (?)
conditions are attached to prob:ProbabilisticStatements (distribution) CondProb (case in distribution)
probability prob:Probability literal

How can these approaches help each other?

Some Questions on BayesOWL

Appendix

Why not use a predicate for a prob:Close?

In order not to make it a ground statement (= which is said to be real). The graph below says that Mary really has high serum calcium

A graph for a ground triples

Why the predicate node is an instance node, not a class node?

Suppose Hans may have dropped an ax. Clause 1 claims that it is a golden ax, and clause 2 claims that it is a silver ax. Both agree that he may have dropped something.

RDF graph for two claims on what Hans dropped

Valid XHTML 1.0! Valid CSS!