2011-02-08T11:00:00-05:00 2011-02-08T12:00:00-05:00 Reproducibility of results is a key tenet of science. Some modern scientific domains, such as Earth Science, have become computationally complicated and, particularly with the advent of higher resolution space based remote sensing platforms, tremendously data intensive. Over the last few decades, these complexities along with the the rapid advancement of the state of the art confound the goal of scientific transparency.

We explore concepts of data identification, organization, equivalence and reproducibility for such data intensive scientific processing. We present a conceptual model useful for describing and representing data provenance suitable for very precise data and processing identification. We present a scheme for creating and maintaining identifiers for precise dataset membership and provenance equivalence at various degrees of granularity and data aggregation.

Application of this model will allow more specific data citations in scientific literature based on large datasets and data provenance equivalence. Our provenance representations will enable independent reproducibility required by scientific transparency. Increasing transparency will contribute to understanding, and ultimately, credibility of scientific results.

]]>

<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:cc="http://web.resource.org/cc/#" xmlns:event="http://ebiquity.umbc.edu/ontology/event.owl#" xmlns:person="http://ebiquity.umbc.edu/ontology/person.owl#" xmlns:assert="http://ebiquity.umbc.edu/ontology/assertion.owl#">

<event:Event rdf:about="http://ebiquity.umbc.edu/event/html/id/380/Enabling-Reproducibility-of-Scientific-Data-Flows-with-Provenance-Equivalence">

<rdfs:label>

<![CDATA[ Enabling Reproducibility of Scientific Data Flows with Provenance Equivalence ]]>

</rdfs:label>

<event:title>

<![CDATA[ Enabling Reproducibility of Scientific Data Flows with Provenance Equivalence ]]>

</event:title>

<event:speaker>

<person:PhDAlumnus rdf:about="http://ebiquity.umbc.edu/person/html/Curt/Tilmes">

<person:name>

<![CDATA[ Curt Tilmes ]]>

</person:name>

<rdfs:label>

<![CDATA[ Curt Tilmes ]]>

</rdfs:label>

</person:PhDAlumnus>

</event:speaker>

<event:startDate rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2011-02-08T11:00:00-05:00</event:startDate>

<event:endDate rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2011-02-08T12:00:00-05:00</event:endDate>

<event:location>

<![CDATA[ 325 ITE, UMBC ]]>

</event:location>

<event:abstract>

<![CDATA[ Reproducibility of results is a key tenet of science. Some modern scientific domains, such as Earth Science, have become computationally complicated and, particularly with the advent of higher resolution space based remote sensing platforms, tremendously data intensive. Over the last few decades, these complexities along with the the rapid advancement of the state of the art confound the goal of scientific transparency. We explore concepts of data identification, organization, equivalence and reproducibility for such data intensive scientific processing. We present a conceptual model useful for describing and representing data provenance suitable for very precise data and processing identification. We present a scheme for creating and maintaining identifiers for precise dataset membership and provenance equivalence at various degrees of granularity and data aggregation. Application of this model will allow more specific data citations in scientific literature based on large datasets and data provenance equivalence. Our provenance representations will enable independent reproducibility required by scientific transparency. Increasing transparency will contribute to understanding, and ultimately, credibility of scientific results. ]]>

</event:abstract>

<event:tag>

<![CDATA[ provenance ]]>

</event:tag>

<event:tag>

<![CDATA[ scientific computing ]]>

</event:tag>

<event:tag>

<![CDATA[ semantic web ]]>

</event:tag>

<event:host>

<person:Faculty rdf:about="http://ebiquity.umbc.edu/person/html/Yelena/Yesha">

<person:name>

<![CDATA[ Yelena Yesha ]]>

</person:name>

<rdfs:label>

<![CDATA[ Yelena Yesha ]]>

</rdfs:label>

</person:Faculty>

</event:host>

</event:Event>

<rdf:Description rdf:about="">

<cc:License rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>

</rdf:Description>

</rdf:RDF>