UMBC ebiquity

PROB: A Tool for Tracking Provenance and Reproducibility of Big Data Experiments

Speaker: Vladimir Korolev

Start: Monday, February 10, 2014, 10:00AM

End: Monday, February 10, 2014, 11:30AM

Location: 346 ITE


Reproducibility of computations and data provenance are important goals for improving the quality of one's research. Unfortunately, despite past efforts, it is still very hard to reproduce computational experiments with a high degree of certainty. The Big Data phenomenon of recent years makes this goal even harder to achieve. In this work, we propose a tool that helps researchers improve the reproducibility of their experiments by keeping provenance records automatically.

Tags: provenance, big data