OPQLPig: A Seamless Synergy between Pig and Provenance Query for Big Data
October 19, 2016
Track: General

We propose and design a framework for storing large provenance datasets in HDFS and querying them using OPQL, Pig, and Hadoop. We extend OPQL, a graph-level provenance query language, to support W3C PROV-DM, the standard provenance model; we propose algorithms to translate OPQL constructs into equivalent Pig Latin programs; and we develop and evaluate our OPQLPig solution on provenance datasets of UTPB.

Speaker(s)