Provenance Data
Provenance schema and requirements
ProvenanceIF5 gives a brief overview of the provenance discussions at Integration Fest 5 (Manchester Sept 2003).
IfVProvenancePlan describes current activities in this area.
ProvenanceOutline captures ideas based on experience up to IF5 for input into the
InformationModel (a provenance schema) and provenance requirements for the mIR.
Graves Disease Provenance examples from IF4 (ISMB demo)
The myGrid integration fest in June 2003 (IF4) was based around preparing integrating components for a demo at ISMB. The
VideoStore contains a series of videos based on the
GravesDisease scenario. In some of these the provenance record is premature. This is due to a refresh mistiming between the workbench and the workflow enactor.
workflow provenance records
The following workflow provenance records have been extracted from the myGrid Information Repository (mIR). They are those included in the
VideoStore ISMB videos, and manually processed through the same XSL stylesheet to convert the XML into HTML. You can use the
Taverna Workflow Instance Id to check the file with the corresponding video.
provenance from mIR metadata
In addition to the workflow provenance records above, we might also be interested in more general questions. For example, what workflows have been run within the
GravesDisease scenario and when. This sort of experimental level metadata is held in the myGrid Information Repository (MIR). The following are example views over the MIR.
These provide a different view on the same workflow enactments (runs) as the ISMB videos in the
VideoStore and the provenance records above.
Concrete Provenance Examples
GDProvenanceExample documents the workflow provenance recorded through an example workflow provenance record from the
GravesDisease scenario that was used for the ISBM demo in June 2003.
EmbossProvenanceExample documents the provenance recorded in myGrid 0.0 and 0.1 for an example
WSFL workflow. This example provides a basis for people to identify where extra provenance metadata could be useful.
For example provenance files see
ExampleEmbossWorkflow#Provenance.
myGrid Provenance Workshop, 29 Nov 2002
We held a workshop over Access Grid to explore issues of provenance in myGrid. Were were lucky to be joined from Edinburgh by Peter Buneman (see below).
Here are some the viewpoints the team brought to the meeting:
And here are the minutes:
Other Provenance Resources
Now at
- LucMoreau's PASOA project
- PASOA aims to investigate the concept of provenance and its use for reasoning about the quality and accuracy of data and services in the context of eScience. The problems of determining the origin of a result or deciding when results of analysis are no longer valid become important concerns in open Grid environment, where providers are dynamically organised in virtual organanisations to offer services to the community. In this context, provenance data is an annotation able to explain how a particular result has been derived.
Provenance in myGrid 0.0/0.1
There was a significant change in May-June 2003 when the workflow language changed from WSFL to Scufl. For details of earlier experiments see
ProvenanceInPrePrototype.
Provenance Documents
ProvenanceDocuments is a working area for contributions towards myGrid papers on provenance.
See also
Provenance section on public web site.
Some (incomplete) Provenance for this page