Trusted Provenance of Collaborative, Adaptive, Process-Based Data Processing Pipelines

Ludwig Stage*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

18 Downloads (Pure)

Abstract

The abundance of data nowadays provides a lot of opportunities to gain insights in many domains. Data processing pipelines are one of the ways used to automate different data processing approaches and are widely used by both industry and academia. In many cases data and processing are available in distributed environments and the workflow technology is a suitable one to deal with the automation of data processing pipelines and support at the same time collaborative, trial-and-error experimentation in term of pipeline architecture for different application and scientific domains. In addition to the need for flexibility during the execution of the pipelines, there is a lack of trust in such collaborative settings where interactions cross organisational boundaries. Capturing provenance information related to the pipeline execution and the processed data is common and certainly a first step towards enabling trusted collaborations. However, current solutions do not capture change of any aspect of the processing pipelines themselves or changes in the data used, and thus do not allow for provenance of change. Therefore, the objective of this work is to investigate how provenance of workflow or data change during execution can be enabled. As a first step we have developed a preliminary architecture of a service – the Provenance Holder – which enables provenance of collaborative, adaptive data processing pipelines in a trusted manner. In our future work, we will focus on the concepts necessary to enable trusted provenance of change, as well as on the detailed service design, realization and evaluation.

Original languageEnglish
Title of host publicationEnterprise Design, Operations, and Computing. EDOC 2023 Workshops - IDAMS, iRESEARCH, MIDas4CS, SoEA4EE, EDOC Forum, Demonstrations Track and Doctoral Consortium, 2023, Revised Selected Papers
EditorsTiago Prince Sales, Sybren de Kinderen, Henderik A. Proper, Luise Pufahl, Dimka Karastoyanova, Marten van Sinderen
Place of PublicationCham
PublisherSpringer
Pages363-370
Number of pages8
ISBN (Electronic)978-3-031-54712-6
ISBN (Print)978-3-031-54711-9
DOIs
Publication statusPublished - 2-Mar-2024
Eventseveral workshops, EDOC Forum and Demonstrations and Doctoral Consortium track, which were held in conjunction with 27th International Conference on Enterprise Design, Operations, and Computing, EDOC 2023 - Groningen, Netherlands
Duration: 30-Oct-20233-Nov-2023

Publication series

NameLecture Notes in Business Information Processing
Volume498 LNBIP
ISSN (Print)1865-1348
ISSN (Electronic)1865-1356

Conference

Conferenceseveral workshops, EDOC Forum and Demonstrations and Doctoral Consortium track, which were held in conjunction with 27th International Conference on Enterprise Design, Operations, and Computing, EDOC 2023
Country/TerritoryNetherlands
CityGroningen
Period30/10/202303/11/2023

Keywords

  • Collaborative Processes
  • Data Processing Pipelines
  • Provenance of ad-hoc workflow change
  • Provenance of Change
  • Reproducibility
  • Trust
  • Workflow evolution provenance

Fingerprint

Dive into the research topics of 'Trusted Provenance of Collaborative, Adaptive, Process-Based Data Processing Pipelines'. Together they form a unique fingerprint.

Cite this