Datenschutzerklärung|Data Privacy

A. Borusan

06.03.2013, 16 Uhr c.t. TU Berlin, EN building, seminar room EN 719 (7th floor), Einsteinufer 17, 10587 Berlin: "On the evolving role of workflow middleware to address the challenges of big data science" (Paolo Missier, Newcastle University)

The use of workflow has slowly been gaining popularity as a high level programming model for e-science. Some of the reasons for its limited uptake by the scientific community have been analyzed in a recent study 1. These tie in with the ongoing debate on the limits and opportunities of data and method sharing in science, and are addressed by evolving requirements for workflow repositories.

From a computing perspective, on the other hand, workflow technology has all the appeals (and perhaps the limitations) of middleware: a potential for exploiting parallelism in the data in a way that is transparent to users, portability over multiple computing infrastructures, and ancillary services such as capturing and querying execution histories (provenance).

In this talk I will briefly discuss these capabilities, using two workflow management systems, Taverna and e-Science Central, and a case study in Chemical Engineering as a reference. Taverna has been used successfully for the past ten years mainly in bioinformatics, while the more recent e-Escience Central has been positioning itself as a tool for big data analytics which offers good scalability over a cloud infrastructure.