Search Results for author: Marta Mattoso

Found 3 papers, 1 papers with code

Workflow Provenance in the Lifecycle of Scientific Machine Learning

no code implementations30 Sep 2020 Renan Souza, Leonardo G. Azevedo, Vítor Lourenço, Elton Soares, Raphael Thiago, Rafael Brandão, Daniel Civitarese, Emilio Vital Brazil, Marcio Moreno, Patrick Valduriez, Marta Mattoso, Renato Cerqueira, Marco A. S. Netto

We contribute with (i) characterization of the lifecycle and taxonomy for data analyses; (ii) design principles to build this view, with a W3C PROV compliant data representation and a reference system architecture; and (iii) lessons learned after an evaluation in an Oil & Gas case using an HPC cluster with 393 nodes and 946 GPUs.

BIG-bench Machine Learning

Provenance Data in the Machine Learning Lifecycle in Computational Science and Engineering

no code implementations9 Oct 2019 Renan Souza, Leonardo Azevedo, Vítor Lourenço, Elton Soares, Raphael Thiago, Rafael Brandão, Daniel Civitarese, Emilio Vital Brazil, Marcio Moreno, Patrick Valduriez, Marta Mattoso, Renato Cerqueira, Marco A. S. Netto

To handle this problem, in this paper we contribute with a detailed characterization of provenance data in the ML lifecycle in CSE; a new provenance data representation, called PROV-ML, built on top of W3C PROV and ML Schema; and extensions to a system that tracks provenance from multiple workflows to address the characteristics of ML and CSE, and to allow for provenance queries with a standard vocabulary.

BIG-bench Machine Learning

BioWorkbench: A High-Performance Framework for Managing and Analyzing Bioinformatics Experiments

1 code implementation11 Jan 2018 Maria Luiza Mondelli, Thiago Magalhães, Guilherme Loss, Michael Wilde, Ian Foster, Marta Mattoso, Daniel S. Katz, Helio J. C. Barbosa, Ana Tereza R. Vasconcelos, Kary Ocaña, Luiz M. R. Gadelha Jr

This framework automatically collects provenance data, including both performance data from workflow execution and data from the scientific domain of the workflow application.

Distributed, Parallel, and Cluster Computing Databases

Cannot find the paper you are looking for? You can Submit a new open access paper.