Monica Duke has expertise in the areas of resource discovery and data management, particularly metadata, use of repositories and data citation. She works at UKOLN, in the UK. UKOLN is a centre of expertise in digital information management based at the University of Bath. She has worked on projects funded by the Joint Information Systems Committe (JISC) in the UK for the last 10 years, including the eBank UK project from 2004-2007. eBank UK was an early example of a project working with researchers to investigate the management of data, applying digital library principles and infrastructures to support scholarly communication. More recently, Monica was project manager for SageCite, a project funded by the JISC for one year from August 2010.
SageCite has investigated the application of citation in the context of Sage Bionetworks, a US-based not for profit organisation dedicated to a collaborative approach to building data sharing infrastructure in the biosciences. Sage Bionetworks has a mission to improve disease through community-based modelling of large multi-contributor datasets, creating maps that describe the underpinnings of disease and predict the behaviour of complex systems.
The SageCite project has undertaken various strands of work. In this presentation, due to time constraints, I will focus on describing typical data and processing activities at Sage Bionetworks as a use case. I will describe the demonstrator from the SageCite project, in which the workflow tool Taverna has been used to make explicit the data sources and methods used in Sage Bionetworks pipelines. The demonstrator applies the principle that capturing and documenting data processes through workflow tools supports better citation, and makes the cited resource more re-usable, strengthening the reproduciblity and validationnof the research.