Towards Sustainable Curation and Preservation: The SEAD Project's Data Services Approach

When the effort to curate and preserve data is made at the end of a project, there is little opportunity to leverage ongoing research work to reduce curation costs or conversely, to leverage curation efforts to improve research productivity. In the Sustainable Environment Actionable Data (SEAD) project, we have envisioned a more active approach to data curation and preservation in which these processes occur in parallel with research and generate sufficient short and long-term return on researcher investments for self-interest to drive their adoption. In this paper, we describe the conceptual framework motivating the SEAD project and the suite of data services we have developed and deployed as an initial implementation of this approach. Use cases in which these services can reduce curation effort and aid ongoing research are highlighted and, based on our experience to date, we identify some key architectural features of our approach as well as open challenges to fully realizing the value of this approach in the broad ecosystem of cyberinfrastructure.

[1]  Inna Kouper,et al.  SEAD: An Integrated Infrastructure to Support Data Stewardship in Sustainability Science , 2013 .

[2]  Inna Kouper,et al.  SEAD Virtual Archive: Building a Federation of Institutional Repositories for Long-Term Data Preservation in Sustainability Science , 2013, Int. J. Digit. Curation.

[3]  Deborah L. McGuinness,et al.  Parallel Identities for Managing Open Government Data , 2012, IEEE Intelligent Systems.

[4]  Carole A. Goble,et al.  An Identity Crisis in the Life Sciences , 2006, IPAW.

[5]  James D. Myers,et al.  Adapting the electronic laboratory notebook for the semantic era , 2005, Proceedings of the 2005 International Symposium on Collaborative Technologies and Systems, 2005..

[6]  Sean Bechhofer,et al.  Research Objects: Towards Exchange and Reuse of Digital Knowledge , 2010 .

[7]  Joe Futrelle,et al.  Medici : A Scalable Multimedia Environment for Research , 2011 .

[8]  David R. Karger The Semantic Web and End Users: What's Wrong and How to Fix It , 2014, IEEE Internet Computing.

[9]  Ruth E. Duerr,et al.  The Data Conservancy Instance: Infrastructure and Organizational Services for Research Data Curation , 2012, D Lib Mag..

[10]  Jonathan W. Essex,et al.  CombeChem: A Case Study in Provenance and Annotation Using the Semantic Web , 2006, IPAW.

[11]  James D. Myers I Think Therefore I Am Someone Else: Understanding the Confusion of Granularity with Continuant/Occurrent and Related Perspective Shifts , 2010, IPAW.

[12]  Quan Zhou,et al.  Komadu: A Capture and Visualization System for Scientific Data Provenance , 2015 .

[13]  James D. Myers,et al.  Semantic middleware for e‐Science knowledge spaces , 2011, Concurr. Comput. Pract. Exp..

[14]  Carole Goble,et al.  myExperiment – A Web 2.0 Virtual Research Environment , 2007 .

[15]  Ellen J. Cramer,et al.  VIVO: Enabling National Networking of Scientists , 2010, IASSIST.