by Brianna Marshall, Digital Curation Coordinator
This is part two of a three-part series where I explore platforms for archiving and sharing your data. Read the first post in the series, focused on UW’s institutional repository, MINDS@UW.
To help you better understand your options, here are the areas I address for each platform:
- Background information on who can use it and what type of content is appropriate.
- Options for sharing and access
- Archiving and preservation benefits the platform offers
- Whether the platform complies with the forthcoming OSTP mandate
Dryad is a repository appropriate for data that accompanies published articles in the sciences or medicine. Many journals partner with Dryad to provide submission integration, which makes linking the data between Dryad and the journal easy for you. Pricing varies depending on the journal you are publishing in; some journals cover the data publishing charge (DPC) while others do not. Read more about Dryad’s pricing model or browse the journals with sponsored DPCs.
Sharing and access
Data uploaded to Dryad are made available for reuse under the Creative Commons Zero (CC0) license. There are no format restrictions to what you upload, though you are encouraged to use community standards if possible. Your data will be given a DOI, enabling you to get credit for sharing.
Archiving and preservation
According to the Dryad website, “Data packages in Dryad are replicated across multiple systems to support failover, improve access times, allow recovery from disk failures, and preserve bit integrity. The data packages are discoverable and backed up for long-term preservation within the DataONE network.”
The OSTP mandate requires all federal funding agencies with over $100 million in R&D funds to make greater efforts to make grant-funded research outputs more accessible. This will likely mean that data must be publicly accessible and have an assigned DOI (though you’ll need to check with your funding agency for the exact requirements). As long as the data you need to share is associated with a published article, Dryad is a good candidate for OSTP-compliant data: it mints DOIs and makes data openly available under a CC0 license.
Have additional questions or concerns about where you should archive your data? Contact us.
By Allan Barclay, Ebling Library
New Requirements to Make Work and Data More Transparent and Reusable
April 2015 – The National Science Foundation (NSF) recently released a set of public access requirements for researchers applying for grants with an effective date on or after January 2016. According to the plan, entitled Today’s Data, Tomorrow’s Discoveries, the objectives of increasing public-accessibility are to make research and data easier for other investigators and educational institutions to use, and spur innovation from these same communities.
The NSF sees these requirements as the “initial implementation” of a framework that will change and grow over time to include additional research products and degree of accessibility.
The scope of the plan is initially focused on four types of outcome products:
- Articles in peer-reviewed journals
- Papers accepted as part of juried conference proceedings
- Articles/juried papers in conference proceedings authored entirely or in part by NSF employees
- Data generated and curated as part of an NSF-required Data Management Plan (DMP).
Researchers who receive all or partial NSF funding will be required to
- Deposit either the version of record or final accepted peer-reviewed manuscript of these products in a public access compliant repository as designated by the NSF. At this time, the NSF has designated the Department of Energy’s PAGES (Public Access Gateway for Energy and Science) system as their designated repository.
- Make these outcome products freely available for download, reading and analysis no later than 12 months after initial publication.
- Provide a minimum level of machine-readable metadata with each product at the time of initial publication.
- Ensure the long-term preservation of products.
- Provide a unique persistent identifier to all products in the award annual and final reports.
The NSF expects that investigators will be able to deposit research products into the PAGES system by the end of the 2015 calendar year. Data underling journal article or conference paper findings should be deposited in a repository as specified by the publication or as described in the research proposal’s DMP.
Public access requirement specifics will be provided in future NSF documents and grant solicitations.
For more information on how these new requirements could affect your grant proposal, contact the solicitation’s Cognizant Program Officer or the UW-Madison’s Research Data Services.
by Lisa Abler, Assistant Scientist, Dept. of Comparative Biosciences, School of Veterinary Medicine
Image courtesy of Colleen Simon for opensource.com
Researchers are increasingly exposed to the concepts of research data sharing and open data. Funders, publishers, research institutions and possibly even colleagues are introducing these phrases. Often, the differences between these concepts can be confusing, and understanding how one or both could affect your research may be a mystery. The following provides definitions, some benefits to researchers and a list of resources for learning more about these topics, as well as why and how to implement them for your lab.
What is Research Data Sharing?
Research data sharing is the act of making your research data available to others for reuse. There are a number of aspects that factor into data sharing:
- Which data to share: raw data, processed data, both?
- How to share the data: lab meetings, scientific meetings, journal publication, online databases?
- With whom to share: coworkers, collaborators, peers, funders, the public?
- How soon to share: immediately, after ensuring your own lab’s publishing needs are met, never?
There are also restrictions that may apply to data sharing at many levels, from institutions to publishers to the federal government (e.g., privacy). Fortunately, there are resources to help navigate these restrictions. See what UW-Madison, Johns Hopkins University and the University of Oregon have to offer on this topic:
What is Open Data?
As defined by The Open Definition, “Open data and content can be freely used, modified, and shared by anyone for any purpose.” In essence, open data is an unrestricted mode of research data sharing. The most important points to consider when making data open include:
- Availability: Data must be available to anyone to use, with no restrictions based on person, group or undertaking
- Access and usability: Data must be accessible, preferably downloadable over the Internet, and must be available as a whole, in a reusable format, for a reasonable reproduction cost
- Licensing: Data must be made available as a whole and licensing must allow utilization without restrictions on use (i.e., in whole or in part), redistribution or modification
What are the benefits of sharing my data?
There are many potential benefits to sharing your research data:
- Increases recognition through citation of datasets
- Facilitates the exchange of ideas and sharing of expertise among peers
- May be required by research funders or publishers
- Increases visibility of and interest in your research, especially in a global research environment
- Provides evidence of research findings, as well as opportunities for verification or validation
- May allow opportunities for reciprocity; by sharing your data to further research, others may be willing to share with you
- Can encourage collaborations or co-authorships
- Can accelerate discovery if many qualified scientists are working on a common problem, particularly data analysis in complex fields
- Avoids experimental duplications, especially in the case of negative findings or failed experiments
- Sharing data is central to scientific progress, benefiting both research endeavors and the public
Where can I find more information?
Research Data Sharing
- Funding agencies
- Research Data Alliance (RDA) A group of members from academia, industry and government who, through Working Groups and Interest Groups, strive to enable data sharing across barriers, from geography to discipline to domain.
- UW-Madison Policies
- The Open Data Foundation A non-profit working to develop standards and open data solutions across many fields, with a focus on statistical data.
- Open Data Commons Offers legal tools to provide and utilize open data.
- Other Resources
- Sowing the Seed: Incentives and Motivations for Sharing Research Data, A Researcher’s Perspective (Knowledge Exchange, 2014)
A collection of case studies across five disciplines (and European countries) to identify barriers and enablers to research data sharing and to examine motivations and incentives that encourage data sharing. Includes recommendations for academic disciplines, research institutions and data repositories.
- Establishing Incentives and Changing Cultures to Support Data Access (Wellcome Trust, 2014)
An analysis involving reviews of previous reports, interviews, focus groups and a web survey to elucidate factors that help or hinder research data sharing and to indentify potential incentives to facilitate data access and sharing. Includes recommendations for research institutions and research leaders.
For University of Wisconsin researchers who rely on Department of Energy federal grants, the other shoe has dropped. To be precise, the DoE’s “shoe” or plan to increase access to the works and data of its federally-funded investigators is one of approximately thirteen plans many federal agencies will likely be announcing in the next several weeks. In February 2013, the White House Office of Science and Technology Policy issued a memo (known as the OSTP memo) that required all agencies that fund over $100 million in research annually to create a plan to allow greater public access to its’ researchers’ work and data after a 12 month embargo period. The Washington Post’s recent article on the announcement indicates that this particular plan is not without its detractors.
RDS will be covering the release of all OSTP Memo plans as they are announced.
In order to promote data sharing and use, PLOS has revised its Data Policy, effective March 1, 2014. All published articles will be required to include a Data Availability Statement, describing how authors will maintain compliance with PLOS’s data sharing policy. For more information see PLOS Editorial and Publishing Policies: Sharing of Data, Materials, and Software and the FAQs for Editorial Policy.
The California Digital Library (CDL) is conducting a survey to better understand researcher practices and perceptions around data publication. Their aim is to learn what a “data publication” should look like: what should accompany a dataset to make it re-useable, how should creators be credited, what do you expect from peer review of data? The results will help to shape the CDL’s efforts, and will be made publicly available for anyone to use.
If you are involved in research in any branch of the Sciences or Social Sciences, your input would be valuable, even if you have never shared or published data or given much thought to the issues involved. In addition, CDL would like you to consider sharing this survey with any of your colleagues who might be interested.
The survey can be taken anonymously in 5-10 minutes. It is available at: http://goo.gl/PuIVoC
Please join us for an event about open access publishing and open data:
Date: Thursday, October 24, 2013
Location: DeLuca Forum, Wisconsin Institutes for Discovery, 330 N. Orchard
Hours: 8:00 AM – 10:30 AM
Sponsors: UW-Madison Libraries, Research Data Services Group
More information: http://researchdata.wisc.edu/open-access-open-data/
Description: The libraries of the UW are committed to examining opportunities to engage faculty, staff, and students in discussions of research data management and public access compliance. This forum will explore current and emerging trends in research data and publication access, policy, preservation, management and discovery. We will discuss library efforts to assist faculty and staff in making research data and articles publicly accessible in the hope of accelerating research innovation and enriching the learning process.