UW-Madison Research Data Services http://researchdata.wisc.edu Wed, 25 Mar 2015 16:57:27 +0000 en-US hourly 1 http://wordpress.org/?v=4.1 Get to Know the RDS Team: Brianna Marshall http://researchdata.wisc.edu/news/get-to-know-the-rds-team-brianna-marshall/ http://researchdata.wisc.edu/news/get-to-know-the-rds-team-brianna-marshall/#comments Sun, 22 Mar 2015 16:43:02 +0000 http://researchdata.wisc.edu/?p=5384 [...]]]> In this series, we introduce the team members who make up Research Data Services (RDS). This interview is with Brianna Marshall, RDS Chair and Digital Curation Coordinator at the General Library System.

Describe your role at the General Library System.

My position is a newly created role meant to explore the library’s role in data services on campus. In a nutshell, I lead Research Data Services and manage UW’s institutional repository, MINDS@UW. A lot of my job is strategizing for the future – where are we now? where do we need to be? – and trying to gather necessary resources.

What’s the most interesting project you’ve worked on recently?

It’s hard to pick just one, especially because one of my favorite parts about my job is how varied my days are! On any given day I will be doing a consultation with a researcher, giving a presentation, or tinkering with the repository. One current project that has captured my attention is a toolkit RDS is developing to pinpoint tools that support research data management. We are still in the prototyping phase but I am incredibly excited about having the chance to really clarify what tools are out there for different aspects of data management.

What excites you about supporting research data management on campus?

The University of Wisconsin is one of the top-tier research institutions in the country. Being in a position to help support that incredible research is a big deal to me. I’ll be the first to admit that effective data management can quickly become overwhelming, so I love having the chance to say, “Here are three small steps you can implement today that will make things better.” As a native Wisconsinite, I’m also a proud advocate of the Wisconsin Idea – I know that the research coming out of UW benefits the entire state.

If you had an unlimited budget, what would you institute on campus?

Without a doubt I would invest in a data repository. Understandably, this is a huge financial investment that would need to be driven by campus. In my mind it’s an appropriate middle ground between our existing repository, MINDS@UW, which is well-suited for publications, and the storage and backup options offered through DoIT. There needs to be an extra layer that allows for critical research data generated at UW to be archived and made discoverable. This will help researchers comply with federal funding mandates and allow UW to remain involved in an important piece of the research process.

Do you have a favorite UW building or landmark?

I’m partial to Science Hall. I was lucky enough to get a behind-the-scenes tour from the building manager when I started my job, and boy is it an interesting place. From the massive topographic maps on each floor to the old anatomy department (where they used to push bodies down a slide located in one of the towers!) to the secret attic, there’s a lot of intrigue there. If I ever write my mystery novel, I think I know the setting!

What do you like to do outside of work?

I make things: I’m a quilter, scrapbooker, and photographer. I enjoy seeking out adventure whenever possible. In the image included in this post I’m at the Apostle Islands Ice Caves in my natural state: behind the camera.

Do you have a question for Brianna or the rest of the RDS team? Contact us today.

]]>
http://researchdata.wisc.edu/news/get-to-know-the-rds-team-brianna-marshall/feed/ 0
Report: NISO Conference on Scientific Data Management http://researchdata.wisc.edu/news/report-niso-conference-scientific-data-management-caring-for-your-institution-and-its-intellectual-wealth/ http://researchdata.wisc.edu/news/report-niso-conference-scientific-data-management-caring-for-your-institution-and-its-intellectual-wealth/#comments Fri, 13 Mar 2015 18:57:29 +0000 http://researchdata.wisc.edu/?p=5326 [...]]]> By Allan Barclay, Information Architecture Librarian at Ebling Library

The National Information Standards Organization (NISO) held a virtual conference, “Scientific Data Management: Caring for Your Institution and its Intellectual Wealth” on February 18. A variety of data management projects and academic organizations were represented, including the US Department of Energy, Emory University, Tufts University, Oregon State University, University of Illinois at Urbana-Champaign, Force 11, the Center for Open Science and the RMap project. The web page for the event (including slide decks) is still available at the NISO website. Some highlights include:

The DART Project

This research project uses data management plans (DMPs) from successful grant applications. The end product is a rubric for reviewing future DMPs prior to submission. It can also help an institution identify gaps in its research data management services. The rubric should be available for release later this year.


Force 11

Force 11 is “a grass roots community that developed out of beyond-the-PDF conferences.” They address issues such as data access and reuse, transparency in research, data citation, and attribution for the different roles and outputs in the research process. They host at least a dozen different forums for the discussion or creation of better standards and practices in research communications and e-scholarship.


Center for Open Science

The Center for Open Science is a non-profit technology start-up company working on a free, open source application called the Open Science Framework – a set of tools focused on transparency and reproducibility in the research workflow. Features include file sharing, provenance tracking, persistent URLs, automated versioning and API connections to common data storage providers including Figshare, GitHub, Amazon S3, Dropbox, and Dataverse.


RMap Project

RMap is a two-year project that started with discussions between the Data Conservancy community at Johns Hopkins, Portico, and the IEEE. The idea behind the project is that the “atomic unit” of scholarly research is a complex distributed object whose building blocks (text, graphics, data, and more) reside in different locations at different institutions using different technologies. Not only do the different artifacts themselves need to be preserved, the links between them also need to be preserved. The RMap project hopes to create a framework and tools to facilitate this process, sort of like an operating system for a repository of scholarly research activities.


 

]]>
http://researchdata.wisc.edu/news/report-niso-conference-scientific-data-management-caring-for-your-institution-and-its-intellectual-wealth/feed/ 0
Apply for the NADDI 2015 Student Scholarship http://researchdata.wisc.edu/news/apply-for-the-naddi-2015-student-scholarship/ http://researchdata.wisc.edu/news/apply-for-the-naddi-2015-student-scholarship/#comments Tue, 10 Mar 2015 17:45:37 +0000 http://researchdata.wisc.edu/?p=5317 [...]]]>

UW-Madison Research Data Services is accepting applications for a student scholarship to the North American Data Documentation Initiative (NADDI) 2015 conference. The conference will be held at the Pyle Center on the UW-Madison campus, April 8-10.

NADDI 2015 is the premier data documentation conference – a great opportunity for those using metadata standards and others interested in learning more to share stories, discoveries, and experiences. This conference will be of interest to future librarians and data professionals in the social sciences and other disciplines.

Student scholarship applications are due by Friday, March 20th. The scholarship will cover the conference registration fee. After the conference, the scholarship recipient will be asked to write a brief blog post sharing their experience on the RDS blog.

Please send a brief statement of interest and CV/resume to RDS Chair Brianna Marshall.

For more information about the conference and DDI please visit the conference website.

]]>
http://researchdata.wisc.edu/news/apply-for-the-naddi-2015-student-scholarship/feed/ 1
Upcoming Brown Bag Talk on Open Access, Open Data, & Open Ed http://researchdata.wisc.edu/news/upcoming-brown-bag-talk-on-open-access-open-data-open-ed/ http://researchdata.wisc.edu/news/upcoming-brown-bag-talk-on-open-access-open-data-open-ed/#comments Mon, 09 Mar 2015 19:09:28 +0000 http://researchdata.wisc.edu/?p=5312 [...]]]> Our third brown bag talk, “Open Access, Open Data, and Open Ed Updates,” will be presented by Doug Way, Brianna Marshall, Carrie Nelson, and Jim Jonas.

TIME: Wednesday, March 18, 12pm-1pm.

PLACE: Bunge Room, School of Library and Information Studies, 4th floor of Helen C. White Hall.

ABSTRACT: In this talk, the presenters will introduce the concepts of open access, data, and educational resources. They will share recent updates in each domain and highlight existing resources for learning more. The second half of the presentation will be reserved for questions and unstructured conversation about these issues.

Please RSVP for this talk if you plan to attend. View other talks in this series in our archive.

]]>
http://researchdata.wisc.edu/news/upcoming-brown-bag-talk-on-open-access-open-data-open-ed/feed/ 0
Guides, Tutorials, and Courses for Learning About Data Management http://researchdata.wisc.edu/news/selected-guides-tutorials-and-courses-for-learning-about-data-management/ http://researchdata.wisc.edu/news/selected-guides-tutorials-and-courses-for-learning-about-data-management/#comments Wed, 25 Feb 2015 16:20:45 +0000 http://researchdata.wisc.edu/?p=5282 [...]]]> by Cid Freitag, Instructional Technology Program Manager at DoIT Academic Technology

If the data you need still exists;
If you found the data you need;
If you understand the data you found;
If you trust the data you understand;
If you can use the data you trust;
Someone did a good job of data management.

Rex Sanders ‐ USGS‐Santa Cruz*

Data management practices have been described in detail in a variety of documentation and tutorials, many of which focus on the specific needs and resources of the organization that produced them. The following is a selected list of resources that are general enough to apply across disciplines and beyond the university or agency that developed them.

Guides and Tutorials

Data Science MOOCs

Several Massive Open Online Courses (MOOCs) cover topics related to data analysis and research methods. Even if you choose not to do the coursework and earn a statement of completion, it’s easy to sign up for the courses, which gives you access to lectures and examples.

The Class Central website has curated a list of several data science and analysis methods MOOCs, developed by reputable sources.

The MOOCs listed here were developed by Johns Hopkins University and are offered through the Coursera platform. They are part of a Data Science Specialization series of courses, and they apply to data management practices beyond specific analytical techniques. Each of these courses lasts 4 weeks and is offered frequently. Currently, there is a new offering of each course starting each month from March through June 2015.

The Data Scientist’s Toolbox, Jeff Leek, Roger Peng, Brian Caffo

“The course gives an overview of the data, questions, and tools that data analysts and data scientists work with.” It focuses on a practical introduction to tools, using version control, markdown, git, GitHub, R, and RStudio.

Getting and Cleaning Data, Jeff Leek, Roger Peng, Brian Caffo

“This course will cover the basic ways that data can be obtained… It will also cover the basics of data cleaning and how to make data ‘tidy’… The course will also cover the components of a complete data set including raw data, processing instructions, codebooks, and processed data. The course will cover the basics needed for collecting, cleaning, and sharing data.” Tools used in this course: GitHub, R, RStudio

Reproducible Research, Jeff Leek, Roger Peng, Brian Caffo

“Reproducible research is the idea that data analyses, and more generally, scientific claims, are published with their data and software code so that others may verify the findings and build upon them… This course will focus on literate statistical analysis tools which allow one to publish data analyses in a single document that allows others to easily execute the same analysis to obtain the same results.” Tools: R Markdown, knitr


*Rex Sanders quote from: Environmental Data Management: CHALLENGES AND OPPORTUNITIES, Jamie Gerrard | March 2014

 

Looking for additional information about research data management? Contact us.

]]>
http://researchdata.wisc.edu/news/selected-guides-tutorials-and-courses-for-learning-about-data-management/feed/ 0
Second Spring 2015 Holz Brown Bag Talk http://researchdata.wisc.edu/news/second-spring-2015-holz-brown-bag-talk/ http://researchdata.wisc.edu/news/second-spring-2015-holz-brown-bag-talk/#comments Wed, 11 Feb 2015 15:00:02 +0000 http://researchdata.wisc.edu/?p=5241 [...]]]> Photo courtesy of Kristin Briney

Our second brown bag talk, “Zero to Sixty: Establishing Research Data Services from Scratch,” will be presented by Kristin Briney, Data Services Librarian at the University of Wisconsin-Milwaukee.

TIME: Wednesday, February 25, 12pm-1pm.

PLACE: Cat Lab (4191F), School of Library and Information Studies, 4th floor of Helen C. White Hall.

ABSTRACT: What does it take to create research data services where none existed before? Kristin Briney will discuss establishing data services at the University of Wisconsin-Milwaukee. Her talk will include strategy and lessons learned 18 months into the process.

ABOUT KRISTIN: Kristin is a PhD chemist who works at the interface of science, technology, and information management. Her particular interests are: helping researchers manage their data, improving informatics systems through robust metadata and workflows, teaching information retrieval and management skills, and using technology to make science accessible to everyone.

Please RSVP for this talk if you plan to attend. View other talks in this series in our archive.

]]>
http://researchdata.wisc.edu/news/second-spring-2015-holz-brown-bag-talk/feed/ 0
NADDI 2015 at UW-Madison http://researchdata.wisc.edu/news/naddi-2015-at-uw-madison/ http://researchdata.wisc.edu/news/naddi-2015-at-uw-madison/#comments Tue, 10 Feb 2015 22:07:33 +0000 http://researchdata.wisc.edu/?p=5130 [...]]]>

Research Data Services is proud to co-sponsor the third annual North American Data Documentation Initiative conference, occurring April 8-10 at the University of Wisconsin-Madison.

The theme for NADDI 2015, Research Data Management: Enhancing Discoverability with Open Metadata Standards, emphasizes an applied use of DDI to research data. Meant to appeal to individuals involved in creating, managing, and using research data, the conference encourages the submission of presentations that showcase the importance of DDI metadata not only for discovering and using research data, but also as a practical and utilitarian principle supporting research data production and management.

The conference also encourages presentations on current data service models from institutions that want to brainstorm how to integrate DDI into their workflows. Finally, because UW-Madison is home to two longitudinal studies (MIDUS and the Wisconsin Longitudinal Study) that collect biological and other non-survey data types, NADDI 2015 will be a convenient forum to discuss documenting complex use cases with DDI.

The call for presentations is open through February 13. For more information, visit the NADDI 2015 website or download the NADDI 2015 informational flyer.

About DDI

The Data Documentation Initiative (DDI) is an open metadata standard for describing data and data collection activities. DDI’s principal goal is making research metadata machine-actionable. The specification can document and manage different stages of data lifecycles, such as conceptualization, collection, processing, analysis, distribution, discovery, repurposing, and archiving.

]]>
http://researchdata.wisc.edu/news/naddi-2015-at-uw-madison/feed/ 0
Workshop on Data Management for Ecologists a Success http://researchdata.wisc.edu/news/workshop-on-data-management-for-ecologists-a-success/ http://researchdata.wisc.edu/news/workshop-on-data-management-for-ecologists-a-success/#comments Tue, 10 Feb 2015 18:01:08 +0000 http://researchdata.wisc.edu/?p=5229 [...]]]>
Photo courtesy of Brianna Marshall

By Erin Carrillo, Information Services Librarian, Steenbock Library

In November, RDS held a two-day data management workshop for graduate student researchers. Participants came from several departments across campus, including Limnology, Entomology, Forest and Wildlife Ecology, Geography, and the Nelson Institute for Environmental Studies. They were part of a cohort of graduate students doing research in the area of biodiversity conservation, funded by an NSF Integrative Graduate Education and Research Traineeship grant.

We planned the workshop with two graduate students, Kara Cromwell (Zoology) and Alex Latzka (Center for Limnology), who saw a need to provide new researchers with the knowledge and skills to navigate the changing research data landscape. From funder and publisher requirements for data management plans and data sharing, to the ongoing development of metadata standards and discipline-specific data repositories, researchers need to be aware of trends within their discipline and practice good data management from the outset. Kara and Alex also wanted to encourage and facilitate the sharing of research data within the group.

The workshop addressed several broad topics within data management, but content was tailored to the specific needs of the group. We administered a survey to the group at the beginning of the planning process to gauge students’ current knowledge of data management practices, as well as their specific needs. We identified several areas of focus and developed a module for each area. Stephanie Hampton, a visiting scientist from Washington State and former deputy director of NCEAS (National Center for Ecological Analysis and Synthesis), was invited by grad students in the Center for Limnology. She had recently published a few high-impact papers on the future of ecology, especially with respect to Big Data, and gave a short talk that offered participants perspective on why sound data management will matter as they advance in their careers.

The final program was:

  • Spreadsheets, Jan Cheetham, DoIT Academic Technology and Barry Radler, Institute on Aging
  • File Organization, Elliott Shuppy, School of Library and Information Studies (SLIS)
  • Storage & Preservation, Brianna Marshall, Digital Curation Coordinator; Luke Bluma, DoIT Storage & Backup; Elliott Shuppy
  • Metadata, Corinna Gries, Center for Limnology, North Temperate Lakes Long Term Ecological Research (LTER)
  • Data Management Plans, Corinna Gries
  • Keynote talk by Stephanie E. Hampton, Kaeser Scholar, Washington State University, Director of the Center for Environmental Research, Education, and Outreach

We built in designated work time at the end of the first day to give participants an opportunity to apply what they had learned and collaborate with their colleagues. Module presenters were available to answer questions.  Presenters deposited slide decks and other workshop materials in a Box folder that we shared with participants after the workshop.

We had participants complete a pre- and post-workshop survey to assess the effectiveness of the workshop. The results revealed that participants generally rated their ability to practice good data management higher after the workshop. We also got this positive feedback from Kara:

“Alex and I heard a lot of positive feedback throughout the workshop… The schedule flowed smoothly, the content was very well suited to the needs of the group, and all the modules were engaging. We really appreciate the time you invested, and I know everyone (including many who weren’t able to attend) will continue to take advantage of the resources posted in the Box folder. It was a definite success!”

It was a pleasure to work with Kara and Alex and their group, and we look forward to using what we learned from planning this workshop to organize similar workshops tailored to the needs of researchers in different disciplines across campus.

Is your lab or department interested in working with RDS to develop a discipline-specific data management workshop? Contact us.

]]>
http://researchdata.wisc.edu/news/workshop-on-data-management-for-ecologists-a-success/feed/ 0
Manage Your Data with LabArchives http://researchdata.wisc.edu/storing-data/manage-your-data-with-labarchives/ http://researchdata.wisc.edu/storing-data/manage-your-data-with-labarchives/#comments Tue, 10 Feb 2015 14:18:55 +0000 http://researchdata.wisc.edu/?p=5211 [...]]]>

By Jan Cheetham, Research and Instructional Technologies Consultant, DoIT

LabArchives is an ELN (Electronic Lab Notebook) that provides data storage, data documentation, collaboration, and export features. Like traditional paper lab notebooks, an ELN can serve as a continuous and complete record of the research process.

Basics

Collaboration and Sharing

LabArchives provides flexible permissions and roles for lab members and their collaborators. It is recommended that PIs assume the Owner role in all their lab’s notebooks, in alignment with UW-Madison’s Policy on Data Stewardship, Access, and Retention and to ensure that no data is lost when lab members graduate or leave the university.

There are several approaches for organizing notebooks and managing edit/read rights of individuals. Permissions can be set at the level of the notebook, page, or entry. It is also possible for individuals in the Owner or Admin role to share notebooks, pages, and entries with collaborators outside the university. Although LabArchives has a method for creating Digital Object Identifiers (DOIs) for notebooks, this requires making the notebook publicly available. The UW-Madison LabArchives site currently has the public sharing feature turned off as a security measure to prevent inadvertent sharing of notebooks.

The ELN provides a timestamp and record of every user action, creating an electronic record of who added or edited an entry and when. In addition, nothing can be permanently deleted from the ELN. (LabArchives allows you to move a notebook, page, or entry to a Delete Bin; however, these items are not actually deleted and can be recovered at any time.)

Organizing and Documenting

The ability to blend digital data with the human readable narrative of the research process is one of the main advantages of an ELN over other file sharing/storage services or hybrid paper/electronic systems. LabArchives has a number of different entry types for entering data and recording the narratives. Below are a few suggestions that will help ensure that the information you enter in LabArchives can be readily retrieved.

Naming conventions
LabArchives currently does not offer a way to browse through folders or pages chronologically. Therefore, you may want to use file-naming conventions for pages (and possibly, folders). Names should contain a project name, date, experiment identifier, etc. For more specific suggestions, see naming conventions in an ELN.  It is also a good idea to use similar naming conventions for files you attach or link to in the ELN to make it easier to trace through versions and locate those with transformations.
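
One way to apply the naming advice above is to generate page names from the same few fields every time. The sketch below is illustrative only: the function name `page_name`, the field order, and the underscore separator are assumptions, not a LabArchives feature or an official RDS convention. It writes the date in ISO 8601 form (YYYY-MM-DD) so that names within one project sort chronologically when listed alphabetically.

```python
import re
from datetime import date


def page_name(project: str, day: date, experiment_id: str, label: str = "") -> str:
    """Build a sortable name: project_YYYY-MM-DD_experiment[_label].

    Hypothetical helper for illustration; adjust fields to your lab's needs.
    """
    def clean(part: str) -> str:
        # Collapse any run of characters other than letters, digits,
        # or hyphens into a single hyphen, then trim stray hyphens.
        return re.sub(r"[^A-Za-z0-9-]+", "-", part.strip()).strip("-")

    parts = [clean(project), day.isoformat(), clean(experiment_id)]
    if label:
        parts.append(clean(label))
    return "_".join(parts)
```

For example, `page_name("Lake Survey", date(2015, 2, 10), "EXP 07")` produces `Lake-Survey_2015-02-10_EXP-07`. The same function could be used to name attached or linked files so that versions are easier to trace.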

Documenting attached files
In LabArchives, you upload and attach a single data file to an attachment entry on a page. The file can be of any type and up to 250 MB in size. The entry will display the name of the attached file and you can also enter a description with detailed information (metadata) about the file. When you upload a new version of the file to the same entry, LabArchives retains all prior versions and lets you revert to older versions through the entry’s revision history. However, as noted below, only the most recent version is included in HTML export. Therefore, to ensure that all data files that you or someone else would need to reproduce your findings are archived both inside the ELN and in HTML exports, be sure to create a separate attachment entry for each essential file that needs to be retained in its original, unaltered form. Then, new versions of the data file (in which the original data are cleaned, transformed, analyzed, visualized, etc.) should be added to the ELN as one or more new entries.

Documenting linked files
When data files are too big (>250 MB) or too numerous to attach to the ELN, you can create links to them from within a rich text entry. However, LabArchives does not check links or verify locations, so you will need to ensure the files are in a secure and permanent location. It is also a good practice to record the name of the file and its location directly in the rich text entry since the URL you add when you create a link is not directly visible in the entry.
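
The attach-or-link decision and the note recorded alongside a link can be sketched in a few lines. This is a hedged illustration, not part of LabArchives: the names `attach_or_link` and `link_note` are hypothetical, the 250 MB limit is assumed here to mean decimal megabytes, and the SHA-256 checksum is an extra suggestion (not mentioned above) that lets you confirm later that the linked file has not changed.

```python
import hashlib
from pathlib import Path

# Assumption: the 250 MB attachment limit is counted in decimal megabytes.
ATTACH_LIMIT_BYTES = 250 * 1000 * 1000


def attach_or_link(size_bytes: int, limit: int = ATTACH_LIMIT_BYTES) -> str:
    """Return "attach" if the file fits under the limit, otherwise "link"."""
    return "attach" if size_bytes <= limit else "link"


def link_note(path: Path, location: str) -> str:
    """Build text to paste into a rich text entry next to a link.

    `location` is wherever the file permanently lives (for example a
    network share path); it is supplied by the caller, not discovered.
    """
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in 1 MiB chunks so large files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(1024 * 1024), b""):
            digest.update(chunk)
    return (
        f"File: {path.name}\n"
        f"Location: {location}\n"
        f"Size: {path.stat().st_size} bytes\n"
        f"SHA-256: {digest.hexdigest()}"
    )
```

Pasting the output of `link_note` into the entry keeps the file name and location visible even though the link URL itself is not displayed.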

Exporting and Archiving

LabArchives has two export formats, PDF and HTML. The PDF version is similar to a scanned paper notebook page. The HTML version lacks some of the appearance of the notebook but contains more complete information, including attached files. As with any digital platform you use for your research data, you will want to have a backup and archival plan. This should take into account how often you make changes to the notebook and include methods for retaining duplicate copies of important data files in alternate locations.

PDF
PDFs can be created for a single entry, a page, or an entire notebook. PDFs include: text entries, thumbnails of images and widgets, annotations and descriptions of attachments, user name and time/date stamps. They do not include: attached files, version history of attachments, or comments. URLs of links in rich text entries may be retrievable, depending on the application you use to read the PDF.

HTML
The HTML option exports an entire notebook. Each page in the notebook is a separate HTML file and the most recent version of each attached file is also included. This export option also does not include version history of attachments or comments. Again, URLs that you add to create links in rich text entries may be retrievable, depending on the browser you use to read the HTML pages.

Do you have additional questions or concerns about electronic lab notebooks? Contact us.

]]>
http://researchdata.wisc.edu/storing-data/manage-your-data-with-labarchives/feed/ 1
Data Archiving Platforms: MINDS@UW http://researchdata.wisc.edu/storing-data/data-repos/data-archiving-platforms-mindsuw/ http://researchdata.wisc.edu/storing-data/data-repos/data-archiving-platforms-mindsuw/#comments Mon, 02 Feb 2015 20:40:08 +0000 http://researchdata.wisc.edu/?p=5076 [...]]]> by Brianna Marshall, Digital Curation Coordinator, General Library System

This is part one of a three-part series where I explore platforms for archiving and sharing your data. To help you better understand your options, here are the areas I will address for each platform:

  • Background information on who can use it and what type of content is appropriate
  • Options for sharing and access
  • Archiving and preservation benefits the platform offers
  • Compliance with the forthcoming OSTP mandate

MINDS@UW

About: MINDS@UW is the University of Wisconsin’s institutional repository, intended to capture, archive, and provide access to scholarship originating from campus researchers of any discipline. It is supported by the UW Libraries and free for all UW-affiliated researchers to use. While a wide variety of file formats are supported, this platform is best suited to handling text-based formats.

Sharing and access: Items in the repository are given a permanent URL that can be used to share the item; however, DOIs are not minted at this time. Items can be made open access (accessed free of charge by anyone, anywhere, at any time) or they can be embargoed (no access is provided until a certain time, up to a few years, has passed). Embargoed items are still discoverable since the metadata is indexed in the repository but the content will not be visible.

Archiving and preservation: The Libraries are committed to long-term preservation of all MINDS@UW items. In addition to the current backup practices in place, the Libraries are collaborating with the UW-Madison Office of the CIO to design and pilot a campus-scaled digital preservation infrastructure. This service, and the libraries’ own preservation repositories, will eventually be aligned with the Digital Preservation Network (DPN).

OSTP mandate: The OSTP mandate requires all federal funding agencies with over $100 million in R&D funds to make greater efforts to make grant-funded research outputs more accessible. This will likely mean that data must be publicly accessible and have an assigned DOI (though you’ll need to check with your funding agency for the exact requirements). Because MINDS@UW cannot provide a DOI at this time, it is not a suitable place for funder data.

The UW Libraries are always looking to improve this platform to better fit the needs of researchers. If you have a question, comment, or suggestion related to MINDS@UW, please contact repository manager Brianna Marshall.

Visit MINDS@UW.

Do you have additional questions or concerns about where you should archive your data? Contact us.

]]>
http://researchdata.wisc.edu/storing-data/data-repos/data-archiving-platforms-mindsuw/feed/ 0