Tool: Tabula

Information adapted from the Tabula website.

What is Tabula?

If you’ve ever needed data that only exists in a PDF format, you’ve likely discovered that you can’t easily copy and paste the data, which makes being able to actually use it difficult.  Tabula is a free, open-source tool you can use for “liberating data tables locked inside PDF files.”

For an example of Tabula being used to extract data for a visualization project, check out this blog post by the Jane Speaks Initiative. Other examples can also be found on the Tabula website.

What can Tabula help you do?

Tabula runs in your web browser, making it easy to browse to the PDF containing the data you need, select the portion of the PDF containing the data tables, and then easily extract the data from the tables into a CSV file or a Microsoft Excel spreadsheet.

How do you get it?

You can download Tabula for free from its website. It is also available on GitHub.

What else should you know?

Tabula works only with text-based PDFs; the developers note that it will not work with scanned documents. Tabula is available for Windows, Mac OS X, and Linux operating systems.

 

Tool: OpenICPSR

Written by Chiu-chuang Lu Chou; Information adapted from OpenICPSR

 

OpenICPSR is a self-serving data repository for researchers who need to deposit their social and behavioral science research data for public access compliance.  Researchers can share up to 2 GB data in OpenICPSR for free. Researchers prepare all data and documentation files necessary to allow their data collection be read and interpreted independently.  They also prepare metadata to allow their data be searched and discovered in ICPSR catalog and major search engines. A DOI and a data citation will be provided to the depositor after data are published.

 

Depositors will receive data download reports from OpenICPSR. All OpenICPSR data is governed by the Attribution 4.0 Creative Commons License. Server-side encryption is used to encrypt all files uploaded to OpenICPSR. Data deposited in self-deposit package are distributed and preserved as-is, exactly as they arrive without the standard curation and preservation features available to professional curation package.

 

OpenICPSR offers Professional Curation Package to researchers, who like to utilize ICPSR’s curation services including full metadata generation and a bibliography search, statistical package conversion, and user support. The cost of professional curation is based on the number of variables and complexity of the data. To learn more about OpenICPSR, please visit their website.

Tool: ATLAS.ti

Information adapted from ATLAS.ti website

What is ATLAS.ti?

ATLAS.ti is a software workbench that helps you perform qualitative analysis on large amounts of text, graphics, audio, or video. ATLAS.ti supports a wide range of data formats, including most common text formats (including .txt, .doc., .docx, and .pdf), “dozens” of graphic and audio formats including .wav and .mp3, and many common video formats. You can also import data from Twitter or Evernote, surveys, or a reference manager.

(more…)

Tool: Research Cores – New Resource Directory

Information from the Research Cores website.

The Office of Campus Research Cores recently debuted a new tool to connect researchers to the resources, services, and cores they need. Cores are groups or facilities that provide shared access to resources such as instruments, technologies, or expert consultations for researchers, and the Directory of Resources for Researchers includes over 60 of them. It also includes data for 300 resources and 170 services. You can search the directory, use a sample search, or browse the directory as a list.

Research Cores aims to support research at UW-Madison in the biological sciences, physical sciences, social sciences, and arts and humanities. Visit their website for more information or to offer feedback on this exciting new tool.