Tool: Tabula

Information adapted from the Tabula website.

What is Tabula?

If you’ve ever needed data that only exists in a PDF format, you’ve likely discovered that you can’t easily copy and paste the data, which makes being able to actually use it difficult.  Tabula is a free, open-source tool you can use for “liberating data tables locked inside PDF files.”

For an example of Tabula being used to extract data for a visualization project, check out this blog post by the Jane Speaks Initiative. Other examples can also be found on the Tabula website.

What can Tabula help you do?

Tabula runs in your web browser, making it easy to browse to the PDF containing the data you need, select the portion of the PDF containing the data tables, and then easily extract the data from the tables into a CSV file or a Microsoft Excel spreadsheet.

How do you get it?

You can download Tabula for free from its website. It is also available on GitHub.

What else should you know?

Tabula works only with text-based PDFs; the developers note that it will not work with scanned documents. Tabula is available for Windows, Mac OS X, and Linux operating systems.

 

Tool: CodingBat Python

Information adapted from CodingBat Python website.

What is CodingBat Python?

CodingBat Python is a website that offers Python coding problems you can work through for practice (it also offers Java problems). It was created by Nick Parlante, a computer science lecturer at Stanford. It’s geared towards beginners, although some knowledge of Python is required. The website notes that these problems are the sort you’d encounter in a first or second computer science course.

(more…)

Tool: ATLAS.ti

Information adapted from ATLAS.ti website

What is ATLAS.ti?

ATLAS.ti is a software workbench that helps you perform qualitative analysis on large amounts of text, graphics, audio, or video. ATLAS.ti supports a wide range of data formats, including most common text formats (including .txt, .doc., .docx, and .pdf), “dozens” of graphic and audio formats including .wav and .mp3, and many common video formats. You can also import data from Twitter or Evernote, surveys, or a reference manager.

(more…)

Tool: Identity Finder

Information from DoIT’s recent news and DoIT’s Identity Finder information.

What is Identity Finder?

This tool was featured recently by DoIT, but we wanted to cover the information again as it’s a great tool for those interested in ensuring security on their local machines. Identity Finder is a software tool that can help find personally identifiable or sensitive information on your local machine. Finding restricted information allows you to take steps to ensure protection or encryption of that information. (more…)

Tool: ReproZip

Information adapted from ReproZip.org.

What is ReproZip?

ReproZip is a software packaging tool developed by Fernando Chirigati, Juliana Freire, Rémi Rampin, Dennis Shasha, and Vicky Steeves at NYU. ReproZip is designed to make the computational components of research easier to reproduce across different machines.

(more…)