These datasets have been made openly available in response to the COVID-19 pandemic.
For additional help finding datasets, see our page on Finding, Reusing, and Citing Data.
COVID-19 Open Research Dataset is a free resource of over 45,000 scholarly articles, including over 33,000 with full text, about COVID-19 and the coronavirus family of viruses for use by the global research community. This dataset is intended to mobilize researchers to apply recent advances in natural language processing to generate new insights in support of the fight against this infectious disease. The corpus will be updated weekly as new research is published in peer-reviewed publications and archival services like bioRxiv, medRxiv, and others.
The AFI DSI COVID-19 Research Group is a wide-ranging coalition hosted by AFI DSI, with people from UW, local & state government, industry, and other academic institutions curating information related to the COVID-19 pandemic. Their site is an excellent source for the latest in COVID-19 projections and visualizations, along with links to COVID-19 data repositories, code resources, and data science stories.
Wisconsin Department of Health Services COVID-19 Data tracks laboratory confirmed cases of COVID-19 and updates the site totals once a day.
The COVID Racial Data Tracker is a collaboration of the The COVID Tracking Project and the Antiracist Research and Policy Center that tracks inequities in how the COVID-19 pandemic is affecting communities by collecting, publishing, and analyzing racial data on the pandemic from across the United States.
Open access epidemiological data from the COVID-19 outbreak from The Lancet is a free online collection of line list epidemiological, clinical, and laboratory data collated from a variety of sources, including official reports from WHO, Ministries of Health, and Chinese local, provincial, and national health authorities.
Dimensions COVID-19 makes all related articles, preprints, datasets, and clinical trials on Dimensions available for download as a Google Sheet or as a .CSV file.
Global Initiative on Sharing All Influenza Data (GISAID) promotes the international sharing of all influenza virus sequences, related clinical and epidemiological data associated with human viruses, and geographical as well as species-specific data associated with avian and other animal viruses, to help researchers understand how the viruses evolve, spread and potentially become pandemics. Their site hosts a dashboard of data related to COVID-19 from the global community on their homepage.