Call to Action to the Tech Community on New Machine Readable COVID-19 Dataset

CD2H_new2Requested by The White House Office of Science and Technology Policy, the COVID-19 Open Research Dataset (CORD-19) represents the most extensive machine-readable coronavirus literature collection available for data and text mining to date, with over 29,000 articles, more than 13,000 of which have full text.

Now, The White House joins these institutions in issuing a call to action to the Nation’s artificial intelligence experts to develop new text and data mining techniques that can help the science community answer high-priority scientific questions related to COVID-19.

The CORD-19 resource is available on the Allen Institute’s website and will continue to be updated as new research is published in archival services and peer-reviewed publications. Researchers should submit the text and data mining tools and insights they develop in response to this call to action via the Kaggle platform. Through Kaggle, a machine learning and data science community owned by Google Cloud, these tools will be openly available for researchers around the world.

  • machine learning
  • data discovery
  • COVID-19

Publishing CTSA Program Hub’s Name
Center for Data to Health