Data and processing scripts for clinical trial information from http://ClinicalTrials.gov, a registry and results database of publicly and privately supported clinical studies of human participants conducted around the world.

Deposit in this database has been required since September 2007 for all "applicable clinical trials" as per FDAAA 801.

Getting the data

  1. Go to http://www.clinicaltrials.gov/
  2. Run a search with an empty query to get all results
  3. Click download and select all results
  4. Wait while the 542 MB zip file (search_results.zip) downloads
  5. Unzip it - you now have 2.3 GB of clinical trials XML
  6. Use the scripts - see below
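Once the archive is unzipped, a quick sanity check is to list the trial files. The following is a minimal sketch, not part of the repository's scripts. It assumes the XML files sit directly in one directory and are named like the sample records (NCT followed by digits, then .xml). The function name listTrialFiles and the directory name in the usage comment are hypothetical.

```javascript
const fs = require('fs');
const path = require('path');

// Return the full paths of all NCT<digits>.xml files directly inside dir.
// Assumption: the archive unzips to a flat directory of per-trial files.
function listTrialFiles(dir) {
  return fs
    .readdirSync(dir)
    .filter((name) => /^NCT\d+\.xml$/.test(name))
    .sort()
    .map((name) => path.join(dir, name));
}

// Usage (hypothetical directory name):
// const files = listTrialFiles('search_results');
// console.log(files.length + ' XML files');
```

The listing is non-recursive on purpose. If the unzipped layout turns out to be nested, the filter can stay the same and only the directory walk needs to change.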

Data Structure

It's XML. Here's the XSD: http://clinicaltrials.gov/ct2/html/images/info/public.xsd.

Sample records:

  • data/NCT00000102.xml (without results)
  • data/NCT01101477.xml (with results)
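To illustrate the difference between the two sample records, here is a rough sketch that pulls a few fields out of one record and checks whether it carries results. It is not the repository's extract.js. It assumes the element names nct_id, brief_title and clinical_results from the XSD linked above, and it uses regular expressions rather than a real XML parser to stay dependency-free. The names firstText and summarizeRecord are hypothetical.

```javascript
// Return the trimmed text of the first <tag>...</tag> in xml, or null.
// Regex-based on purpose: good enough for simple leaf elements only.
function firstText(xml, tag) {
  const match = xml.match(new RegExp('<' + tag + '>([\\s\\S]*?)</' + tag + '>'));
  return match ? match[1].trim() : null;
}

// Summarize one record. Assumption: posted results live under a
// <clinical_results> element, which is absent for trials without results.
function summarizeRecord(xml) {
  return {
    nctId: firstText(xml, 'nct_id'),
    briefTitle: firstText(xml, 'brief_title'),
    hasResults: /<clinical_results[\s>]/.test(xml),
  };
}

// Usage:
// const fs = require('fs');
// console.log(summarizeRecord(fs.readFileSync('data/NCT01101477.xml', 'utf8')));
```

This sketch does not decode XML entities such as `&amp;` and ignores attributes, so anything beyond a quick look would be better served by a proper XML parser.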

Data Stats

139,848 XML files as of 2013-02-02.

As of 2013-02-01, only 8,044 trials included posted results.
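For a sense of scale, the two figures above can be combined into a rough share. This is only an approximation, since the counts were taken one day apart and the variable names are illustrative.

```javascript
const totalFiles = 139848;  // XML files as of 2013-02-02
const withResults = 8044;   // trials with posted results as of 2013-02-01

// Share of trials that had posted results, as a percentage.
const sharePercent = (withResults / totalFiles) * 100;
console.log(sharePercent.toFixed(2) + '% of trials had posted results');
// prints "5.75% of trials had posted results"
```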

Scripts

The Node.js script extract.js is still under development.