Data Driven Medicine


Stanford School of Medicine


With the spread of electronic health records and increasingly low cost assays for patient molecular data, powerful data repositories with tremendous potential for biomedical research, clinical care and personalized medicine are being built. But these databases are large and difficult for any one specialist to analyze. This course introduces methods for finding the hidden associations within the full set of data. This course includes a programming project.


Highly recommended: STATS216

Topics include

Topics Include

  • Methods for data-mining at the internet scale
  • Handling of large-scale electronic medical records data for machine learning
  • Methods in natural language processing and text-mining applied to medical records
  • Methods for using ontologies for the annotation and indexing of unstructured content as well as semantic web technologies

Note on Course Availability

This course is typically offered Autumn quarter.

The course schedule is displayed for planning purposes – courses can be modified, changed, or cancelled. Course availability will be considered finalized on the first day of open enrollment. For quarterly enrollment dates, please refer to our graduate certificate homepage.

Thank you for your interest. The course you have selected is not open for enrollment. Please click the button below to receive an email when the course becomes available again.

Request Information