Image from ChEMBL Blog
This course is hosted in the School of Informatics and Computing at Indiana University. The lead instructor is Prof David J Wild. The course is primarily a graduate course (I590) and is open to all IU graduate students, though a number of places are also available for informatics undergraduate students who meet the prerequisites.
This course is also part of the new Data Science Curriculum in the school - click the link for more information on how to apply to this certificate program.
With exploding healthcare costs, greater longevity and widespread health challenges of diabetes, obesity, cancer and cardiovascular disease, medicine and healthcare will be a primary scientific and economic focus for the remainder of this century. Informatics and data science offer the promise of a level of understanding of health, disease and treatment on a scale never before imagined. This course will address the big data techniques that are being used in the drug discovery, healthcare and translational medicine domains and will be organized around three questions: how can data science help researchers find new drugs and reuse old ones? How can data science help doctors treat patients better? And how can data science help us all lead healthier lives?
The course is broken down into sections, based around these questions, and modules. Each week of the course will focus on 1-3 modules. Each of these modules will have four parts: a Video, which gives an overview of the topic; Learning Goals that list what you should aim to know after completing the module; Learning Tasks that all students should complete in addition to watching the video, and Going Deeper that gives resources for advanced students and those that want to go deeper into the material.
Students will: Understand the current scientific and human challenges of drug discovery, health and translational medicine; be able to describe the demonstrated or potential value of data science techniques in each of these areas; understand the specific opportunities afforded by crossing domain boundaries; be able to practically work with drug discovery and EMR data using the R statistics package and network visualization tools.
There are no required texts for this course, however, there are several books that are recommended for background reading (more will likely be added to this list as the course progresses)
- R for Medicine and Biology practical guide to using the R statistics package with biology and healthcare data
- Introducing Cheminformatics - eBook overview of the field of cheminformatics, in PDF and Kindle format