Applied Math and Statistics 550.450

Computational Molecular Medicine

Spring 2017

(4 credits, EN)


Computational systems biology has emerged as the dominant framework for analyzing high-dimensional “omics” data in order to uncover the relationships among molecules, networks and disease. In particular, many of the core methodologies are based on statistical modeling, including machine learning, stochastic processes and statistical inference. We will cover the key aspects of this methodology, including measuring associations, testing multiple hypotheses, and learning predictors, Markov chains and graphical models. In addition, by studying recent important articles in cancer systems biology, we will illustrate how this approach enhances our ability to annotate genomes, discover molecular disease networks, detect disease, predict clinical outcomes, and characterize disease progression. Whereas a good foundation in probability and statistics is necessary, no prior exposure to molecular biology is required (although helpful).


Prof. Joel S. Bader, joel.bader at,

Course developed by Prof. Donald Geman

Office: Clark 201C; Miller Research Building 319

Office hours: 5:45 PM Shaffer 303, with discussions continued at One World Cafe

Teaching Assistants

  • Tobi Bosede, abosede2 at
  • Kamel Lahouel, klahoue1 at


Lectures: Monday and Wednesday, 4:30 – 5:45 PM, Shaffer 303


  1. Section 2, Friday, 9:00 – 9:50 AM, Hodson 216, Kamel
  2. Section 1, Friday, 1:30 PM – 2:20 PM, Gilman 377, Tobi

TA Office Hours

  • Mondays, 1-2 PM, Whitehead 212
  • Fridays, 10-11 AM, Clark 307B

Key Dates

Lecture, HW, Quiz, and Exam Dates

  • Midterm, Monday, March 27
  • Final exam, according to registrar schedule

Syllabus with dates


Lecture notes posted on BlackBoard.


Posted on BlackBoard.

On-line material

Posted on BlackBoard.


EN.550.420 and EN.550.430 (Introduction to Probability and Introduction to Statistics)

All prerequisites may be satisfied by an equivalent course or by the instructor’s permission.

Online Resources

BlackBoard? Yes, BlackBoard.

Course Objectives

  1. Have a working knowledge of the modern understanding of genetics and genomics.
  2. Understand probability and statistics in the small n, large p limit (many more variables than observations).
  3. Understand how to perform hypothesis testing with applications to genomic data.
  4. Understand how to learn and apply classifiers using biological data as input.
  5. Develop and apply stochastic models for biological processes.
  6. Develop and apply graphical models to describe latent variables in biological systems.

Course Topics

  1. Basic genetics and genomics, Central Dogma, biological information and high-throughput data
  2. Genotype and phenotype, statistical genetics
  3. Hypothesis testing for DNA (GWAS) and RNA (differential expression)
  4. Cancer biology, cancer classification, driver mutations
  5. Tumor evolution, Galton-Watson models, epigenetics
  6. Graphical models, hidden Markov models for biological sequences, signaling networks

Course Expectations & Grading

6 homework assignments, each worth 1 grading unit

Midterm exam, worth 2 grading units

Final exam, worth 3 grading units

All work (including HW and exams) should be independent.

If you must be absent for an academic or professional reason, including conferences, medical school or graduate school interviews, or other on-site interviews, please speak to me ahead of time. You must speak to me ahead of time if an absence conflicts with an exam date.

If you are absent for a non-academic reason, primarily health-related, please have the Dean of Students email me on your behalf.


The strength of the university depends on academic and personal integrity. In this course, you must be honest and truthful. Ethical violations include cheating on exams, plagiarism, reuse of assignments, improper use of the Internet and electronic devices, unauthorized collaboration, alteration of graded assignments, forgery and falsification, lying, facilitating academic dishonesty, and unfair competition.

In addition, the specific ethics guidelines for this course are:

  • Homework: students should work independently and eschew use of previous years’ assignments in preparing homework assignments for this course.
  • Late homework will receive a 50% deduction.
  • Study materials: all study materials are generally permitted, including course materials from previous years. Advance copies of current-year questions are not permitted, however. Students taking exams at different times, particularly for make-up exams, may not share information.
  • Exams: only blank paper, pens, and pencils are permitted. Electronic devices, including calculators, phones, and computers, are strictly forbidden. Equation sheets, if necessary, will be provided in the exam itself.

Report any violations you witness to the instructor.

You can find more information about university misconduct policies on

the web at these sites:

Students with Disabilities

Any student with a disability who may need accommodations in this class must obtain an accommodation letter from Student Disability Services, 385 Garland, (410)


ABET Outcomes

  • Ability to apply mathematics, science and engineering principles (a).
  • Ability to design and conduct experiments, analyze and interpret data (b).
  • Ability to identify, formulate and solve engineering problems (e).
  • Understanding of professional and ethical responsibility (f).
  • Ability to communicate effectively (g).
  • The broad education necessary to understand the impact ofengineering solutions in a global and societal context (h).
  • Recognition of the need for and an ability to engage in life-long learning (i).
  • Knowledge of contemporary issues (j).
  • Ability to use the techniques, skills and modern engineeringtools necessary for engineering practice (k).
%d bloggers like this: