CS68 - Bioinformatics
Spring 2017


Course Basics

Lecture: Tuesday/Thursday 9:55 - 11:10am, Science Center 199
Lab A: Monday 1:15 - 2:45pm, Clothier 016
Lab B: Monday 3:00 - 4:30pm, Clothier 016
Instructor: Ameet Soni    
Email:
Office: Science Center 253
Office hours: 3pm to 5pm, Wednesday or by appointment
Prep Time (limited availability): Monday 11am to 1pm, Tuesday/Thursday before 10am
Course Discussion: Piazza (mandatory enrollment)

Welcome to CS68. This course is an introduction to the fields of bioinformatics and computational biology, with a central focus on algorithms and their application to a diverse set of computational problems in molecular biology. While the course will view algorithms applied through the lens of biology, the themes of the course drive at the heart of the larger "Big Data" phenomena. Computational themes will include dynamic programming, greedy algorithms, supervised learning and classification, data clustering, tree inference, graphical models, data management, and structured data representation. Applications will include genetic sequence analysis, pairwise-sequence alignment, phylogenetic trees, motif finding, gene-expression analysis, and protein-structure prediction.

While significant time will be spent exploring the biological significance of problems, the central focus in this course will be on understanding how to develop algorithms for complex problems. In particular, the general question we will answer is "How does one reason about large amounts of complex data?" That is, how do we uncover underlying phenomena and draw conclusions in the face of large data sets with noisy, intricate relationships? While this question is presented in context of problems in molecular biology, it applies to open problems across all of the sciences. We will see that many of the algorithms we cover have applications and foundations in far-reaching domains including natural language, social network analysis, security, and search.

To enroll in this course you must have completed CPSC 35. There is no requirement for prior biology experience. The course will also cover a good deal of probability theory, but much of this can be picked up with provided reading. This course is designated as a natural sciences and engineering practicum (NSEP) and qualifies as a Group 3: Applications course for the CS major/minor requirements. It also cross-listed as BIOL 68 and counts as a non-lab course towards a biology major.

Required Course Textbook

Biological Sequence Analysis by Durbin, Eddy, Krogh, and Mitchison.

Additional references