Algorithms in Sequence Analysis


Course Objective

Have you ever wondered how we can track a gene across 3 billion years of
evolution? Sequence alignment can be used to compare genes from humans
and bacteria, using a dynamic programming algorithm. In this course we
focus on algorithms for biological sequences that can be applied to real
scientific problems in biology.

Students will obtain in-depth knowledge of the theory behind sequence
analysis methods. They will also develop the understanding and skills
needed to apply the algorithms to protein and DNA sequences. We would like
to stress that no biological knowledge is required to enter this course.

- At the end of the course, the student will be aware of the major
issues, methodology and available algorithms in sequence analysis.
- At the end of the course, the student will have hands-on experience in
tackling biological problems using sequence analysis algorithms and
applying the general statistical framework of Hidden Markov Models.
- At the end of the course, the student will be able to implement
several of the most important algorithms in sequence analysis.

Course Content

- Dynamic programming, database searching, pairwise and multiple
alignment, probabilistic methods including hidden Markov models, pattern
matching, entropy measures, evolutionary models, and phylogeny.

- Programming your own alignment algorithm (in Python) based on dynamic
programming
- Reverse translation and dynamic programming
- Homology searching and pattern recognition using biological and
disease examples
- Multiple alignment of biological sequences
- Entropy-based prediction of functional residues
- Programming your own implementation of Hidden Markov Models (in Python)
and using it to predict protein domain structure
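
To give an impression of the first practical topic, the following is a minimal sketch of global pairwise alignment scoring by dynamic programming (in the style of Needleman-Wunsch). It is not the course's reference implementation. The function name and the scoring values (match +1, mismatch -1, linear gap penalty -2) are assumptions chosen only for illustration.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Return the optimal global alignment score of sequences a and b.

    Scoring parameters are illustrative assumptions, not course defaults.
    """
    rows, cols = len(a) + 1, len(b) + 1
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * cols for _ in range(rows)]

    # Aligning a prefix against an empty sequence costs one gap per residue.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap

    for i in range(1, rows):
        for j in range(1, cols):
            substitution = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + substitution,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,               # gap in b
                score[i][j - 1] + gap,               # gap in a
            )
    return score[-1][-1]


if __name__ == "__main__":
    print(global_alignment_score("ACGT", "AGT"))
```

This sketch returns only the score. Recovering the alignment itself would additionally require a traceback through the matrix.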

Teaching Methods

13 Lectures: 2 two-hour lectures per week
13 Computer practicals and associated assignments: 2 two-hour hands-on
sessions per week

Method of Assessment

The final grade for this course will consist of 50% practical work (see
above) and 50% theoretical assessment.
The theoretical assessment will be an oral and/or written exam
(depending on the number of students).

Entry Requirements

Bachelor's degree in any science discipline (including medicine).
Basic programming skills (Python) and an interest in biological


Course material on
Books: Durbin, R., Eddy, S.R., Krogh, A., Mitchison, G. Biological
Sequence Analysis. Cambridge University Press, 1998, 350 pp., ISBN
Recommended reading: Zvelebil, M., Baum, J.O. Understanding
Bioinformatics. Garland Science, 2008, ISBN-10: 0-8153-4024-9

Target Audience

mAI, mBio, mCS

Additional Information

Signing up via is mandatory.
The course is taught in English.

General Information

Course Code X_405050
Credits 6 EC
Period P2
Course Level 400
Language of Tuition English
Faculty Faculty of Science
Course Coordinator prof. dr. J. Heringa
Examiner prof. dr. J. Heringa
Teaching Staff prof. dr. J. Heringa

Practical Information

You need to register for this course yourself

Last-minute registration is available for this course.

Teaching Methods Seminar, Lecture