Data Mining Techniques

This course is taught in English. Descriptions may therefore be available in English only.

Course objective

The aim of the course is for students to acquire data mining knowledge
and skills that they can apply in a business environment. Students will
acquire this knowledge and these skills mainly through the following: an
overview of the most common data mining algorithms and techniques (in
lectures), a survey of typical and interesting data mining applications,
and practical assignments to gain hands-on experience. The application
of skills in a business environment will be simulated through the
course assignments.

Course content

The course provides a survey of basic data mining techniques and
their applications to real-life problems. After a general
introduction to data mining, we will discuss some "classical" algorithms
such as Naive Bayes, Decision Trees, and Association Rules, among others,
and some more recent methods such as boosting, Support Vector Machines,
and co-learning. A number of successful applications of data mining will
also be discussed: marketing, fraud detection, text and Web mining, and
possibly bioinformatics. In addition to lectures, there will be an
extensive practical part, where students will experiment with various
data mining algorithms and data sets. The grade for the course will be
based on these practical assignments (i.e., there will be no final


Teaching methods

Lectures (h) and compulsory practical work (pra). Lectures are planned
to be interactive: there will be short questions, etc.


Method of assessment

Practical assignments (i.e. there is no exam). There will be two
assignments, done in groups of three. It is possible to obtain a grade
without doing these assignments by carrying out a real research project
instead (which will most likely involve more work, but can also be
more rewarding). For the regular assignments, the first assignment counts
for 40% and the second for 60%. The grades for both assignments must
be sufficient to pass the course.
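
The weighting above can be illustrated with a small sketch. It assumes grades on a 1-10 scale and a pass mark of 5.5; neither is stated in this course description, and the function name course_grade is hypothetical.

```python
def course_grade(first, second, pass_mark=5.5):
    """Combine the two regular assignment grades.

    The 40%/60% weights come from the course description.
    pass_mark=5.5 is an ASSUMPTION; the description only says
    both grades must be "sufficient".
    """
    weighted = 0.4 * first + 0.6 * second
    # Both assignments must be sufficient on their own, so a high
    # second grade cannot compensate for an insufficient first one.
    passed = first >= pass_mark and second >= pass_mark
    return weighted, passed


grade, passed = course_grade(7.0, 8.0)
print(f"{grade:.1f}", passed)
```

Under these assumptions, a group scoring 9.0 and 5.0 would have a weighted grade above the pass mark but would still not pass, because the second assignment on its own is insufficient.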


Literature

Ian H. Witten, Eibe Frank, Mark A. Hall, Data Mining: Practical Machine
Learning Tools and Techniques (Third Edition). Morgan Kaufmann, January
2011. ISBN 978-0-12-374856-0. The second and fourth editions can also be


Target audience

mBA, mCS, mAI, mBio

Recommended background knowledge

Kansrekening en Statistiek or Algemene Statistiek (knowledge of
statistics and probability) or equivalent. Recommended: Machine

General information

Course code X_400108
Credits 6 EC
Period P5
Course level 500
Language of instruction English
Faculty Faculteit der Bètawetenschappen
Course coordinator dr. M. Hoogendoorn
Examiner dr. M. Hoogendoorn
Lecturers dr. M. Hoogendoorn

Practical information

You must register for this course yourself.

Last-minute registration is available for this course.

Teaching formats Lecture, Computer lab

This course is also available as: