Course ObjectiveAfter taking this course, you will be able to:
- Understand the fundamentals of the most important problems that modern
Web companies face daily;
- Process large amounts of Web data efficiently using state-of-the-art
tools that are currently used in the Web industry;
- Extract useful insights from raw data available on the Web;
- Adapt or reuse techniques used on the Web to other fields (e.g. Data
Mining, Artificial Intelligence) where similar problems might occur.
Course ContentThe Web constitutes the largest repository of knowledge that is
available to mankind, and its impact on modern society is unprecedented
at many levels. Many Web companies are valued with billion dollar
quotations and are now central to our modern life.
The key players in the Web industry must face numerous challenges that
are concerned with the size, distribution, heterogeneity, and the
uncontrolled nature of the Web. Systems to process Web data require the
application of a combination of techniques spanning databases,
distributed systems, data mining, and artificial intelligence.
The goal of this course is to introduce the student to the most advanced
systems and techniques which deal with Web data. Important classes of
- the storage and retrieval of Web data (How can we store and retrieve
information from large social networks, graphs, or large volumes of
- efficient entity disambiguation (What is a particular web page talking
- large-scale knowledge extraction (What sort of knowledge can we
extract from web documents -- e.g. Wikipedia?)
- effective link prediction (Is there a connection between two
- expressive ontological inference (Can current knowledge lead to more
- trust (Can we trust the content on a certain blog post?)
This course will describe techniques to perform these tasks with a
particular emphasis on scalability, which is a crucial aspect in this
domain. In order to better understand the challenges and effectiveness
of current solutions, the student will be called to implement practical
assignments on realistic Web data. These assignments will be part of the
final evaluation of the course.
Teaching MethodsThe course takes the form of lectures and practical assignments.
Method of AssessmentA combination of exams and group homework assessments.
LiteratureA mixture of scientific publications and other material available on the
|Language of Tuition||English|
|Faculty||Faculty of Science|
|Course Coordinator||J. Urbani|
You need to register for this course yourself
Last-minute registration is available for this course.
This course is also available as: