Syllabus academic year 2010/2011
(Created 2010-07-25.)
WEB INTELLIGENCE AND INFORMATION RETRIEVAL EITN01
Credits: 7.5. Grading scale: TH. Cycle: A (Second Cycle). Main field: Technology. Language of instruction: The course might be given in English. EITN01 overlaps the following course: EIT031. Optional for: C4, D4, D4ks. Course coordinator: Anders Ardö, Anders.Ardo@eit.lth.se, Electrical and Information Technology. Prerequisites: EDA011 or EDA016 Programming, First Course. Recommended prerequisites: FMA420 Linear Algebra. Assessment: Written examination, passed laboratories and assignments. Home page: http://www.eit.lth.se/course/eitn01.

Aim
The goal of this course is to increase the understanding of methods for information retrieval, structuring and text mining, especially from Internet-based sources.

Knowledge and understanding
For a passing grade the student must

Skills and abilities
For a passing grade the student must

Judgement and approach
For a passing grade the student must

Contents
Information Retrieval: basic methods for ranking and searching, vector models, tf-idf relevance ranking. Information Retrieval systems.
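
As an illustration of the vector model with tf-idf relevance ranking named above, the following is a minimal sketch, not the course's reference implementation. It assumes whitespace tokenisation, raw term frequency, idf = log(N/df) and cosine similarity; the function names cosine and rank are hypothetical.

```python
import math
from collections import Counter


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as {term: weight}."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank(query, docs):
    """Return (score, document index) pairs, best match first."""
    # Assumption: lower-casing and whitespace splitting stand in for real tokenisation.
    tokenized = [d.lower().split() for d in docs]
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(t for tokens in tokenized for t in set(tokens))
    idf = {t: math.log(n_docs / df[t]) for t in df}

    def vec(tokens):
        # Weight = raw term frequency * idf; terms unseen in the collection are dropped.
        tf = Counter(tokens)
        return {t: tf[t] * idf[t] for t in tf if t in idf}

    q = vec(query.lower().split())
    scores = [(cosine(q, vec(tokens)), i) for i, tokens in enumerate(tokenized)]
    # Sort by descending score, breaking ties by document index.
    return sorted(scores, key=lambda s: (-s[0], s[1]))


docs = [
    "web search ranking",
    "linear algebra notes",
    "web crawler and web index",
]
for score, i in rank("web ranking", docs):
    print(f"{score:.3f}  {docs[i]}")
```

Other weighting schemes (logarithmic term frequency, smoothed idf) fit the same structure; only the vec function would change.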

Query Language: Different query languages for search in structured databases are presented.

Structured information: Indexing, searching and relevance ranking of search results, exemplified by searches in structured databases (SRU/CQL).
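
To make the SRU/CQL example more concrete, the sketch below builds the URL of an SRU searchRetrieve request carrying a CQL query. It is only an illustration under stated assumptions: the base URL is a made-up placeholder, the helper name build_sru_url is hypothetical, and the index names dc.title and dc.creator are assumed to be supported by the target server.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def build_sru_url(base_url, cql_query, max_records=10):
    """Build an SRU searchRetrieve request URL for the given CQL query."""
    params = {
        "operation": "searchRetrieve",
        "version": "1.1",               # assumption: the server speaks SRU 1.1
        "query": cql_query,             # the CQL query string, URL-encoded below
        "maximumRecords": max_records,  # upper bound on records returned
    }
    return base_url + "?" + urlencode(params)


# Hypothetical endpoint; replace with a real SRU server.
cql = 'dc.title = "information retrieval" and dc.creator = "Baeza-Yates"'
url = build_sru_url("http://sru.example.org/catalogue", cql)
print(url)
```

The server would answer with an XML document listing the matching records; parsing that response is left out of this sketch.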

Feature extraction: Extraction of properties and features from text documents.

Basic methods for classification and knowledge extraction (such as neural networks and support vector machines) are presented and experimented with. The extracted features are used to implement topic classification of text documents.

Performance: Performance indicators such as precision and recall.
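
For reference, precision is the fraction of retrieved documents that are relevant, and recall is the fraction of relevant documents that were retrieved. A minimal sketch of both measures follows, assuming documents are identified by hashable ids; the function name precision_recall is hypothetical.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for a set of retrieved and a set of relevant ids."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)  # relevant documents that were retrieved
    # Convention assumed here: an empty denominator yields 0.0.
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Four documents retrieved, three relevant in total, two in common.
p, r = precision_recall(retrieved=[1, 2, 3, 4], relevant=[2, 4, 5])
print(f"precision={p:.2f} recall={r:.2f}")
```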

Literature
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley 1999. ISBN: 0-201-39829-X
Articles and documents from the Web.
Course notes and labs