Syllabus academic year 2008/2009
(Created 2008-07-17.)
WEB INTELLIGENCE AND INFORMATION RETRIEVAL EITN01

Higher education credits: 7.5.
Grading scale: TH.
Level: A (Second level).
Language of instruction: The course might be given in English.
EITN01 overlaps the following course: EIT031.
Optional for: C4, D3.
Course coordinator: Anders Ardö, Anders.Ardo@eit.lth.se, Department of Electrical and Information Technology.
Prerequisites: EDA011 or EDA016 Programming, First Course.
Recommended prerequisites: FMA420 Linear Algebra; EDA132 Artificial Intelligence (good to attend in parallel).
Assessment: Passed laboratories and assignments.
Home page: http://www.eit.lth.se/course/eitn01.

Aim
The goal of this course is to increase the understanding of methods for information retrieval, structuring and text mining, especially from Internet-based sources.

Knowledge and understanding
For a passing grade the student must

Skills and abilities
For a passing grade the student must

Judgement and approach
For a passing grade the student must

Contents
Information Retrieval: basic methods for ranking and searching, vector models, tf-idf relevance ranking. Information Retrieval systems.
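As an illustration of the vector model with tf-idf relevance ranking named above, the following is a minimal sketch and not course material. It assumes whitespace tokenization, raw term frequency, idf = log(N/df), and cosine similarity between query and document vectors; the function names (tokenize, build_tfidf, cosine, rank) are hypothetical and chosen only for this example.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase and split on whitespace; real systems do more.
    return text.lower().split()


def build_tfidf(docs):
    """Return one sparse tf-idf vector (dict) per document, plus the idf table."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents containing each term.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))
    # Assumed idf variant: log(N / df). A term in every document gets weight 0.
    idf = {term: math.log(n_docs / count) for term, count in df.items()}
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append({term: tf[term] * idf[term] for term in tf})
    return vectors, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query, docs):
    """Return (score, document index) pairs, best match first."""
    vectors, idf = build_tfidf(docs)
    q_tf = Counter(tokenize(query))
    # Query terms unseen in the collection are ignored.
    q_vec = {term: q_tf[term] * idf[term] for term in q_tf if term in idf}
    scores = [(cosine(q_vec, vec), i) for i, vec in enumerate(vectors)]
    return sorted(scores, key=lambda pair: (-pair[0], pair[1]))


if __name__ == "__main__":
    docs = [
        "web search ranking",
        "neural networks classify text",
        "ranking text documents on the web",
    ]
    for score, index in rank("neural networks", docs):
        print(round(score, 3), docs[index])
```

Other weighting schemes (for example logarithmic term frequency or smoothed idf) are common; the choice here is only the simplest one to read.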

Query Language: Different query languages for search in structured databases are presented.

Structured information: Indexing, searching and relevance ranking of search results. Exemplified with the aid of searches in structured databases (SRU/CQL).

Feature extraction: Extracting properties and features from text documents.

Computational Intelligence: Basic methods for classification and knowledge extraction (such as Neural Networks and Support Vector Machines) are presented and experimented with. Extracted features are used to implement topic classification of text documents.

Performance: Performance indicators like precision and recall.
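For reference, precision is the fraction of retrieved documents that are relevant, and recall is the fraction of relevant documents that are retrieved. A minimal sketch of both measures follows; the function name precision_recall and the document identifiers are hypothetical, and returning 0.0 for an empty set is an assumed convention.

```python
def precision_recall(retrieved, relevant):
    """Compute (precision, recall) for one query.

    retrieved: identifiers of the documents the system returned
    relevant:  identifiers of the documents judged relevant
    """
    retrieved = set(retrieved)
    relevant = set(relevant)
    hits = len(retrieved & relevant)
    # Assumed convention: an empty denominator gives 0.0 instead of an error.
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


if __name__ == "__main__":
    # Four documents returned, three relevant in total, two in common.
    p, r = precision_recall([1, 2, 3, 4], [2, 4, 6])
    print(f"precision={p:.2f} recall={r:.2f}")
```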

Literature
Ricardo Baeza-Yates, Berthier Ribeiro-Neto: "Modern Information Retrieval", Addison-Wesley 1999, ISBN: 0-201-39829-X.
Articles and documents from the Web.
Course notes and labs.