Syllabus academic year 2011/2012
(Created 2011-09-01.)
Credits: 7.5.
Grading scale: TH.
Cycle: A (Second Cycle).
Main field: Technology.
Language of instruction: The course might be given in English.
EITN01 overlaps the following course: EIT031.
Optional for: C4, D4, D4ks.
Course coordinator: Associate Professor Anders Ardö, Electrical and Information Technology.
Prerequisites: EDA011 or EDA016 Programming, First Course.
Recommended prerequisites: FMA420 Linear Algebra.
Assessment: Written examination, passed laboratories and assignments.
Home page:

The goal of this course is to increase the understanding of methods for information retrieval, structuring and text mining, especially from Internet-based sources.

Knowledge and understanding
For a passing grade the student must

Skills and abilities
For a passing grade the student must

Judgement and approach
For a passing grade the student must

Information Retrieval: basic methods for ranking and searching, vector models, tf-idf relevance ranking. Information Retrieval systems.
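
As an illustration of vector-model ranking with tf-idf weights, the following is a minimal sketch, not the course's reference implementation. It assumes whitespace tokenisation, raw term counts as tf, idf = log(N/df), and cosine similarity as the relevance score; the function names (`tokenize`, `tf_idf_vectors`, `cosine`, `rank`) are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase + whitespace split stands in for real tokenisation.
    return text.lower().split()


def tf_idf_vectors(docs):
    """Return one sparse tf-idf vector (dict) per document, plus the idf table."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for toks in tokenized for term in set(toks))
    idf = {term: math.log(n_docs / df[term]) for term in df}
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        vectors.append({term: tf[term] * idf[term] for term in tf})
    return vectors, idf


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def rank(query, docs):
    """Return (score, document index) pairs, best match first."""
    vectors, idf = tf_idf_vectors(docs)
    q_tf = Counter(tokenize(query))
    # Query terms unseen in the collection get weight 0.
    q_vec = {term: q_tf[term] * idf.get(term, 0.0) for term in q_tf}
    scores = [(cosine(q_vec, vec), i) for i, vec in enumerate(vectors)]
    return sorted(scores, reverse=True)
```

Calling `rank("information retrieval", docs)` on a small list of strings returns the documents ordered by decreasing cosine similarity to the query. Terms that occur in every document get idf 0 and so do not influence the ranking.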

Query languages: Different query languages for searching structured databases are presented.

Structured information: Indexing, searching and relevance ranking of search results. Exemplified with the aid of searches in structured databases (SRU/CQL).
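
To make the SRU/CQL example concrete, here is a small sketch of how a CQL query might be packaged into an SRU `searchRetrieve` request URL. The endpoint `http://sru.example.org/catalog` is a placeholder, the helper name `sru_search_url` is hypothetical, and the choice of SRU version 1.1 and of the `dc` index names is an assumption; a real server may support different versions and indexes.

```python
from urllib.parse import urlencode


def sru_search_url(base_url, cql_query, max_records=10):
    """Build an SRU searchRetrieve URL carrying a CQL query."""
    params = {
        "operation": "searchRetrieve",  # standard SRU operation name
        "version": "1.1",               # assumption: server speaks SRU 1.1
        "query": cql_query,             # the CQL query string
        "maximumRecords": max_records,  # cap on the number of returned records
    }
    # urlencode percent-encodes quotes, spaces and '=' inside the CQL query.
    return base_url + "?" + urlencode(params)


# Hypothetical endpoint and an example CQL query on title and creator.
example_url = sru_search_url(
    "http://sru.example.org/catalog",
    'dc.title = "information retrieval" and dc.creator = "Baeza-Yates"',
    max_records=5,
)
```

The server's response would be an XML document listing the matching records; parsing it is left out of this sketch.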

Feature extraction: Extracting properties and features from text documents.

Classification: Basic methods for classification and knowledge extraction (such as neural networks and support vector machines) are presented and experimented with. Extracted features are used to implement topic classification of text documents.

Performance: Performance indicators such as precision and recall.
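
As a reminder of how these two indicators are defined, the sketch below computes them for a single query. It assumes that results and relevance judgements are given as collections of document identifiers; the function name `precision_recall` is hypothetical.

```python
def precision_recall(retrieved, relevant):
    """Precision = hits / retrieved, recall = hits / relevant, for one query."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)  # relevant documents that were retrieved
    # Convention chosen here: an empty set yields 0.0 rather than an error.
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall
```

For example, if a system retrieves documents 1, 2, 3 and 4 while documents 2, 4 and 6 are relevant, two of the four retrieved documents are relevant (precision 0.5) and two of the three relevant documents were found (recall 2/3).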

Baeza-Yates, R, Ribeiro-Neto, B: Modern Information Retrieval.
Addison-Wesley 1999. ISBN: 0-201-39829-X
Articles and documents from the Web.
Course notes and labs