Course syllabus

Språkteknologi
Language Technology

EDAN20, 7,5 credits, A (Second Cycle)

Valid for: 2019/20
Decided by: PLED C/D
Date of Decision: 2019-04-01

General Information

Elective for: C4-pv, D4-pv, D4-mai, E4-bg, F4, F4-pv, F4-mai, Pi4-pv, Pi4-bam
Language of instruction: The course will be given in English

Aim

Over the past 15 years, language technology has matured considerably, driven by the massive increase in textual and spoken data and the need to process them automatically. Although few systems are entirely dedicated to language processing, there are now scores of applications that are to some extent "language-enabled" and embed language processing techniques, such as spelling and grammar checkers, information retrieval and extraction systems, and spoken dialogue systems. This makes knowledge of the field a new requirement for CS engineers.

The course introduces theories used in language technology. It attempts to cover the whole field, from character encoding and statistical language models, through syntax and parsing, to semantics and conversational agents. It focuses on proven techniques as well as significant industrial or laboratory applications.

Learning outcomes

Knowledge and understanding
For a passing grade the student must

Competences and skills
For a passing grade the student must

Judgement and approach
For a passing grade the student must

Contents

Examination details

Grading scale: TH - (U,3,4,5) - (Fail, Three, Four, Five)
Assessment: Compulsory course items: assignments and possibly an examination. The coursework assignments are carried out in teams of two students, but can also be carried out individually. The first laboratory session is dedicated to a hands-on introduction to the programming tools used in the course. The assignments then consist of six programming problems and individual reports. Passing all the assignments gives a passing grade for the course with a mark of 3. Optionally, students may sit an examination to improve their mark to 4 or 5.

The examiner, in consultation with Disability Support Services, may deviate from the regular form of examination in order to provide a permanently disabled student with a form of examination equivalent to that of a student without a disability.

Parts
Code: 0113. Name: Statistical Techniques for Text Analysis.
Credits: 3,5. Grading scale: UG. Assessment: To qualify for a passing grade the laboratory work must be completed. Contents: Laboratory work.
Code: 0213. Name: Syntactic and Semantic Processing of Text.
Credits: 4. Grading scale: UG. Assessment: To qualify for a passing grade the laboratory work must be completed. Contents: Laboratory work.
Code: 0313. Name: Written Examination.
Credits: 0. Grading scale: TH. Assessment: Passing all the assignments gives a passing grade for the course with a mark of 3. Optionally, students may take the written examination to improve their mark to 4 or 5. Contents: Optional written examination.

Admission

Admission requirements:

The number of participants is limited to: No
The course overlaps the following course/s: EDA171

Reading list

Contact and other information

Course coordinator: Professor Pierre Nugues, Pierre.Nugues@cs.lth.se
Course homepage: http://cs.lth.se/edan20