Course syllabus

# Linjär och logistisk regression med datainsamling

Linear and Logistic Regression with Data Gathering

## FMSN40, 9 credits, A (Second Cycle)

## General Information

Valid for: 2020/21

Decided by: PLED I

Date of Decision: 2020-04-03

Main field: Technology.

Elective Compulsory for: I3

Language of instruction: The course will be given in English

## Aim

Regression analysis deals with modelling how one characteristic (height, weight, price, concentration, etc.) varies with one or several other characteristics (sex, living area, expenditures, temperature, etc.). Linear regression is introduced in the basic course in mathematical statistics, but here we expand on it with questions such as "how do I check that the model fits the data?", "what should I do if it doesn't fit?", "how uncertain is it?", and "how do I use it to draw conclusions about reality?".

When performing a survey where people can answer yes/no, little/just fine/much, car/bicycle/bus, or some other categorical alternative, you cannot use linear regression. You need logistic regression instead. This is the topic of the second half of the course.

As part of the course you construct a questionnaire or experimental plan for a problem of your choice, collect the data, and analyse it using a suitable regression model.

## Learning outcomes

Knowledge and understanding

For a passing grade the student must

- Describe the differences between continuous and discrete data, and the resulting consequences for the choice of statistical model,
- Give an account of the principles behind different estimation methods,
- Describe the statistical properties of such estimates as appear in regression analysis,
- Interpret regression relations in terms of conditional distributions,
- Explain the concepts odds and odds ratio, and describe their relation to probabilities and to logistic regression.
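As a rough illustration of the last outcome above, the following minimal sketch shows how odds, odds ratios and probabilities relate in a logistic regression with a single regressor. The function names and the coefficient values are hypothetical and chosen only for illustration; they are not taken from the course material.

```python
import math


def odds(p: float) -> float:
    """Odds corresponding to a probability p, with 0 <= p < 1."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must satisfy 0 <= p < 1")
    return p / (1.0 - p)


def probability(o: float) -> float:
    """Probability corresponding to odds o >= 0 (the inverse of odds)."""
    if o < 0.0:
        raise ValueError("odds must be non-negative")
    return o / (1.0 + o)


def odds_ratio(p1: float, p0: float) -> float:
    """Ratio between the odds at probability p1 and the odds at p0."""
    return odds(p1) / odds(p0)


def logistic_probability(beta0: float, beta1: float, x: float) -> float:
    """P(Y = 1 | x) when the log-odds are modelled as beta0 + beta1 * x."""
    return 1.0 / (1.0 + math.exp(-(beta0 + beta1 * x)))


# Hypothetical coefficients, used only to illustrate the relation.
beta0, beta1 = -1.0, 0.7
p_at_0 = logistic_probability(beta0, beta1, 0.0)
p_at_1 = logistic_probability(beta0, beta1, 1.0)

# Increasing x by one unit multiplies the odds by exp(beta1),
# so the two printed numbers agree up to rounding error.
print(odds_ratio(p_at_1, p_at_0), math.exp(beta1))
```

In this sketch the odds ratio for a one-unit change in the regressor equals the exponential of its coefficient, which is the usual way logistic regression parameters are interpreted.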

Competences and skills

For a passing grade the student must

- Formulate a multiple linear regression model for a concrete problem,
- Formulate a multiple logistic regression model for a concrete problem,
- Estimate the parameters in the regression model and interpret them,
- Examine the validity of the model and make suitable modifications of the model,
- Use the resulting model for prediction,
- Use some statistical computer program for analysis of regression data, and interpret the results,
- Present the analysis and conclusions of a practical problem in a written report and an oral presentation,
- Construct a form for data collection that can be used to answer a particular practical problem.

Judgement and approach

For a passing grade the student must

- Always check the prerequisites before stating a regression model,
- Evaluate the plausibility of a performed study,
- Reflect on the limitations of the chosen model and estimation method, as well as alternative solutions.

## Contents

Least squares and the maximum likelihood method; odds ratios; Multiple and linear regression; Matrix formulation; Methods for model validation, residuals, outliers, influential observations, multicollinearity, change of variables; Choice of regressors, F-test, likelihood ratio test; Confidence intervals and prediction. Introduction to: Correlated errors, Poisson regression as well as multinomial and ordinal logistic regression. Questionnaire construction and design of experiments.
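To give a flavour of the matrix formulation of least squares listed above, here is a minimal sketch using NumPy on simulated data. The data, the variable names and the choice of NumPy are assumptions made for illustration only; the syllabus does not prescribe a particular statistical computer program.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed so the sketch is reproducible

# Simulated (hypothetical) data: n observations and two regressors.
n = 50
x1 = rng.uniform(0.0, 10.0, size=n)
x2 = rng.uniform(0.0, 5.0, size=n)
y = 2.0 + 1.5 * x1 - 0.8 * x2 + rng.normal(0.0, 1.0, size=n)

# Design matrix with a leading column of ones for the intercept.
X = np.column_stack([np.ones(n), x1, x2])

# Least-squares estimate: the beta that minimises ||y - X beta||^2.
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)

# Residuals and the usual unbiased estimate of the error variance.
residuals = y - X @ beta_hat
p = X.shape[1]
sigma2_hat = residuals @ residuals / (n - p)

# Estimated covariance matrix of beta_hat: sigma^2 (X'X)^{-1}.
cov_beta = sigma2_hat * np.linalg.inv(X.T @ X)
std_errors = np.sqrt(np.diag(cov_beta))

print("estimates:", beta_hat)
print("standard errors:", std_errors)
```

The residuals from such a fit are orthogonal to every column of the design matrix, which is one simple check that the estimate solves the normal equations.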

## Examination details

Grading scale: TH - (U,3,4,5) - (Fail, Three, Four, Five)

Assessment: Written and oral project presentation, peer assessment and oral examination.

The examiner, in consultation with Disability Support Services, may deviate from the regular form of examination in order to provide a permanently disabled student with a form of examination equivalent to that of a student without a disability.

Parts

Code: 0117. Name: Examination.

Credits: 3. Grading scale: TH. Assessment: Oral examination

Code: 0217. Name: Project 1.

Credits: 1.5. Grading scale: UG. Assessment: Written project report and peer assessment.
Contents: Linear regression

Code: 0317. Name: Project 2.

Credits: 1.5. Grading scale: UG. Assessment: Written project report and peer assessment.
Contents: Logistic regression

Code: 0417. Name: Project 3.

Credits: 2.5. Grading scale: UG. Assessment: Written project plan, data gathering and oral project presentation.
Contents: The student's own regression problem

Code: 0517. Name: Laboratory Work.

Credits: 0.5. Grading scale: UG. Assessment: Computer exercises

## Admission

Admission requirements:

- FMSF20 Mathematical Statistics, Basic Course or FMSF25 Mathematical Statistics - Complementary Project or FMSF45 Mathematical Statistics, Basic Course or FMSF50 Mathematical Statistics, Basic Course or FMSF55 Mathematical Statistics, Basic Course or FMSF70 Mathematical Statistics or FMSF75 Mathematical Statistics, Basic Course

The number of participants is not limited.

The course overlaps with the following courses: FMSN30, MASM22

## Reading list

- Rawlings, J.O., Pantula, S.G., Dickey, D.A.: Applied Regression Analysis - A Research Tool, 2nd ed. Springer, 1998, ISBN: 0-387-98454-2. Available as e-book.
- Agresti, A.: An Introduction to Categorical Data Analysis, 2nd ed. Wiley, 2007, ISBN: 978-0-471-22618-5. Available as e-book.

## Contact and other information

Director of studies: Johan Lindström, studierektor@matstat.lu.se

Course homepage: http://www.maths.lth.se/matstat/kurser/fmsn40/

Further information: Only one of the courses FMSN30 and FMSN40 may be included in a degree.