Course manual 2019/2020

Course content

In the analysis of biochemical systems, many measurements are performed, leading to complex multivariate data sets. The tendency is to measure more and more of just a few samples. Multivariate data analysis methods are often used to explore such sets.

This course covers a broad range of multivariate data analysis methods, for e.g. exploration, clustering, classification. The latter is especially important in biomarker discovery. Design of experiments and ANOVA for multivariate data is also discussed. Furthermore, the interpretation of selected features in terms of function and networks is discussed.The course starts with an introduction on the properties of the different types of functional genomics data.

The main goal of this course is to teach students how to interpret the results of the multivariate methods and how this relates to the biological problem that is studied. 


Study materials

Literature

  • Material is available at the Canvas site.

Syllabus

  • Material is available on the Canvas site.

Practical training material

  • Training material is available on the Canvas site.

Software

  • R

Objectives

  • Students know the origin of the data and the specific properties of the data.
  • Students understand how the data analysis methods work theoretically.
  • Students are able to apply the methods, and they are able to interpret the results.
  • Students comprehend the pitfalls of multivariate data and validation strategies to prevent overfit.
  • Students are able to critically review data analysis applications in which the above mentioned methods have been used.
  • Students are able to select the most appropriate method for a given biological question.

Teaching methods

  • Lecture
  • Computer lab session/practical training
  • Self-study

At the start of the course students have sufficient knowledge ofR, linear algebra and basic statistics. Students are expected to work through the provided R tutorial (which also contains basic linear algebra) when this is not the case.

Each topic of 8 topics starts with self-study of provided material (4h). A two hour 2 hour lecture introduces the specific topic which is followed by 4 hours of computer exercises. 

Worked out exercises are provided after the computer labs are finished. Students can ask for feedback at the start of the next computer lab.

 

Learning activities

Activity

Number of hours

Digitale Toets

3

Zelfstudie

165

Assessment

Item and weight Details

Final grade

1 (100%)

Digitale Toets

Both exams are "closed book". It is not allowed to take any material to the exam.

Fraud and plagiarism

The 'Regulations governing fraud and plagiarism for UvA students' applies to this course. This will be monitored carefully. Upon suspicion of fraud or plagiarism the Examinations Board of the programme will be informed. For the 'Regulations governing fraud and plagiarism for UvA students' see: www.student.uva.nl

Course structure

Weeknummer Onderwerpen Studiestof
1
2
3
4

Timetable

The schedule for this course is published on DataNose.

Additional information


Matlab, Linear Algebra, introduction level Statistics.

The maximum number of participants is 60. Admission to the course will depend on capacity, total number of applications, date of registration and background of the individual student. If the number of applications exceeds the capacity of the course, students may have to be selected and priority will be given in the following order:

  • First-year students of the master Life Sciences (UvA and VU)
  • Second-year students of the master Life Sciences (UvA and VU)
  • Students of the master Computational Sciences
  • Students of the master Chemistry
  • Students of the master Forensic Sciences
  • Students of other master programmes

Contact information

Coordinator

  • dr. J.A. Westerhuis