Multivariate Statistics in Ecology and Quantitative Genetics SS 2014
Lecture with Exercises (Vorlesung mit Übung)
Instructors
Prof. Dr. Dirk
Metzler and Dr. Noémie Becker
Time and rooms
June 23 2014 until July 11 2014
Lecture: Each day from 9:00 to
10:30, usually in room D00.013. Exceptions: on July 3 room B01.019, on July 7
in room B01.015
Exercises: Usually Tuesday to Friday from 1 pm to 5 pm (sometimes 6 pm).
The exam will take place on Friday, July 11, from 3:00 p.m. to 4:30 p.m.
in
lecture hall B01.019.
Participants can bring non-programmable pocket calculators
an their personal assortment of formulas on an A4 sheet. This A4 can have any contents on both
sides BUT ONLY IN YOUR OWN HANDWRITING! (no copy, not print-out, not another
person's handwriting)
Please note: If you fully attend the course (which is required to get
the ECTS points) you will need the time between the 11 a.m. and 5 p.m. to
solve exercises, which will be given in the morning lectures and discussed in the
afternoon courses.
Requirements
Basic knowledge of statistics. If you are not already familiar with the
free R software, it may be a good idea
to install it on your computer and get a bit familiar with it before the
course starts. Some
examples (data
file FinchesSulloway.txt), R
tutorial, Martin's introductory R course.
Contents
In the exercises you will practice to analyse data with the methods listed below using the R software.
We will discuss the interpretations of
the results and problems that may arise during the analysis. In the
lectures we will explain the theoretical concepts behind the methods and
show how these methods can be applied.
If you would like to discuss analyses of your own
datasets during the course, please contact D. Metzler or N. Becker as soon as possible.
- Fundamental methods: Multivariate linear regression, Model
Selection, Analysis of Variance (ANOVA), Experimental
designs (nested, balanced, unbalanced,...), Graphical Inspection
of Fitted Models, Generalized Linear Models (GLM), Mixed
Effects Models
R-package: lme4
- Special Methods for complex ecological datasets:
ANCOVA,
Redundancy Analysis, Principal Component Analysis (PCA), Canonical
Correspondence Analysis,....
R-package: vegan
(see also
the ecology
task view)
- Analysis of Gene Expression Data (Microarray
data).
Multiple-Testing Problems, Variance-Stabilizing
Normalization, Regularization, reconciliation with gene ontologies (GO)...
R-packages: vsn,
limma,
multtest,
GOstats
- Special
Methods in Genetics: Mapping Quantitative Traits Loci (QTLs).
Keywords:
Haley-Knott regression, Composite Interval Mapping, Multiple-QTL
models,...
R-package: qtl
(see also
the genetics task
view)
Language: English
Course material
-
Linear Models: slides, R commands for
Darwin finches, Darwin finches data, exercises
sheet 1, abcdx.txt, bp.txt
-
Balanced design ANOVAs, parameter transformations: slides,
R-script, exercises sheet
2, bacteria_trainig.txt, bacteria_predict.txt
-
Generalized Linear Models: slides,
exercise sheet 3,
TbDeerAndBoar.txt
-
Hotellings T2-test slides,
R-script,
raspberry.csv
exercise sheet 4,
grapes.txt
-
Principal Component Analysis (PCA): slides,
R-script,
exercise sheet 5,
HeightShoeWeight.txt
EWU.txt
-
Redundancy Analysis (RDA): slides,
R-script,
exercise sheet 6,
EWU.txt,
HSWoutlier.txt,
artificialFishes.txt,
RIKZGroups.txt
-
Correspondence Analysis (CA): slides,
R-script,
MexicanPlants.txt
-
Canonical Correspondence Analysis (CCA): slides,
R-script,
exercise sheet 7,
-
Mixed-effects
models slides, lme4a.R, mcmcglmm.R,
exercise sheet 8, fruits.txt
-
QTL Mapping slides, R-script, exercise sheet 9
-
Genexpression data and the problem of more parameters than
observations slides
, R-script
-
Some aspects of genome-wide association studies (GWAS)
slides
, exercise sheet 10
Material will be added during the course.
Here is the course material from 2013.
Literature
-
Introductory statistics with R / Peter Dalgaard (Springer 2002)
-
Biometry: The Principles and Practice of Statistics in
Biological Research (3rd Ed.) / Sokal, Rohlf, (Palgrave Macmillan 1995)
-
Mixed effects models and extensions in ecology with R /
Alain F. Zuur, Elena N. Ieno, N. J. Walker,
Anatoly A. Saveliev, Graham M. Smith.
(Springer 2009)
-
Modern applied statistics with S / W. N. Venables ; B. D. Ripley. - 4.
ed. (Springer 2002)
-
A Guide to QTL Mapping with R/qtl /
Karl W. Broman & Saunak Sen (Springer 2009)
-
Analysing ecological data / Alain F. Zuur ; Elena N. Ieno ; Graham M.
Smith. (Springer 2007)
-
Genetics and analysis of quantitative traits / Michael Lynch ; Bruce
Walsh. (Sinauer 1998)
-
Introduction to quantitative genetics / D. S. Falconer and Trudy F. C.
Mackay. - 4. ed. (Pearson 1996)
-
Numerical ecology / Legendre & Legendre (Elsevier 1998)
-
Statistical Genetics of Quantitative Traits / Wu, Ma & Casella. (Springer 2007)
-
Bioinformatics and Computational Biology Solutions Using R and Bioconductor /
R. Gentleman, V.J. Carey, W. Huber, R.A. Irizarry, S. Dudoit (Eds.). (Springer
2005)
-
Mixed effects models in S and S-PLUS / Jose C. Pinheiro ; Douglas M.
Bates. (Springer, 2004)
Last update: 10. July 2014