Bayesian Statistics, Simulation and Software
- with a View to Application Examples
Joint PhD Course, Spring 2011
Aalborg University
Information
-
Organisers and lecturers:
Kasper K. Berthelsen, Associate Professor
(e-mail: kkb (snabel-a) math.aau.dk) and
Søren L. Buhl, Associate Professor
(e-mail: slb (snabel-a) math.aau.dk),
Department of Mathematical Sciences, Aalborg University.
-
Place: Fredrik Bajers Vej 7G, Room G5-108 Aalborg University
-
You are expected to bring a laptop.
If possible, before the course starts install the free statistics
software
R.
You can wait until the second course day before installing
OpenBUGS.
You can find a guide here on how to
install these packages.
-
There is wireless access in the lecture room
(at least for people with a AAU e-mail account or using a VPN
client). If everything fails there is also wired internet access.
-
Course material:
-
Basics of probability theory.
-
Exercises to Basics of probability theory.
-
Basic methods for simulation of random variables:
1. Inversion.
-
A brief introduction to (simulation based) Bayesian inference.
-
Basic methods for simulation of random variables:
2. Accept/rejection algorithm.
-
Simulation from specific distributions -
especially the normal distribution.
-
Monte Carlo methods.
-
Model checking based on p-values.
-
Some elementary Markov chain theory.
-
A short diversion into the theory of Markov chains, with a view to
Markov chain Monte Carlo methods.
-
Importance sampling for unnormalized densities
-
Use of WinBUGS, Coda and DoodleBUGS
-
The beta-binomial distribution
-
Bother with the Notation
- Some useful textbooks:
- Peter M. Lee (2004).
Bayesian Statistics: an introduction, 3rd ed.
Arnold.
- Andrew Gelman, et al. (2003).
Bayesian Data Analysis, 2nd ed.
Chapman & Hall/CRC.
- Olle Häggström (2002).
Finite Markov Chains and Algorithmic Applications.
Cambridge University Press.
- Christian P. Robert and George Casella (2004).
Monte Carlo Statistical Methods, 2nd ed.
Springer.
- Maria Rizzo (2007).
Statistical Computing with R.
Chapman & Hall/CRC.
- Ioannis Ntzoufras (2009).
Bayesian Modeling Using WinBUGS.
Wiley.
Programme
This is an intensive PhD course on which you should expect to
spend most of your working effort during the period
Moreover, you should prepare yourself before joining each
day's programme.
A tentative programme is given below; it may very well be
adjusted as the course goes on!
1) Friday 27th May 9:00-16:00
- Morning
- It is important that you have R
installed on your computer!
- Introduction to R. Using this
demo and this
dataset.
- Exercises in R. Use this
document and this dataset.
- Solutions to the exercises.
- Afternoon
- In the afternoon we will brush-up some basics about probability
(events, conditional probablity, stochatic variables etc), discrete
and continuous distribution. For this you could read "Basics of
probability theory" or corresponding chapters in most books on basic
statistics.
- After the lecture you
should have a go at the exercises in "Exercises to Basics of
probability theory."
2) Monday 30th May 9:00-16:00
3) Wednesday 1st June 9:00-16:00
4) Friday 3rd June 9:00-16:00
- Morning
- Basic Markov chain theory, read at least Section 2 in "Some elementary
Markov chain theory" which deals with Markov chains with a discrete
state space.
- Markov chains on (more) general state spaces, read Sections 1-4
in "A Short diversion...".
- Exercises from this document.
- Solutions to the exercises.
- Afternoon
- During the lecture we shall be using the
demo and the
exercises for today.
- A simple linear regression using DoodleBUGS (see my demo).
- Programming in R illustrated by the Fibonacci numbers
(see my demo). You could check Sec. 10.1 in the manual
"R Language Definition" which can be found using help.start() in
R.
- Exercise 1 for today.
- A bit of Gibbs samling and Exercise 2 for today.
- Section 2.1 in "A short diversion..." and Exercise 3 for today.
- If time permits, my demo illustrating how to make a presentation
plot.
- Solutions to the exercises.
5) Friday 10th June 9:00-16:00
- Morning
- We continue with Markov chain theory, read Sections 9-10 in "A
Short diversion...". In Section 9 you can skip the example if you
like.
- Exercises from
this document. These are
mainly R exercises. Start with exercise 0. The other
exercises can be done according to what you find most
interesting.
- Here are the slides from the
lecture.
- Solution to the mining exercise
when a and b are fixed.
- Solution to the mining exercise
when a and b are not fixed.
- Solution to the beetles exercise.
- Solution to the banana exercise.
- Afternoon
- Second half of Exercise 3 from Lecture 4.
- A bit about graphical models.
- Exercises with Pump failure data.
Use this document and this
dataset.
- Filerne pump.R og
pump-fig.R.
- The Metropolis-Hastings algorithm.
- Laird Breyer
has made a Java-based M-H simulator.
- Solutions to the exercises.
6) Tuesday 14th June 9:00-16:00
- Morning
- First we will look at prior and posterior predictive
distributions as a tool for model
checking. Read Model checking
based on p-values.
- Next we will look at Metropolis within Gibbs, Section 9 in
"A short diversion...".
- After this you are meant to work on a small project. The
project consists in analysing the rat
tumour data, as described in Exercise 10 and pages 20 to 26
in "A short diversion...".
- Afternoon
- Functions to the rat tumour analysis.
- Same without logarithms in the density.
- Finish the project.
- At 15:45 we meet to evaluate the course.