- Department: Mathematics
- Credit value: 20 credits
- Credit level: I
- Academic year of delivery: 2024-25
- See module specification for other years: 2023-24
This module will equip students with the theoretical foundations of data science.
Used for IFoA exemption purposes.
Pre-requisite modules
Co-requisite modules
- None
Prohibited combinations
- None
Occurrence | Teaching period |
---|---|
A | Semester 1 2024-25 |
This module will give students a theoretical and mathematically formal framework for understanding the foundations of data science. Students will learn how to work with multiple random variables in a variety of settings: joint and conditional distributions will be developed, along with estimators and convergence theorems, and Markov chains will be introduced to deal with random variables indexed by discrete time. Further familiarity with the statistical software R will be developed throughout.
By the end of the module, students will be able to:
Perform computations involving the joint and conditional distributions and the related expectations.
Compute generating functions of standard distributions, apply them to obtain expectation and variance, and identify the distribution such as that of a sum of independent random variables with the said generating functions.
Apply limit theorems such as the Weak Law of Large Numbers and the Central Limit Theorem to deduce the asymptotic properties of a random variable sequence such as unbiasedness, consistency and asymptotic normality.
Estimate parameters of standard distributions following the maximum likelihood and the method of moments approach, and judge the quality of the resulting estimators.
Calculate absorption probabilities for discrete Markov chains.
Calculate, and interpret, stationary distributions for discrete Markov chains
Joint and conditional distributions (covering discrete and continuous distributions, in particular the Multivariate Normal)
Generating functions (moment and probability generating functions)
Modes of convergence and limit theorems (including WLLN and CLT)
Maximum likelihood and method of moments estimation
Further properties of estimators (for instance, precision measure (e.g., MSE), Cramer-Rao)
Markov chains, up to convergence to equilibrium and ergodic theorem
Brief introduction to MCMC (time permitting
Task | % of module mark |
---|---|
Closed/in-person Exam (Centrally scheduled) | 100 |
None
There will be five formative assignments with marked work returned in the seminars. At least one of them will contain a longer written part, done in LaTeX.
Task | % of module mark |
---|---|
Closed/in-person Exam (Centrally scheduled) | 100 |
Current Department policy on feedback is available in the student handbook. Coursework and examinations will be marked and returned in accordance with this policy.
M DeGroot and M Schervish (2012), Probability and Statistics (4th edition), Pearson
G Grimmett and D Stirzaker (2001), Probability and Random Processes, OUP