Hi, you are logged in as , if you are not , please click here
You are shopping as , if this is not your email, please click here

CMI Short Course: Record Linkage and Statistical Matching

Info

Course Information

CMI Short Course: Record Linkage and Statistical Matching

The Cathie Marsh Institute are proud to introduce their 2026 short courses. 

All participants will receive state of the art teaching and a lunch voucher as part of the course.

 

Record Linkage and Statistical Matching 

Taught by: Natalie Shlomo

Location- Ellen Wilkinson Building A3.7

Abstract:  

The aim is to provide theory and practical applications of deterministic and probabilistic approaches to data (record) linkage. The course provides a review of best practices and covers theory with respect to the pre-processing requirements, the methods used to calculate matching weights, types of errors in the classification, evaluation procedures and introduction to the E-M algorithm and analysis of linked data. By the end of the course, students should have an understanding of data linkage techniques and be able to implement and evaluate data linkage procedures.   

 Pre-requisite:  

No pre-requisite is required but students should have an understanding of basic statistical concepts. 

Course Code

CMI Record Linkage and Statistical Matching 2026
Course Description

References: 

 Belin, T.R. and Rubin, D. B. (1995) A Method for Calibrating False-Match Rates in Record Linkage. Journal of the American Statistical Association, 90, 694-707.  

  

Clark D. (2004)  Practical introduction to record linkage for injury research. Injury Prevention, 10(3):186. 

  

Fellegi, I. P. and Sunter, A. B. (1969) A Theory for Record Linkage, Journal of the American Statistical Association, 64, 1183-1210.  

  

Gill, L. (2001) Methods for Automatic Record Matching and Linkage and their use in National Statistics,  The National Statistics Methodology Series, ONS (available at http://www.ons.gov.uk/ons/guide-method/method-quality/specific/gss-methodology-series/index.html   number 25)  

  

Herzog, T. N., Scheuren, F. J. and Winkler, W. E. (2007) Data Quality and Record Linkage Techniques. New York: Springer. ISBN 978-0-387-69502-0 

  

Lahiri, P. and Larsen, M.D. (2005) Regression Analysis with Linked Data. Journal of the American Statistical Association, Vol. 100, No. 469, 222-230 (Also at:   

http://www.stat.iastate.edu/preprint/articles/2004-09.pdf)  

  

Mason, C.A. amd Shihfen, T. (2008) Data Linkage Using Probabilistic Decision Rules: A Primer, Birth Defects Research (Part A): Clinical and Molecular Teratology 82, 812-821 

 

Scheuren, F., and Winkler, W.E. (1993) Regression analysis of data files that are computer matched. Survey Methodology, 19, 39-58. 

  

Scheuren, F., and Winkler, W.E. (1997) Regression analysis of data files that are computer matched II. Survey Methodology, 23, 157-165. 

  

Shlomo, N. (2019). Overview of Data Linkage Methods for Policy Design and Evaluation in N. Crato, P. Paruolo (eds.), Data-Driven Policy Impact Evaluation. Springer. https://link.springer.com/chapter/10.1007/978-3-319-78461-8_4 

  

Winglee, M., Valliant, R. and Scheuren, F. (2005) A Case Study in Record Linkage. Survey Methodology, Vol. 31, Number 1, 3-12.  

  

Winkler, W. E. (1995) Matching and Record Linkage, in B.G. Cox et al. (ed) Business Survey Methods, New York: J. Wiley, 355-384   

http://www.fcsm.gov/working-papers/wwinkler.pdf   

StartEndCourse Fee 
Cathie Marsh Institute Member
17/06/202617/06/2026£10.00[Read More]