Skip to main content

Research Repository

Advanced Search

The effect of data aggregation on dispersion estimates in count data models

Errington, Adam; Einbeck, Jochen; Cumming, Jonathan; Rössler, Ute; Endesfelder, David

The effect of data aggregation on dispersion estimates in count data models Thumbnail


Authors

Adam Errington adam.errington@durham.ac.uk
PGR Student Doctor of Philosophy

Ute Rössler

David Endesfelder



Abstract

For the modelling of count data, aggregation of the raw data over certain subgroups or predictor configurations is common practice. This is, for instance, the case for count data biomarkers of radiation exposure. Under the Poisson law, count data can be aggregated without loss of information on the Poisson parameter, which remains true if the Poisson assumption is relaxed towards quasi-Poisson. However, in biodosimetry in particular, but also beyond, the question of how the dispersion estimates for quasi-Poisson models behave under data aggregation have received little attention. Indeed, for real data sets featuring unexplained heterogeneities, dispersion estimates can increase strongly after aggregation, an effect which we will demonstrate and quantify explicitly for some scenarios. The increase in dispersion estimates implies an inflation of the parameter standard errors, which, however, by comparison with random effect models, can be shown to serve a corrective purpose. The phenomena are illustrated by y-H2AX foci data as used for instance in radiation biodosimetry for the calibration of dose-response curves.

Citation

Errington, A., Einbeck, J., Cumming, J., Rössler, U., & Endesfelder, D. (2022). The effect of data aggregation on dispersion estimates in count data models. International Journal of Biostatistics, 18(1), 183-202. https://doi.org/10.1515/ijb-2020-0079

Journal Article Type Article
Acceptance Date Apr 21, 2021
Online Publication Date May 7, 2021
Publication Date 2022-05
Deposit Date May 19, 2021
Publicly Available Date Aug 20, 2021
Journal International Journal of Biostatistics
Print ISSN 1557-4679
Electronic ISSN 1557-4679
Publisher De Gruyter
Peer Reviewed Peer Reviewed
Volume 18
Issue 1
Pages 183-202
DOI https://doi.org/10.1515/ijb-2020-0079
Public URL https://durham-repository.worktribe.com/output/1247843

Files






You might also like



Downloadable Citations