Sean Farrell sean.farrell2@durham.ac.uk
PGR Student Doctor of Philosophy
PetBERT: automated ICD-11 syndromic disease coding for outbreak detection in first opinion veterinary electronic health records
Farrell, Sean; Appleton, Charlotte; Noble, Peter-John Mäntylä; Al Moubayed, Noura
Authors
Charlotte Appleton
Peter-John Mäntylä Noble
Dr Noura Al Moubayed noura.al-moubayed@durham.ac.uk
Associate Professor
Abstract
Effective public health surveillance requires consistent monitoring of disease signals such that researchers and decision-makers can react dynamically to changes in disease occurrence. However, whilst surveillance initiatives exist in production animal veterinary medicine, comparable frameworks for companion animals are lacking. First-opinion veterinary electronic health records (EHRs) have the potential to reveal disease signals and often represent the initial reporting of clinical syndromes in animals presenting for medical attention, highlighting their possible significance in early disease detection. Yet despite their availability, there are limitations surrounding their free text-based nature, inhibiting the ability for national-level mortality and morbidity statistics to occur. This paper presents PetBERT, a large language model trained on over 500 million words from 5.1 million EHRs across the UK. PetBERT-ICD is the additional training of PetBERT as a multi-label classifier for the automated coding of veterinary clinical EHRs with the International Classification of Disease 11 framework, achieving F1 scores exceeding 83% across 20 disease codings with minimal annotations. PetBERT-ICD effectively identifies disease outbreaks, outperforming current clinician-assigned point-of-care labelling strategies up to 3 weeks earlier. The potential for PetBERT-ICD to enhance disease surveillance in veterinary medicine represents a promising avenue for advancing animal health and improving public health outcomes.
Citation
Farrell, S., Appleton, C., Noble, P. M., & Al Moubayed, N. (2023). PetBERT: automated ICD-11 syndromic disease coding for outbreak detection in first opinion veterinary electronic health records. Scientific Reports, 13(1), Article 18015. https://doi.org/10.1038/s41598-023-45155-7
Journal Article Type | Article |
---|---|
Acceptance Date | Oct 17, 2023 |
Online Publication Date | Oct 21, 2023 |
Publication Date | 2023 |
Deposit Date | Nov 2, 2023 |
Publicly Available Date | Nov 2, 2023 |
Journal | Scientific Reports |
Publisher | Nature Research |
Peer Reviewed | Peer Reviewed |
Volume | 13 |
Issue | 1 |
Article Number | 18015 |
DOI | https://doi.org/10.1038/s41598-023-45155-7 |
Public URL | https://durham-repository.worktribe.com/output/1863090 |
Files
Published Journal Article
(2.2 Mb)
PDF
Licence
http://creativecommons.org/licenses/by/4.0/
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
Copyright Statement
This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
You might also like
Explainable text-tabular models for predicting mortality risk in companion animals
(2024)
Journal Article
Evaluating ChatGPT text mining of clinical records for companion animal obesity monitoring
(2023)
Journal Article
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search