PetBERT: automated ICD-11 syndromic disease coding for outbreak detection in first opinion veterinary electronic health records

Farrell, Sean; Appleton, Charlotte; Noble, Peter-John Mäntylä; Al Moubayed, Noura


Authors

Sean Farrell sean.farrell2@durham.ac.uk
PGR Student Doctor of Philosophy

Charlotte Appleton

Peter-John Mäntylä Noble



Abstract

Effective public health surveillance requires consistent monitoring of disease signals so that researchers and decision-makers can react dynamically to changes in disease occurrence. However, whilst surveillance initiatives exist in production animal veterinary medicine, comparable frameworks for companion animals are lacking. First-opinion veterinary electronic health records (EHRs) have the potential to reveal disease signals and often represent the initial reporting of clinical syndromes in animals presenting for medical attention, highlighting their possible significance in early disease detection. Yet despite their availability, their free-text nature limits their use and inhibits the production of national-level mortality and morbidity statistics. This paper presents PetBERT, a large language model trained on over 500 million words from 5.1 million EHRs across the UK. PetBERT-ICD is PetBERT further trained as a multi-label classifier for the automated coding of veterinary clinical EHRs with the International Classification of Diseases 11 (ICD-11) framework, achieving F1 scores exceeding 83% across 20 disease codings with minimal annotations. PetBERT-ICD effectively identifies disease outbreaks, detecting them up to 3 weeks earlier than current clinician-assigned point-of-care labelling strategies. The potential for PetBERT-ICD to enhance disease surveillance in veterinary medicine represents a promising avenue for advancing animal health and improving public health outcomes.
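As an illustration of what an F1 score means for a multi-label classifier such as the one described in the abstract, the sketch below computes one F1 score per label from binary indicator vectors and then averages them. It is not the authors' evaluation code: the function name `per_label_f1`, the three-label toy data, and the choice of macro averaging are all assumptions made for this example, and the abstract does not state how its F1 figures were aggregated.

```python
from typing import List


def per_label_f1(y_true: List[List[int]], y_pred: List[List[int]]) -> List[float]:
    """Return one F1 score per label column for 0/1 multi-label indicators."""
    n_labels = len(y_true[0])
    scores = []
    for j in range(n_labels):
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t[j] == 1 and p[j] == 1)  # true positives
        fp = sum(1 for t, p in pairs if t[j] == 0 and p[j] == 1)  # false positives
        fn = sum(1 for t, p in pairs if t[j] == 1 and p[j] == 0)  # false negatives
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined here as 0.0 when the label never occurs
        scores.append(2 * tp / denom if denom else 0.0)
    return scores


# Hypothetical toy data: 4 records, 3 syndrome labels (rows = records, columns = labels).
y_true = [[1, 0, 1], [0, 1, 0], [1, 1, 0], [0, 0, 1]]
y_pred = [[1, 0, 0], [0, 1, 0], [1, 0, 0], [0, 1, 1]]

scores = per_label_f1(y_true, y_pred)
macro_f1 = sum(scores) / len(scores)  # unweighted mean over labels
print(scores, macro_f1)
```

Because each record can carry several codes at once, every label is scored as its own binary problem; a per-label breakdown like this shows which codings a classifier handles well and which it does not, which a single pooled figure would hide.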

Citation

Farrell, S., Appleton, C., Noble, P. M., & Al Moubayed, N. (2023). PetBERT: automated ICD-11 syndromic disease coding for outbreak detection in first opinion veterinary electronic health records. Scientific Reports, 13(1), Article 18015. https://doi.org/10.1038/s41598-023-45155-7

Journal Article Type: Article
Acceptance Date: Oct 17, 2023
Online Publication Date: Oct 21, 2023
Publication Date: 2023
Deposit Date: Nov 2, 2023
Publicly Available Date: Nov 2, 2023
Journal: Scientific Reports
Publisher: Nature Research
Peer Reviewed: Peer Reviewed
Volume: 13
Issue: 1
Article Number: 18015
DOI: https://doi.org/10.1038/s41598-023-45155-7
Public URL: https://durham-repository.worktribe.com/output/1863090

Files

Published Journal Article (2.2 Mb)
PDF

Licence
http://creativecommons.org/licenses/by/4.0/

Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/

Copyright Statement
This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.




