Improved cattle farm classification: leveraging machine learning and linked national datasets

While many countries have registries of livestock farms, it can be challenging to obtain information on their primary production type. For example, for Swiss farms registered as keeping cattle, a distinction can only be made between milk-producing and non-milk-producing farms. The Swiss cattle indus...

Full description

Saved in:
Bibliographic Details
Main Authors: Guy-Alain Schnidrig, Rahel Struchen, Sara Schärrer, Dagmar Heim, Daniela Hadorn, Gertraud Schüpbach-Regula, Giulia Paternoster
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-02-01
Series:Frontiers in Veterinary Science
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fvets.2025.1517173/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832098363133132800
author Guy-Alain Schnidrig
Guy-Alain Schnidrig
Rahel Struchen
Sara Schärrer
Dagmar Heim
Daniela Hadorn
Gertraud Schüpbach-Regula
Giulia Paternoster
author_facet Guy-Alain Schnidrig
Guy-Alain Schnidrig
Rahel Struchen
Sara Schärrer
Dagmar Heim
Daniela Hadorn
Gertraud Schüpbach-Regula
Giulia Paternoster
author_sort Guy-Alain Schnidrig
collection DOAJ
description While many countries have registries of livestock farms, it can be challenging to obtain information on their primary production type. For example, for Swiss farms registered as keeping cattle, a distinction can only be made between milk-producing and non-milk-producing farms. The Swiss cattle industry consists of beef and dairy farms, with a strong predominance of small to medium-sized farms. A better differentiation of cattle production types would be beneficial for the planning and evaluation of surveillance programmes for cattle diseases and for the benchmarking antibiotic consumption. The aim of this study was to outline cattle production types of interest and to allow the classification of Swiss cattle farms according to production type in order to optimize surveillance. We collaborated with experts to define the five primary cattle production types: calf fattening, dairy cattle, cattle fattening, rearing cattle and suckler cows. In collaboration with the cantonal Veterinary Offices, we collected production types from 618 reference farms across 14 cantons and defined a total of 24 features by combining information from three national databases. Using farm-level data on milk production, age and sex distribution, cattle breeds, calving, births, slaughter, animal movements and antibiotic use, we trained three different machine learning models capable of classifying the five production types. Among these models, the Random Forest model demonstrated the highest level of performance, achieving an accuracy of 0.914 (95% CI: 0.890, 0.938) and an F1-Score of 0.879 (95% CI: 0.841, 0.913). In conclusion, together with experts, we have outlined five primary production types on cattle farms in Switzerland and developed a model that allows a reproducible, year-to-year classification of cattle farms using national datasets. Our flexible methodology could be adapted to other countries and datasets, enabling veterinary authorities to conduct more efficient and targeted disease surveillance in the future.
format Article
id doaj-art-debfc9ba021d4488894c65a04115155d
institution Kabale University
issn 2297-1769
language English
publishDate 2025-02-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Veterinary Science
spelling doaj-art-debfc9ba021d4488894c65a04115155d2025-02-05T12:12:08ZengFrontiers Media S.A.Frontiers in Veterinary Science2297-17692025-02-011210.3389/fvets.2025.15171731517173Improved cattle farm classification: leveraging machine learning and linked national datasetsGuy-Alain Schnidrig0Guy-Alain Schnidrig1Rahel Struchen2Sara Schärrer3Dagmar Heim4Daniela Hadorn5Gertraud Schüpbach-Regula6Giulia Paternoster7Veterinary Public Health Institute, Vetsuisse, University of Bern, Bern, SwitzerlandGraduate School of Cellular and Biomedical Sciences, University of Bern, Bern, SwitzerlandDepartment of Animal Health and Animal Welfare, Federal Food Safety and Veterinary Office (FSVO), Bern, SwitzerlandDepartment of Animal Health and Animal Welfare, Federal Food Safety and Veterinary Office (FSVO), Bern, SwitzerlandDepartment of Animal Health and Animal Welfare, Federal Food Safety and Veterinary Office (FSVO), Bern, SwitzerlandDepartment of Animal Health and Animal Welfare, Federal Food Safety and Veterinary Office (FSVO), Bern, SwitzerlandVeterinary Public Health Institute, Vetsuisse, University of Bern, Bern, SwitzerlandDepartment of Animal Health and Animal Welfare, Federal Food Safety and Veterinary Office (FSVO), Bern, SwitzerlandWhile many countries have registries of livestock farms, it can be challenging to obtain information on their primary production type. For example, for Swiss farms registered as keeping cattle, a distinction can only be made between milk-producing and non-milk-producing farms. The Swiss cattle industry consists of beef and dairy farms, with a strong predominance of small to medium-sized farms. A better differentiation of cattle production types would be beneficial for the planning and evaluation of surveillance programmes for cattle diseases and for the benchmarking antibiotic consumption. The aim of this study was to outline cattle production types of interest and to allow the classification of Swiss cattle farms according to production type in order to optimize surveillance. We collaborated with experts to define the five primary cattle production types: calf fattening, dairy cattle, cattle fattening, rearing cattle and suckler cows. In collaboration with the cantonal Veterinary Offices, we collected production types from 618 reference farms across 14 cantons and defined a total of 24 features by combining information from three national databases. Using farm-level data on milk production, age and sex distribution, cattle breeds, calving, births, slaughter, animal movements and antibiotic use, we trained three different machine learning models capable of classifying the five production types. Among these models, the Random Forest model demonstrated the highest level of performance, achieving an accuracy of 0.914 (95% CI: 0.890, 0.938) and an F1-Score of 0.879 (95% CI: 0.841, 0.913). In conclusion, together with experts, we have outlined five primary production types on cattle farms in Switzerland and developed a model that allows a reproducible, year-to-year classification of cattle farms using national datasets. Our flexible methodology could be adapted to other countries and datasets, enabling veterinary authorities to conduct more efficient and targeted disease surveillance in the future.https://www.frontiersin.org/articles/10.3389/fvets.2025.1517173/fullmachine learningsurveillancefarm topologymovement databaseantibiotic use datacattle
spellingShingle Guy-Alain Schnidrig
Guy-Alain Schnidrig
Rahel Struchen
Sara Schärrer
Dagmar Heim
Daniela Hadorn
Gertraud Schüpbach-Regula
Giulia Paternoster
Improved cattle farm classification: leveraging machine learning and linked national datasets
Frontiers in Veterinary Science
machine learning
surveillance
farm topology
movement database
antibiotic use data
cattle
title Improved cattle farm classification: leveraging machine learning and linked national datasets
title_full Improved cattle farm classification: leveraging machine learning and linked national datasets
title_fullStr Improved cattle farm classification: leveraging machine learning and linked national datasets
title_full_unstemmed Improved cattle farm classification: leveraging machine learning and linked national datasets
title_short Improved cattle farm classification: leveraging machine learning and linked national datasets
title_sort improved cattle farm classification leveraging machine learning and linked national datasets
topic machine learning
surveillance
farm topology
movement database
antibiotic use data
cattle
url https://www.frontiersin.org/articles/10.3389/fvets.2025.1517173/full
work_keys_str_mv AT guyalainschnidrig improvedcattlefarmclassificationleveragingmachinelearningandlinkednationaldatasets
AT guyalainschnidrig improvedcattlefarmclassificationleveragingmachinelearningandlinkednationaldatasets
AT rahelstruchen improvedcattlefarmclassificationleveragingmachinelearningandlinkednationaldatasets
AT sarascharrer improvedcattlefarmclassificationleveragingmachinelearningandlinkednationaldatasets
AT dagmarheim improvedcattlefarmclassificationleveragingmachinelearningandlinkednationaldatasets
AT danielahadorn improvedcattlefarmclassificationleveragingmachinelearningandlinkednationaldatasets
AT gertraudschupbachregula improvedcattlefarmclassificationleveragingmachinelearningandlinkednationaldatasets
AT giuliapaternoster improvedcattlefarmclassificationleveragingmachinelearningandlinkednationaldatasets