Improving machine learning predictions to estimate fishing effort using vessel's tracking data

Small-Scale Fisheries (SSF) comprise over 80 % of the global fleet and serve as the primary income source for numerous coastal communities. However, these critical fisheries face various threats. To effectively monitor SSF activities and their ecological impacts, it is required precise estimation of...

Full description

Saved in:
Bibliographic Details
Main Authors: J. Samarão, A. Moreno, M.B. Gaspar, M.M. Rufino
Format: Article
Language:English
Published: Elsevier 2025-03-01
Series:Ecological Informatics
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1574954124004953
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832595382631137280
author J. Samarão
A. Moreno
M.B. Gaspar
M.M. Rufino
author_facet J. Samarão
A. Moreno
M.B. Gaspar
M.M. Rufino
author_sort J. Samarão
collection DOAJ
description Small-Scale Fisheries (SSF) comprise over 80 % of the global fleet and serve as the primary income source for numerous coastal communities. However, these critical fisheries face various threats. To effectively monitor SSF activities and their ecological impacts, it is required precise estimation of fishing effort using high-resolution spatio-temporal data. This information can identify areas with high fishing density, warranting protection of their main fishing grounds against other users (i.e. ocean grabbing), while also signalling potential stock depletion requiring management interventions and preserving the ecosystems from which these fisheries depend on.In this study, we propose a series of steps to enhance the performance of Machine Learning algorithms in estimating fishing effort. We assessed seven supervised ML algorithms, including Logistic Regression, Ridge Classifier, Random Forest Classifier, K-Neighbours, Gradient Boosting Classifier, LinearSVC, Recurrent Neural Networks and XGBoost, using four case studies, from bivalve dredge and octopus pots and traps fisheries.First, in a preliminary statistical analysis between common error measures derived from the confusion matrix was decided to use accuracy, precision, and sensitivity as evaluation criteria. We found that a simple moving average applied to speed, employed as a pre-processing technique using ten neighbouring points, showed up to 3 % improvement in results. Random Forest and XGBoost gave the best performances among the models compared (18 % change), using the variables Latitude, Longitude, Speed, Time, and Month (accuracies near 99 %)(61 % change). The proportion of the training/test dataset, showed a minimal impact on accuracy, with changes of less than 8 % when varying the training data percentage between 10 % and 90 %, making 60 % a suitable compromise. Considering the sampling unit to be (1) point-based (randomly selected pings) or (2) boat trip-based (randomly selected boat trips), leaded to changes in accuracy between 2.53 % and 3.99 %, depending on the model. Temporal resolution (ping rate) showed minimal effects on model performance, ranging from less than 2 % for intervals between 30 s (raw data with irregular time series) to 10 min (regular time series). As a post-processing step, it was concluded that replacing isolated data points with neighbouring values, significantly enhanced the detection of fishing events, with improvements ranging from 80 % to 250 %, depending on the model.In conclusion, this study presents a straightforward procedure for selecting a machine learning method and enhancing its power of classification using simple procedures. These approaches should be applied in all works using machine learning to produce fishing effort maps.
format Article
id doaj-art-2ceda81fd0ad4e2e8a380af975543f87
institution Kabale University
issn 1574-9541
language English
publishDate 2025-03-01
publisher Elsevier
record_format Article
series Ecological Informatics
spelling doaj-art-2ceda81fd0ad4e2e8a380af975543f872025-01-19T06:24:38ZengElsevierEcological Informatics1574-95412025-03-0185102953Improving machine learning predictions to estimate fishing effort using vessel's tracking dataJ. Samarão0A. Moreno1M.B. Gaspar2M.M. Rufino3Portuguese Institute for the Sea and the Atmosphere (IPMA), Av. Dr. Alfredo Magalhães Ramalho, 6, 1495-65 Lisboa, Portugal; Nova School of Science and Technology (FCT), Almada, PortugalPortuguese Institute for the Sea and the Atmosphere (IPMA), Av. Dr. Alfredo Magalhães Ramalho, 6, 1495-65 Lisboa, PortugalPortuguese Institute for the Sea and the Atmosphere (IPMA), Av. Dr. Alfredo Magalhães Ramalho, 6, 1495-65 Lisboa, Portugal; CCMARPortuguese Institute for the Sea and the Atmosphere (IPMA), Av. Dr. Alfredo Magalhães Ramalho, 6, 1495-65 Lisboa, Portugal; Centre of Statistics and its Applications (CEAUL), Faculty of Sciences, University of Lisbon, Portugal; Corresponding author.Small-Scale Fisheries (SSF) comprise over 80 % of the global fleet and serve as the primary income source for numerous coastal communities. However, these critical fisheries face various threats. To effectively monitor SSF activities and their ecological impacts, it is required precise estimation of fishing effort using high-resolution spatio-temporal data. This information can identify areas with high fishing density, warranting protection of their main fishing grounds against other users (i.e. ocean grabbing), while also signalling potential stock depletion requiring management interventions and preserving the ecosystems from which these fisheries depend on.In this study, we propose a series of steps to enhance the performance of Machine Learning algorithms in estimating fishing effort. We assessed seven supervised ML algorithms, including Logistic Regression, Ridge Classifier, Random Forest Classifier, K-Neighbours, Gradient Boosting Classifier, LinearSVC, Recurrent Neural Networks and XGBoost, using four case studies, from bivalve dredge and octopus pots and traps fisheries.First, in a preliminary statistical analysis between common error measures derived from the confusion matrix was decided to use accuracy, precision, and sensitivity as evaluation criteria. We found that a simple moving average applied to speed, employed as a pre-processing technique using ten neighbouring points, showed up to 3 % improvement in results. Random Forest and XGBoost gave the best performances among the models compared (18 % change), using the variables Latitude, Longitude, Speed, Time, and Month (accuracies near 99 %)(61 % change). The proportion of the training/test dataset, showed a minimal impact on accuracy, with changes of less than 8 % when varying the training data percentage between 10 % and 90 %, making 60 % a suitable compromise. Considering the sampling unit to be (1) point-based (randomly selected pings) or (2) boat trip-based (randomly selected boat trips), leaded to changes in accuracy between 2.53 % and 3.99 %, depending on the model. Temporal resolution (ping rate) showed minimal effects on model performance, ranging from less than 2 % for intervals between 30 s (raw data with irregular time series) to 10 min (regular time series). As a post-processing step, it was concluded that replacing isolated data points with neighbouring values, significantly enhanced the detection of fishing events, with improvements ranging from 80 % to 250 %, depending on the model.In conclusion, this study presents a straightforward procedure for selecting a machine learning method and enhancing its power of classification using simple procedures. These approaches should be applied in all works using machine learning to produce fishing effort maps.http://www.sciencedirect.com/science/article/pii/S1574954124004953Fishing effortMachine leaningSpatio-temporal high-resolution dataSmall scale fisheries
spellingShingle J. Samarão
A. Moreno
M.B. Gaspar
M.M. Rufino
Improving machine learning predictions to estimate fishing effort using vessel's tracking data
Ecological Informatics
Fishing effort
Machine leaning
Spatio-temporal high-resolution data
Small scale fisheries
title Improving machine learning predictions to estimate fishing effort using vessel's tracking data
title_full Improving machine learning predictions to estimate fishing effort using vessel's tracking data
title_fullStr Improving machine learning predictions to estimate fishing effort using vessel's tracking data
title_full_unstemmed Improving machine learning predictions to estimate fishing effort using vessel's tracking data
title_short Improving machine learning predictions to estimate fishing effort using vessel's tracking data
title_sort improving machine learning predictions to estimate fishing effort using vessel s tracking data
topic Fishing effort
Machine leaning
Spatio-temporal high-resolution data
Small scale fisheries
url http://www.sciencedirect.com/science/article/pii/S1574954124004953
work_keys_str_mv AT jsamarao improvingmachinelearningpredictionstoestimatefishingeffortusingvesselstrackingdata
AT amoreno improvingmachinelearningpredictionstoestimatefishingeffortusingvesselstrackingdata
AT mbgaspar improvingmachinelearningpredictionstoestimatefishingeffortusingvesselstrackingdata
AT mmrufino improvingmachinelearningpredictionstoestimatefishingeffortusingvesselstrackingdata