Analysis of Sparse Trajectory Features Based on Mobile Device Location for User Group Classification Using Gaussian Mixture Model

Location data collected from mobile devices via global positioning system often lack semantic information and can form sparse trajectories in space and time. This study investigates whether user age groups can be accurately classified solely from such sparse spatial–temporal trajectories. We propose...

Full description

Saved in:
Bibliographic Details
Main Authors: Yohei Kakimoto, Yuto Omae, Hirotaka Takahashi
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/2/982
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832589176606818304
author Yohei Kakimoto
Yuto Omae
Hirotaka Takahashi
author_facet Yohei Kakimoto
Yuto Omae
Hirotaka Takahashi
author_sort Yohei Kakimoto
collection DOAJ
description Location data collected from mobile devices via global positioning system often lack semantic information and can form sparse trajectories in space and time. This study investigates whether user age groups can be accurately classified solely from such sparse spatial–temporal trajectories. We propose a feature extraction method based on a Gaussian mixture model (GMM), which assigns representative points (RPs) by clustering the location data and aggregating user trajectories into these RPs. We then construct three machine learning (ML) models—support vector classifier (SVC), random forest (RF), and deep neural network (DNN)—using the GMM-based features and compare their performance with that of the improved DNN (IDNN), which is an existing feature extraction approach. In our experiments, we introduced a missing value ratio <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mi>θ</mi><mi>th</mi></msub></semantics></math></inline-formula> to quantify trajectory sparsity and analyzed the effect of trajectory sparsity on the classification accuracy and generalizability performance of the ML models. The results indicate that GMM-based features outperform IDNN-based features in both classification accuracy and generalization performance. Notably, the RF model achieved the highest accuracy, whereas the SVC model displayed stable generalizability. As the missing value ratio <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mi>θ</mi><mi>th</mi></msub></semantics></math></inline-formula> increases, the IDNN becomes more susceptible to overfitting, whereas the GMM-based approach preserves accuracy and robustness. These findings suggest that sparse trajectories can still offer meaningful classification performance with appropriate feature design and model selection even without semantic information. This approach holds promise for domains where large-scale, sparse trajectory data are common, including urban planning, marketing analysis, and public policy.
format Article
id doaj-art-39455e76dbe848da9e4dfe5b5d7fb084
institution Kabale University
issn 2076-3417
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj-art-39455e76dbe848da9e4dfe5b5d7fb0842025-01-24T13:21:34ZengMDPI AGApplied Sciences2076-34172025-01-0115298210.3390/app15020982Analysis of Sparse Trajectory Features Based on Mobile Device Location for User Group Classification Using Gaussian Mixture ModelYohei Kakimoto0Yuto Omae1Hirotaka Takahashi2College of Industrial Technology, Nihon University, 1-2-1 Izumicho, Narashino 275-8575, JapanCollege of Industrial Technology, Nihon University, 1-2-1 Izumicho, Narashino 275-8575, JapanDepartment of Design and Data Science and Research Center for Space Science, Advanced Research Laboratories, Tokyo City University, 3-3-1 Ushikubo-Nishi, Tsuzuki-ku, Yokohama 224-8551, JapanLocation data collected from mobile devices via global positioning system often lack semantic information and can form sparse trajectories in space and time. This study investigates whether user age groups can be accurately classified solely from such sparse spatial–temporal trajectories. We propose a feature extraction method based on a Gaussian mixture model (GMM), which assigns representative points (RPs) by clustering the location data and aggregating user trajectories into these RPs. We then construct three machine learning (ML) models—support vector classifier (SVC), random forest (RF), and deep neural network (DNN)—using the GMM-based features and compare their performance with that of the improved DNN (IDNN), which is an existing feature extraction approach. In our experiments, we introduced a missing value ratio <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mi>θ</mi><mi>th</mi></msub></semantics></math></inline-formula> to quantify trajectory sparsity and analyzed the effect of trajectory sparsity on the classification accuracy and generalizability performance of the ML models. The results indicate that GMM-based features outperform IDNN-based features in both classification accuracy and generalization performance. Notably, the RF model achieved the highest accuracy, whereas the SVC model displayed stable generalizability. As the missing value ratio <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mi>θ</mi><mi>th</mi></msub></semantics></math></inline-formula> increases, the IDNN becomes more susceptible to overfitting, whereas the GMM-based approach preserves accuracy and robustness. These findings suggest that sparse trajectories can still offer meaningful classification performance with appropriate feature design and model selection even without semantic information. This approach holds promise for domains where large-scale, sparse trajectory data are common, including urban planning, marketing analysis, and public policy.https://www.mdpi.com/2076-3417/15/2/982sparse trajectory classificationmobile device locationGaussian mixture modelmachine learning
spellingShingle Yohei Kakimoto
Yuto Omae
Hirotaka Takahashi
Analysis of Sparse Trajectory Features Based on Mobile Device Location for User Group Classification Using Gaussian Mixture Model
Applied Sciences
sparse trajectory classification
mobile device location
Gaussian mixture model
machine learning
title Analysis of Sparse Trajectory Features Based on Mobile Device Location for User Group Classification Using Gaussian Mixture Model
title_full Analysis of Sparse Trajectory Features Based on Mobile Device Location for User Group Classification Using Gaussian Mixture Model
title_fullStr Analysis of Sparse Trajectory Features Based on Mobile Device Location for User Group Classification Using Gaussian Mixture Model
title_full_unstemmed Analysis of Sparse Trajectory Features Based on Mobile Device Location for User Group Classification Using Gaussian Mixture Model
title_short Analysis of Sparse Trajectory Features Based on Mobile Device Location for User Group Classification Using Gaussian Mixture Model
title_sort analysis of sparse trajectory features based on mobile device location for user group classification using gaussian mixture model
topic sparse trajectory classification
mobile device location
Gaussian mixture model
machine learning
url https://www.mdpi.com/2076-3417/15/2/982
work_keys_str_mv AT yoheikakimoto analysisofsparsetrajectoryfeaturesbasedonmobiledevicelocationforusergroupclassificationusinggaussianmixturemodel
AT yutoomae analysisofsparsetrajectoryfeaturesbasedonmobiledevicelocationforusergroupclassificationusinggaussianmixturemodel
AT hirotakatakahashi analysisofsparsetrajectoryfeaturesbasedonmobiledevicelocationforusergroupclassificationusinggaussianmixturemodel