An Efficient GPU-Accelerated Algorithm for Solving Dynamic Response of Fluid-Saturated Porous Media

The traditional finite element program is executed on the CPU; however, it is challenging for the CPU to compute the ultra-large scale finite element model. In this paper, we present a set of efficient algorithms based on GPU acceleration technology for the dynamic response of fluid-saturated porous...

Full description

Saved in:
Bibliographic Details
Main Authors: Wancang Lin, Qinglong Zhou, Xinyi Chen, Wenhao Shi, Jie Ai
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/13/2/181
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832588053197094912
author Wancang Lin
Qinglong Zhou
Xinyi Chen
Wenhao Shi
Jie Ai
author_facet Wancang Lin
Qinglong Zhou
Xinyi Chen
Wenhao Shi
Jie Ai
author_sort Wancang Lin
collection DOAJ
description The traditional finite element program is executed on the CPU; however, it is challenging for the CPU to compute the ultra-large scale finite element model. In this paper, we present a set of efficient algorithms based on GPU acceleration technology for the dynamic response of fluid-saturated porous media, named PNAM, encompassing the assembly of the global matrix and the iterative solution of equations. In the assembly part, the CSR storage format of the global matrix is directly obtained from the element matrix. For data with two million degrees of freedom, it merely takes approximately 1 s to generate all the data of global matrices, which is significantly superior to the CPU version. Regarding the iterative solution of equations, a novel algorithm based on the CUDA kernel function is proposed. For a data set with two million degrees of freedom, it takes only about 0.05 s to compute an iterative step and transfer the data to the CPU. The program is designed to calculate either in single or double precision. The change in precision has little impact on the assembly of the global matrix, but the calculation time of double precision is generally 1.5 to 2 times that of single precision in the iterative solution part for a model with 2 million degrees of freedom. PNAM has high computational efficiency and great compatibility, which can be used to solve not only saturated fluid problems but also a variety of other problems.
format Article
id doaj-art-45c733f9a61b455fa1c86581445c5e0a
institution Kabale University
issn 2227-7390
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Mathematics
spelling doaj-art-45c733f9a61b455fa1c86581445c5e0a2025-01-24T13:39:39ZengMDPI AGMathematics2227-73902025-01-0113218110.3390/math13020181An Efficient GPU-Accelerated Algorithm for Solving Dynamic Response of Fluid-Saturated Porous MediaWancang Lin0Qinglong Zhou1Xinyi Chen2Wenhao Shi3Jie Ai4School of Resources and Security Engineering, Central South University, Changsha 410083, ChinaSchool of Resources and Security Engineering, Central South University, Changsha 410083, ChinaSchool of Resources and Security Engineering, Central South University, Changsha 410083, ChinaSchool of Resources and Security Engineering, Central South University, Changsha 410083, ChinaSchool of Resources and Security Engineering, Central South University, Changsha 410083, ChinaThe traditional finite element program is executed on the CPU; however, it is challenging for the CPU to compute the ultra-large scale finite element model. In this paper, we present a set of efficient algorithms based on GPU acceleration technology for the dynamic response of fluid-saturated porous media, named PNAM, encompassing the assembly of the global matrix and the iterative solution of equations. In the assembly part, the CSR storage format of the global matrix is directly obtained from the element matrix. For data with two million degrees of freedom, it merely takes approximately 1 s to generate all the data of global matrices, which is significantly superior to the CPU version. Regarding the iterative solution of equations, a novel algorithm based on the CUDA kernel function is proposed. For a data set with two million degrees of freedom, it takes only about 0.05 s to compute an iterative step and transfer the data to the CPU. The program is designed to calculate either in single or double precision. The change in precision has little impact on the assembly of the global matrix, but the calculation time of double precision is generally 1.5 to 2 times that of single precision in the iterative solution part for a model with 2 million degrees of freedom. PNAM has high computational efficiency and great compatibility, which can be used to solve not only saturated fluid problems but also a variety of other problems.https://www.mdpi.com/2227-7390/13/2/181CUDAu-p dynamic formulationfinite elementsspeedupGPU-acceleration
spellingShingle Wancang Lin
Qinglong Zhou
Xinyi Chen
Wenhao Shi
Jie Ai
An Efficient GPU-Accelerated Algorithm for Solving Dynamic Response of Fluid-Saturated Porous Media
Mathematics
CUDA
u-p dynamic formulation
finite elements
speedup
GPU-acceleration
title An Efficient GPU-Accelerated Algorithm for Solving Dynamic Response of Fluid-Saturated Porous Media
title_full An Efficient GPU-Accelerated Algorithm for Solving Dynamic Response of Fluid-Saturated Porous Media
title_fullStr An Efficient GPU-Accelerated Algorithm for Solving Dynamic Response of Fluid-Saturated Porous Media
title_full_unstemmed An Efficient GPU-Accelerated Algorithm for Solving Dynamic Response of Fluid-Saturated Porous Media
title_short An Efficient GPU-Accelerated Algorithm for Solving Dynamic Response of Fluid-Saturated Porous Media
title_sort efficient gpu accelerated algorithm for solving dynamic response of fluid saturated porous media
topic CUDA
u-p dynamic formulation
finite elements
speedup
GPU-acceleration
url https://www.mdpi.com/2227-7390/13/2/181
work_keys_str_mv AT wancanglin anefficientgpuacceleratedalgorithmforsolvingdynamicresponseoffluidsaturatedporousmedia
AT qinglongzhou anefficientgpuacceleratedalgorithmforsolvingdynamicresponseoffluidsaturatedporousmedia
AT xinyichen anefficientgpuacceleratedalgorithmforsolvingdynamicresponseoffluidsaturatedporousmedia
AT wenhaoshi anefficientgpuacceleratedalgorithmforsolvingdynamicresponseoffluidsaturatedporousmedia
AT jieai anefficientgpuacceleratedalgorithmforsolvingdynamicresponseoffluidsaturatedporousmedia
AT wancanglin efficientgpuacceleratedalgorithmforsolvingdynamicresponseoffluidsaturatedporousmedia
AT qinglongzhou efficientgpuacceleratedalgorithmforsolvingdynamicresponseoffluidsaturatedporousmedia
AT xinyichen efficientgpuacceleratedalgorithmforsolvingdynamicresponseoffluidsaturatedporousmedia
AT wenhaoshi efficientgpuacceleratedalgorithmforsolvingdynamicresponseoffluidsaturatedporousmedia
AT jieai efficientgpuacceleratedalgorithmforsolvingdynamicresponseoffluidsaturatedporousmedia