Multi-Component Temporal-Correlation Seismic Data Compression Algorithm Based on the PCA and DWT
Industrial application data acquisition systems can be sources of vast amounts of data. The seismic surveys conducted by oil and gas companies result in enormous datasets, often exceeding terabytes of data. The storage and communication demands these data require can only be achieved through compres...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2025-01-01
|
Series: | Algorithms |
Subjects: | |
Online Access: | https://www.mdpi.com/1999-4893/18/1/33 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Industrial application data acquisition systems can be sources of vast amounts of data. The seismic surveys conducted by oil and gas companies result in enormous datasets, often exceeding terabytes of data. The storage and communication demands these data require can only be achieved through compression. Careful consideration must be given to minimize the reconstruction error of compressed data caused by lossy compression. This paper investigates the combination of principal component analysis (PCA), discrete wavelet transform (DWT), thresholding, quantization, and entropy encoding to compress such datasets. The proposed method is a lossy compression algorithm tuned by evaluating the reconstruction error in frequency ranges of interest, namely 0–20 Hz and 15–65 Hz. The PCA compression and decompression acts as a noise filter while the DWT drives the compression. The proposed method can be tuned through threshold and quantization percentages and the number of principal components to achieve compression rates of up to 31:1 with reconstruction residues energy of less than <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>4</mn><mo>%</mo></mrow></semantics></math></inline-formula> in the frequency ranges of 0–20 Hz, 15–65 Hz, and 60–105 Hz. |
---|---|
ISSN: | 1999-4893 |