A Survey of Data Partitioning and Sampling Methods to Support Big Data Analysis
Computer clusters with the shared-nothing architecture are the major computing platforms for big data processing and analysis. In cluster computing, data partitioning and sampling are two fundamental strategies to speed up the computation of big data and increase scalability. In this paper, we prese...
Saved in:
Main Authors: | Mohammad Sultan Mahmud, Joshua Zhexue Huang, Salman Salloum, Tamer Z. Emara, Kuanishbay Sadatdiynov |
---|---|
Format: | Article |
Language: | English |
Published: |
Tsinghua University Press
2020-06-01
|
Series: | Big Data Mining and Analytics |
Subjects: | |
Online Access: | https://www.sciopen.com/article/10.26599/BDMA.2019.9020015 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Survey of Distributed Computing Frameworks for Supporting Big Data Analysis
by: Xudong Sun, et al.
Published: (2023-06-01) -
Comprehensive Survey of Big Data Mining Approaches in Cloud Systems
by: Zainab Salih Ageed, et al.
Published: (2021-04-01) -
Big Data with Cloud Computing: Discussions and Challenges
by: Amanpreet Kaur Sandhu
Published: (2022-03-01) -
Design and Application of a Data-computation Integrated Database for Meteorological Grid Data
by: Wang Shu, et al.
Published: (2025-01-01) -
Super Partition: fast, flexible, and interpretable large-scale data reduction in R
by: Katelyn J. Queen, et al.
Published: (2025-01-01)