-
21
Applying MapReduce frameworks to a virtualization platform for Deep Web data source discovery
Published 2011-01-01Subjects: Get full text
Article -
22
Replication-Based Query Management for Resource Allocation Using Hadoop and MapReduce over Big Data
Published 2023-12-01“…Data generation rates are so scary, creating pressure to implement costly and straightforward data storage and recovery processes. MapReduce model functionality is used for creating a cluster parallel, distributed algorithm, and large datasets. …”
Get full text
Article -
23
An AkNN Algorithm for High-Dimensional Big Data
Published 2015-07-01Subjects: Get full text
Article -
24
An Adaptive Subspace Similarity Search Approach
Published 2015-07-01Subjects: Get full text
Article -
25
Research on real-time fusion method of multi-source heterogeneous flight trajectory data stream
Published 2020-09-01Subjects: Get full text
Article -
26
Research and application of prediction model based on ensemble BP neural network
Published 2016-02-01Subjects: Get full text
Article -
27
DSMC: A Novel Distributed Store-Retrieve Approach of Internet Data Using MapReduce Model and Community Detection in Big Data
Published 2014-11-01“…The traditional single-host web spider and data store approaches have some problems such as low efficiency and large memory requirement, so this paper proposes a big data store-retrieve approach DSMC (distributed store-retrieve approach using MapReduce model and community detection) based on distributed processing. …”
Get full text
Article -
28
Summary of Large-Scale Grapb Partitioning Algoritbms
Published 2014-07-01Subjects: Get full text
Article -
29
Design of Mobile Internet Big Data User Behavior Analysis Engine Based on Cloud Computing
Published 2013-03-01Subjects: Get full text
Article -
30
Survey of Distributed Computing Frameworks for Supporting Big Data Analysis
Published 2023-06-01“…In addition, we present a non-MapReduce distributed computing framework that has the potential to overcome big data analysis challenges.…”
Get full text
Article -
31
Parallel association rules incremental mining algorithm based on information entropy and genetic algorithm
Published 2021-05-01“…Aiming at the problems that in the big data environment, the Can-tree based incremental association rule algorithm had problems such as too much space occupation of the tree structure, inability to dynamically set the support threshold, and too much time consumption during the data transfer process between the Map and Reduce stages, the Map Reduce-based parallel association rules incremental mining algorithm using information entropy and genetic algorithm (MR-PARIMIEG)was proposed.Firstly, a similar items merging based on information entropy (SIM-IE) was designed to merge similar data items, and a Can tree based on the merged data set was constructed, thereby reducing the space occupation of the tree structure.Secondly, the dynamic support threshold obtaining using genetic algorithm (DST-GA) was proposed to obtain the relatively optimal dynamic support threshold in the big data environment, and frequent itemset mining was performed according to this threshold to avoid the unnecessary time consumption caused by mining redundant frequent patterns.Finally, in the process of MapReduce parallel operation, the parallel LZO data compression algorithm was used to compress the output data of the Map stage, thereby reducing the size of the transmitted data, and finally improving the running speed of the algorithm.Experimental simulation results show that MR-PARIMIEG has better performance when mining frequent item sets in the big data environment, and it is suitable for parallel processing of larger data sets.…”
Get full text
Article -
32
A Phoenix++ Based New Genetic Algorithm Involving Mechanism of Simulated Annealing
Published 2015-08-01“…Phoenix++ implements the MapReduce programming model that processes and generates large data sets with our parallel, distributed algorithm on a cluster. …”
Get full text
Article -
33
High Performance Frequent Subgraph Mining on Transaction Datasets: A Survey and Performance Comparison
Published 2019-09-01“…Although MapReduce came with many benefits, its disk I/O and non-iterative style model could not help much for FSM domain since subgraph mining process is an iterative approach. …”
Get full text
Article -
34
Maintenance Analysis Method Based on Big Data Processing Technology in Power Telecommunication Network
Published 2015-11-01“…The characteristics of power communication network equipment's maintenance,alarm and asset data were analyzed,and then the data preprocessing work was done by adopting attributes reduction and clustering method.The related index and influencing factors required by the analysis of equipment repair work were proposed.Meanwhile,the data mining method which combines the techniques of MapReduce and Apriori algorithms was presented.Finally,the proposed data mining method was applied to mine the actual network data.According to the mined results,the network maintenance work was analyzed from the perspectives of maintenance distribution,maintenance effectiveness on operated service,maintenance time-cost and so forth.…”
Get full text
Article -
35
DRAV: Detection and repair of data availability violations in Internet of Things
Published 2019-11-01“…DRAV uses algorithms in the MapReduce programming framework, and these include detection and repair algorithms based on enhanced conditional function dependency for data consistency violation, MapJoin, and ReduceJoin algorithms based on master data for k -nearest neighbor–based integrity violation detection, and repair algorithms. …”
Get full text
Article -
36
A discovery strategy for APT anomaly based on homologous behavior analysis
Published 2016-01-01“…As APT(advanced persistent threat)attacks are increasingly frequently,higher requirements for the detection of APT attacks were proposed.It was an effective method to early discover the attack behavior of APT based on homologous behavior analysis.Aiming at the problem of low efficiency of data authentication caused by excessive data,the historical behavior database with data label technology was established and the database was stored in the cloud.Relying on the Hadoop platform and the aggregate computing ability of MapReduce and the pseudorandom permutation technique,the whole traffic parallel detection of the network was realized.In order to determine whether there was a APT attack behavior,the detection of APT attacks was implemented by comparing the data labels in the database.Test results show that the proposed method can detect the abnormal behavior of APT from the network as soon as possibleand improve the efficiency of the whole data flow detection.…”
Get full text
Article -
37
Data Analysis Method of Intelligent Analysis Platform for Big Data of Film and Television
Published 2021-01-01“…The method selects Hadoop open source cloud platform as the basis, combines the MapReduce distributed programming model and HDFS distributed file storage system and other key cloud computing technologies. …”
Get full text
Article -
38
Supporting Efficient Family Joins for Big Data Tables via Multiple Freedom Family Index
Published 2025-01-01“…The Hadoop/MapReduce framework has been widely utilized for processing big data. …”
Get full text
Article -
39
An Improved Clustering Algorithm of Tunnel Monitoring Data for Cloud Computing
Published 2014-01-01“…It is a clustering algorithm using the MapReduce within cloud computing that deals with data. …”
Get full text
Article -
40
A Binary Cuckoo Search Big Data Algorithm Applied to Large-Scale Crew Scheduling Problems
Published 2018-01-01“…In this article, we propose a cuckoo search binary algorithm using the MapReduce programming paradigm implemented in the Apache Spark tool. …”
Get full text
Article