-
21
CUDT: A CUDA Based Decision Tree Algorithm
Published 2014-01-01“…In the proposed system, CPU is responsible for flow control while the GPU is responsible for computation. …”
Get full text
Article -
22
HIGH-PERFORMANCE SIMULATIONS OF POPULATION-GENETIC PROCESSES IN BACTERIAL COMMUNITIES USING THE HAPLOID EVOLUTIONARY CONSTRUCTOR SOFTWARE
Published 2015-01-01“…These high-performance versions are to be run on systems with shared and distributed memory, using CPU and/or GPU. Almost linear acceleration has been achieved on clusters and multi-core CPU. …”
Get full text
Article -
23
Accelerated computational implementation of reconciliation for continuous variable quantum key distribution on GPU
Published 2017-11-01“…For the low computing speed of reconciliation for current continuous variable quantum key distribution, CPU&amp;GPU-parallel reconciliation algorithms was designed based on LDPC of SEC protocol to speed up decoding computing.In order to raise decoding speed without sacrifice reconciliation efficiency,a static two-way cross linked list to efficiently store large scale sparse parity matrix was employed.The simulation experimental results show that the speed of the decoding rate reaches 16.4 kbit/s when the channel SNR is over 4.9 dB and the reliability of the 2×10<sup>5</sup>continuous variable quantum sequence,with reconciliation efficiency of 91.71%.The experimental based on the Geforce GT 650 MB GPU and the 2.5 GHz and 8 GB memory CPU hardware platform.Relative to the only CPU platform,computing speed increased by more than 15 times.…”
Get full text
Article -
24
An Efficient GPU-Accelerated Algorithm for Solving Dynamic Response of Fluid-Saturated Porous Media
Published 2025-01-01“…For a data set with two million degrees of freedom, it takes only about 0.05 s to compute an iterative step and transfer the data to the CPU. The program is designed to calculate either in single or double precision. …”
Get full text
Article -
25
-
26
Optimization of Hydraulic Machinery Bladings by Multilevel CFD Techniques
Published 2005-01-01“…The numerical design optimization for complex hydraulic machinery bladings requires a high number of design parameters and the use of a precise CFD solver yielding high computational costs. …”
Get full text
Article -
27
Refactoring Android Source Code Smells From Android Applications
Published 2025-01-01“…This study enhances the Android application development lifecycle by offering developers a feasible solution for optimizing CPU efficiency, reducing memory use, and minimizing battery consumption.…”
Get full text
Article -
28
LE-YOLOv5: A Lightweight and Efficient Neural Network for Steel Surface Defect Detection
Published 2024-01-01“…At the same time, under the condition that the inference time for an image on a CPU-dependent low computing power force remains the same, the accuracy has improved by 5.3% compared to YOLOv8. …”
Get full text
Article -
29
FCM Clustering Approach Optimization Using Parallel High-Speed Intel FPGA Technology
Published 2022-01-01“…Our experimental results (based on several datasets) show that the proposed method makes the FCM execution time more than 186 times faster than the conventional design running on a single-core CPU platform. Also, its processing power reached 89 giga floating points operations per second (GFLOPs).…”
Get full text
Article -
30
Parallel orchestration and deployment system for scalable heterogeneous service function chain supporting polymorphic network
Published 2022-09-01“…Aiming at the problem of second-level service function chain orchestration and deployment in the polymorphic data center with the scale of 10 000 servers, a scalable heterogeneous SFC parallel orchestration and deployment (SHOD) system was proposed.The scale-out orchestration time of service chain was reduced to seconds by partitioning the data center and adopting a parallel orchestrator design that minimizes resource usage.Using a mediator design model that satisfies the single-responsibility principle, the heterogeneous equipment scalable orchestration and deployment were realized as well as service function chain building chain.Experimental results show that in a data center with 10 000 servers, the scale-out orchestration time of SHOD system is reduced to <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML"> <mfrac> <mn>1</mn> <mrow> <mn>12</mn></mrow> </mfrac> </math></inline-formula> of the existing system-Daisy, while only 1.5 times CPU utilization overhead is introduced.…”
Get full text
Article -
31
Analysis of DoS attacks on Docker inter-component stdio copy
Published 2020-12-01“…In recent years,Docker has been widely deployed due to its flexibility and high scalability.However,its modular design leads to the DoS attacks on inter-component communication.A new DoS attack that outputs to stdout,causing high CPU usages among different Docker components.Analysis shows that the stdout output triggers the goroutines of Docker components.To find all goroutines setup paths,using the static analysis method to analyze the Docker components systematically was proposed.A static analysis framework was designed and implemented,and evaluated on Docker source code.The results show that static analysis framework finds 34 paths successfully,while 22 of them are confirmed by runtime verification.…”
Get full text
Article -
32
Multi-Softcore Architecture on FPGA
Published 2014-01-01“…The NIOS II features a general-purpose RISC CPU architecture designed to address a wide range of applications. …”
Get full text
Article -
33
RP-Ring: A Heterogeneous Multi-FPGA Accelerator
Published 2018-01-01“…Our accelerator can achieve two orders of magnitude speedup compared with CPU implementation.…”
Get full text
Article -
34
Research on LEO Satellite Network Routing Security
Published 2023-06-01“…The design of secure mechanisms and failure recovery mechanisms for inter-satellite routing has become pivotal in maintaining dependable communication within the LEO satellite network.To address the potential security threats faced by satellite networks, the impact of different routing attack behaviors on a typical inter-satellite routing protocol was analyzed and differentiated packet security authentication mechanisms and link failure recovery mechanisms were designed.Additionally, a satellite network emulation platform based on virtualization technology was constructed, enabled the verification of the effectiveness of the designed inter-satellite routing security mechanisms through the emulation of various routing attack scenarios.Furthermore, performance metrics such as CPU utilization and packet processing time were also evaluated before and after the introduction of security mechanisms.Experimental results demonstrated that the proposed inter-satellite routing security mechanism effectively mitigated multiple security threats in the space network environment while reduced communication latency caused by sudden link failures, thereby ensured secure and reliable communication within the LEO satellite network.…”
Get full text
Article -
35
Survey of FPGA based recurrent neural network accelerator
Published 2019-08-01“…Recurrent neural network(RNN) has been used wildly used in machine learning field in recent years,especially in dealing with sequential learning tasks compared with other neural network like CNN.However,RNN and its variants,such as LSTM,GRU and other fully connected networks,have high computational and storage complexity,which makes its inference calculation slow and difficult to be applied in products.On the one hand,traditional computing platforms such as CPU are not suitable for large-scale matrix operation of RNN.On the other hand,the shared memory and global memory of hardware acceleration platform GPU make the power consumption of GPU-based RNN accelerator higher.More and more research has been done on the RNN accelerator of the FPGA in recent years because of its parallel computing and low power consumption performance.An overview of the researches on RNN accelerator based on FPGA in recent years is given.The optimization algorithm of software level and the architecture design of hardware level used in these accelerator are summarized and some future research directions are proposed.…”
Get full text
Article -
36
Energy-efficient 3D multi-band I/O interface for enhanced mobile memory communication
Published 2025-01-01“…The proposed multi-band I/O (MBI) interface utilizes 3D integration, for two radio frequency band transceivers, and a CMOS driver with resistive feedback for the baseband (BB) transceiver. The proposed design implements a band-selective transformer to enable transmitting three bands, baseband and 2 RF bands (10&30 GHz), simultaneously. …”
Get full text
Article -
37
An Efficient GPU-Based Out-of-Core LU Solver of Parallel Higher-Order Method of Moments for Solving Airborne Array Problems
Published 2017-01-01“…A parallel framework involving MPI and CUDA is adopted to ensure that the procedures run on a hybrid CPU/GPU cluster. An efficient two-level out-of-core scheme is designed to break the bottleneck of both GPU memory and physical memory when solving electrically large and complex problems. …”
Get full text
Article -
38
The Comparison of Lately Proposed Harris Hawks Optimization and Jaya Optimization in Solving Directional Overcurrent Relays Coordination Problem
Published 2020-01-01“…The comparisons show that the robustness and consistency of HHO is relatively better than Jaya, while Jaya provides faster convergence rate with less CPU time and occasionally more competitive objective function value than HHO.…”
Get full text
Article -
39
Implementation of Membrane Algorithms on GPU
Published 2014-01-01“…Membrane algorithms are a new class of parallel algorithms, which attempt to incorporate some components of membrane computing models for designing efficient optimization algorithms, such as the structure of the models and the way of communication between cells. …”
Get full text
Article -
40
Implementasi Algoritma BFCC dan kNN pada Embedded System untuk Deteksi Dini Bronchitis
Published 2023-07-01“…Hasil pengujian menunjukkan rata-rata waktu komputasi sebesar 4,452 detik dan penggunaan CPU sebesar 26%, serta akurasi kNN sebesar 73% untuk perhitungan jarak Euclidean dengan nilai neighbour = 5. …”
Get full text
Article