-
161
Constructing and analyzing algorithms of tensor operation paralleling
Published 2022-09-01“…It is assumed that the time of execution for all computing operations is same and equal to a unit of time, and data transfer between computer devices is performed instantaneously without any time consuming (it is acceptable, for example, a parallel computing systems with shared memory). In particular, it is shown that for the tensors' addition the time of the fastest execution of algorithm for an unlimited number of processors is equal to the length of the maximum path in the graph. …”
Get full text
Article -
162
-
163
Maximum Mutual Information Feature Extraction Method Based on the Cloud Platform
Published 2013-10-01“…With the large-scale application of gene chip,gene expression data with high dimension which exists a large number of irrelevant and redundant features may reduce classifier performance problem.A maximum mutual information feature extraction method based on cloud platforms was proposed.Hadoop cloud computing platform could be a parallel computing after gene expression data segmentation,features was extracted at the same time combined with the maximum mutual information method and the characteristics of cloud computing platform filter model was realized.Simulation experiments show that the maximum mutual information feature extraction method based on the cloud platform can rapid extraction of features in a higher classification accuracy which save a lot of time resources to make a highly efficient gene feature extraction system.…”
Get full text
Article -
164
OBJECT TRACKING VIA COMPARISON OF COLOR HISTOGRAMS
Published 2016-09-01“…The histograms are compared by the Bhattacharia criterion. А parallel computing platform CUDA, developed to program GPUs, allows creation of real time or near-real time program realizations of the offered versions. …”
Get full text
Article -
165
On the optimal layout of (K p − C p ) n into grid and certain structures
Published 2025-05-01“…Abstract Interconnection networks constitute complex configurations of processors and communication links that facilitate data transmission between processors in a parallel computing system. Their architecture and design heavily depend on parameters such as wirelength, dilation, bandwidth, and minimum cutwidth. …”
Get full text
Article -
166
Building accurate numerical models
Published 2024-01-01“…Key strategies such as algorithm optimization, parallel computing, and efficient data management are essential for maximizing computational resources. …”
Get full text
Article -
167
Efficient and real-time lane detection using CUDA-based implementation
Published 2024-01-01“…To this end, we used CUDA for acceleration, taking advantage of its parallel computing capabilities to improve performance. …”
Get full text
Article -
168
Chopin: An open source R-language tool to support spatial analysis on parallelizable infrastructure
Published 2025-05-01“…Supporting popular R spatial-analysis libraries, chopin exploits parallel computing by partitioning data involved in each task. …”
Get full text
Article -
169
Parallel Implementation of Katsevich's FBP Algorithm
Published 2006-01-01“…For spiral cone-beam CT, parallel computing is an effective approach to resolving the problem of heavy computation burden. …”
Get full text
Article -
170
Survey of FPGA based recurrent neural network accelerator
Published 2019-08-01“…Recurrent neural network(RNN) has been used wildly used in machine learning field in recent years,especially in dealing with sequential learning tasks compared with other neural network like CNN.However,RNN and its variants,such as LSTM,GRU and other fully connected networks,have high computational and storage complexity,which makes its inference calculation slow and difficult to be applied in products.On the one hand,traditional computing platforms such as CPU are not suitable for large-scale matrix operation of RNN.On the other hand,the shared memory and global memory of hardware acceleration platform GPU make the power consumption of GPU-based RNN accelerator higher.More and more research has been done on the RNN accelerator of the FPGA in recent years because of its parallel computing and low power consumption performance.An overview of the researches on RNN accelerator based on FPGA in recent years is given.The optimization algorithm of software level and the architecture design of hardware level used in these accelerator are summarized and some future research directions are proposed.…”
Get full text
Article -
171
Numerical Solution of Diffusion Models in Biomedical Imaging on Multicore Processors
Published 2011-01-01“…This analysis is carried out in a multicore-based parallel computing environment.…”
Get full text
Article -
172
Pengukuran Performa Apache Spark dengan Library H2O Menggunakan Benchmark Hibench Berbasis Cloud Computing
Published 2019-10-01“…This concept is called parallel computing. Apache Spark has advantages compared to other similar frameworks such as Apache Hadoop, etc., where Apache Spark is able to process data in streaming, meaning that the data entered into the Apache Spark environment can be directly processed without waiting for other data to be collected. …”
Get full text
Article -
173
Deployment and Application of On-Board PHM Models Based on Docker Containers for Intelligent Operation and Maintenance
Published 2025-06-01“…Additionally, the parallel computing efficiency of multiple models is improved due to the lack of data interactions among them, facilitating superior portability and scalability.…”
Get full text
Article -
174
Deep learning-based multimodal trajectory prediction methods for autonomous driving: state of the art and perspectives
Published 2023-06-01“…Although deep learning methods have achieved better results than traditional trajectory prediction algorithms, there are still problems such as information loss, interaction and uncertainty difficulties in modelling, and lack of interpretability of predictions when implementing multimodal high-precision prediction for autonomous vehicles in heterogeneous, highly dynamic and complex changing environments.The newly developed Transformer's long-range modelling capability and parallel computing ability make it a great success not only in the field of natural language processing, but also in solving the above problems when extended to the task of multimodal trajectory prediction for autonomous driving.Based on this, the aim of this paper is to provide a comprehensive summary and review of past deep neural network-based approaches, in particular the Transformer-based approach.The advantages of Transformer over traditional sequential network, graphical neural network and generative model were also analyzed and classified in relation to existing challenges, simultaneously.Transformer models can be better applied to multimodal trajectory prediction tasks, and that such models have better generalisation and interpretability.Finally, the future directions of multimodal trajectory prediction were presented.…”
Get full text
Article -
175
Static Scheduling of Periodic Hardware Tasks with Precedence and Deadline Constraints on Reconfigurable Hardware Devices
Published 2011-01-01“…Using these three main stages, dynamic partial reconfiguration and mixed integer programming, pipelined scheduling and efficient placement are achieved and enable parallel computing of the task graph on the reconfigurable devices by optimizing placement/scheduling quality. …”
Get full text
Article -
176
Computational Modeling of Ganglion Cell Bicolor Opponent Receptive Fields and FPGA Adaptation for Parallel Arrays
Published 2024-08-01“…We also implement a circuit-level distributed parallel computing model on FPGAs. The results show that we are able to transfer information with low energy consumption and high parallelism. …”
Get full text
Article -
177
Data Mining for the Internet of Things: Literature Review and Challenges
Published 2015-08-01Get full text
Article -
178
Optimizing Euclidean Distance Computation
Published 2024-11-01“…From spatial data structures and approximate nearest neighbor algorithms to dimensionality reduction, vectorization, and parallel computing, various approaches exist to accelerate Euclidean distance computation in different contexts. …”
Get full text
Article -
179
A Domain Decomposition Method for Time Fractional Reaction-Diffusion Equation
Published 2014-01-01“…The computational complexity of one-dimensional time fractional reaction-diffusion equation is O(N2M) compared with O(NM) for classical integer reaction-diffusion equation. Parallel computing is used to overcome this challenge. Domain decomposition method (DDM) embodies large potential for parallelization of the numerical solution for fractional equations and serves as a basis for distributed, parallel computations. …”
Get full text
Article -
180
Solving the SAT Problem by Cell-Like P Systems with Channel States and Symport Rules
Published 2023-01-01“…Cell-like P systems with channel states, which are a variant of tissue P systems in membrane computing, can be viewed as highly parallel computing devices based on the nested structure of cells, where communication rules are classified as symport rules and antiport rules. …”
Get full text
Article