-
421
Pri-DDQN: learning adaptive traffic signal control strategy through a hybrid agent
Published 2024-11-01Subjects: Get full text
Article -
422
Adaptive Security Solutions for NOMA Networks: The Role of DDPG and RIS-Equipped UAVs
Published 2024-11-01Subjects: “…non-orthogonal multiple access, reconfigurable intelligent surface, unmanned aerial vehicles, deep reinforcement learning, deep deterministic policy gradient, physical layer security.…”
Get full text
Article -
423
Trust Region Policy Learning for Adaptive Drug Infusion with Communication Networks in Hypertensive Patients
Published 2025-01-01Subjects: Get full text
Article -
424
Multi-Fault-Tolerant Operation of Grid-Interfaced Photovoltaic Inverters Using Twin Delayed Deep Deterministic Policy Gradient Agent
Published 2024-12-01Subjects: “…reinforcement learning…”
Get full text
Article -
425
Controlled Signal Technique in VL‐NOMA Communication Under Interference‐Controlled Environment With Intelligent Reflecting Surfaces
Published 2025-01-01Subjects: “…deep reinforcement learning (DRL)…”
Get full text
Article -
426
AI/ML Enabled Automation System for Software Defined Disaggregated Open Radio Access Networks: Transforming Telecommunication Business
Published 2024-06-01Subjects: Get full text
Article -
427
Clinical Applications of Machine Learning
Published 2024-06-01“…This review introduces interpretable predictive machine learning approaches, natural language processing, image recognition, and reinforcement learning methodologies to familiarize end users. …”
Get full text
Article -
428
A Deep Q-Learning Algorithm With Guaranteed Convergence for Distributed and Uncoordinated Operation of Cognitive Radios
Published 2025-01-01“…This paper studies a deep reinforcement learning technique for distributed resource allocation among cognitive radios operating under an underlay dynamic spectrum access paradigm which does not require coordination between agents during learning. …”
Get full text
Article -
429
Data-driven joint routing, topology, and mobility design for FANET systems using a digital twin approach
Published 2025-01-01“…The decision support of the network digital twin is provided by model-based reinforcement learning using dynamic programming and policy iteration algorithm. …”
Get full text
Article -
430
Construction of Personalized Learning Platform Based on Intelligent Algorithm in the Context of Industry Education Integration
Published 2022-01-01“…Based on the concept of integration of production and education, with the help of intelligent recommendation algorithm of reinforcement learning, a personalized learning platform based on intelligent algorithm is constructed. …”
Get full text
Article -
431
An adaptive video stream transmission control method for wireless heterogeneous networks based on A3C
Published 2020-12-01“…The adaptive bit rate (ABR) algorithm has become the focus research in video transmission.However,due to the characteristics of 5G wireless heterogeneous networks,such as large fluctuation of channel bandwidth and obvious differences between different networks,the adaptive video stream transmission with multi-terminal cooperation was faced with great challenges.An adaptive video stream transmission control method based on deep reinforcement learning was proposed.First of all,a video stream dynamic programming model was established to jointly optimize the transmission rate and diversion strategy.Since the solution of this optimization problem depended on accurate channel estimation,dynamically changing channel state was difficult to achieve.Therefore,the dynamic programming problem was improved to reinforcement learning task,and the A3C algorithm was used to dynamically determine the video bit rate and diversion strategy.Finally,the simulation was carried out according to the measured network data,and compared with the traditional optimization method,the method proposed better improved the user QoE.…”
Get full text
Article -
432
Intelligent Buses in a Loop Service: Emergence of No-Boarding and Holding Strategies
Published 2020-01-01“…We study how N intelligent buses serving a loop of M bus stops learn a no-boarding strategy and a holding strategy by reinforcement learning. The no-boarding and holding strategies emerge from the actions of stay or leave when a bus is at a bus stop and everyone who wishes to alight has done so. …”
Get full text
Article -
433
A Neural Correlate of Predicted and Actual Reward-Value Information in Monkey Pedunculopontine Tegmental and Dorsal Raphe Nucleus during Saccade Tasks
Published 2011-01-01“…Dopamine, acetylcholine, and serotonin, the main modulators of the central nervous system, have been proposed to play important roles in the execution of movement, control of several forms of attentional behavior, and reinforcement learning. While the response pattern of midbrain dopaminergic neurons and its specific role in reinforcement learning have been revealed, the role of the other neuromodulators remains rather elusive. …”
Get full text
Article -
434
Intelligent interference decision algorithm with prior knowledge embedded LSTM-PPO model
Published 2024-12-01“…Focusing on the issues of low efficiency and effectiveness in decision-making as well as the instability of traditional reinforcement learning model-based multi-function radar (MFR) jamming decision algorithms, a prior knowledge embedded long short-term memory (LSTM) network-proximal policy optimization (PPO) model based intelligent interference decision algorithm was developed. …”
Get full text
Article -
435
Fast Frequency Control Strategy Based on Worst-Case Network Attack Perception
Published 2024-12-01“…This strategy focuses on frequency stability in power systems with a high penetration of renewable energy, and utilizes a reinforcement learning agent to intelligently adjust the power setpoint of voltage source converters (VSCs), ensuring that both the frequency and the rate of change of the frequency remain within permissible limits. …”
Get full text
Article -
436
Multimedia Tasks-Oriented Edge Computing Offloading Scheme Based on Graph Neural Network in Vehicular Networks
Published 2025-01-01“…Second, deep Reinforcement Learning (DRL) is introduced to learn optimal task deployment policies, enhancing the efficiency and accuracy of task deployment. …”
Get full text
Article -
437
Perbandingan Metode Penyelesaian Permasalahan Optimasi Lintas Domain dengan Pendekatan Hyper-Heuristic Menggunakan Algoritma Reinforcement-Late Acceptance
Published 2021-10-01“…Dalam meningkatkan kinerja, penelitian ini menguji pengaruh dari adaptasi algoritma Reinforcement Learning (RL) sebagai strategi seleksi LLH dikombinasikan dengan algoritma Late Acceptance sebagai move acceptance, selanjutnya disebut algoritma Reinforcement Learning-Late acceptance (RL-LA). …”
Get full text
Article -
438
Deep Learning in Music Generation: A Comprehensive Investigation of Models, Challenges and Future Directions
Published 2025-01-01“…This review explores various and recent deep learning models, such as Long Short-Term Memory (LSTM) networks, Transformer-based models, Reinforcement Learning (RL), and Diffusion-based architectures, and how they are applied to music generation. …”
Get full text
Article -
439
Motor synergy and energy efficiency emerge in whole-body locomotion learning
Published 2025-01-01“…We investigated the emergence of synergies through deep reinforcement learning of whole-body locomotion tasks. We carried out a joint-space synergy analysis on whole-body control solutions for walking and running agents in simulated environments. …”
Get full text
Article -
440
Differential Grey Wolf Load-Balanced Stochastic Bellman Deep Reinforced Resource Allocation in Fog Environment
Published 2022-01-01“…In a Stochastic Gradient and Deep Reinforcement Learning-based Resource Allocation Model, a stochastic gradient bellman optimality function and Deep Reinforcement Learning are integrated for optimal resource allocation. …”
Get full text
Article