Showing 241 - 260 results of 306 for search '"reinforcement learning"', query time: 0.06s Refine Results
  1. 241

    A computing allocation strategy for Internet of things’ resources based on edge computing by Zengrong Zhang

    Published 2021-12-01
    “…In order to meet the demand for efficient computing services in big data scenarios, a cloud edge collaborative computing allocation strategy based on deep reinforcement learning by combining the powerful computing capabilities of cloud is proposed. …”
    Get full text
    Article
  2. 242

    Convolutional neural network (CNN) configuration using a learning automaton model for neonatal brain image segmentation. by Iran Sarafraz, Hamed Agahi, Azar Mahmoodzadeh

    Published 2025-01-01
    “…., size, length, and width of the filter in each layer along with the type of pooling functions with a reinforcement learning approach and an LA model are determined. …”
    Get full text
    Article
  3. 243

    Data-Driven Robust Optimization of the Vehicle Routing Problem with Uncertain Customers by Jingling Zhang, Yusu Sun, Qinbing Feng, Yanwei Zhao, Zheng Wang

    Published 2022-01-01
    “…The Q-learning algorithm in reinforcement learning is introduced into the high-level selection strategy using the hyper-heuristic algorithm, and a hyper-heuristic algorithm based on the Q-learning algorithm is designed to solve the problem. …”
    Get full text
    Article
  4. 244

    Bending obstacles when moving a mobile robot by A. V. Sidorenko, N. A. Saladukha

    Published 2023-08-01
    “…The developed software includes the Mobile Robotics Simulation Toolbox, Reinforcement Learning Toolbox, and the Gazebo visualization package for environment simulation. …”
    Get full text
    Article
  5. 245

    Environmental regulation, market power and low-carbon development of China's coal power industry chain —Based on both strategy and return perspectives by Jiaming Gao, Li Zhang

    Published 2025-03-01
    “…Furthermore, a reinforcement learning model has been developed using the payoff matrix. …”
    Get full text
    Article
  6. 246

    The Ecosystem of AI-Driven Robotics in Pediatric Neurorehabilitation by Tole Sutikno, Lina Handayani

    Published 2025-02-01
    “…We address this review to facilitate an in-depth analysis of the effective integration of advanced technologies, such as artificial emotional intelligence and interactive reinforcement learning, into rehabilitation practices. By critically assessing each element, from the psychological dynamics of patient engagement to the technical intricacies of real-time adaptive learning systems, we can better understand their pivotal roles in enhancing therapeutic efficacy. …”
    Get full text
    Article
  7. 247

    Q-Learning-Driven Butterfly Optimization Algorithm for Green Vehicle Routing Problem Considering Customer Preference by Weiping Meng, Yang He, Yongquan Zhou

    Published 2025-01-01
    “…This paper proposes a Q-learning-driven butterfly optimization algorithm (QLBOA) by integrating the Q-learning mechanism of reinforcement learning into the butterfly optimization algorithm (BOA). …”
    Get full text
    Article
  8. 248

    Research on Flexible Resource Dynamic Interactive Regulation Technology for Microgrids with High Permeable New Energy by Songsong Chen, Lutao Zhang, Ying Zhou, Ke Chen, Zhongdong Wang, Lihui Xie

    Published 2023-01-01
    “…The first layer adopts the IQ (λ) strategy, which avoids the overestimation and underestimation problems of traditional reinforcement learning by the coupled estimation method. The second layer adopts the HDQC allocation strategy, which solves the problem of low utilization of new energy in the proportional allocation method and improves the adaptability of the regulation strategy in the complex stochastic environment. …”
    Get full text
    Article
  9. 249

    Recurrent neural networks with transient trajectory explain working memory encoding mechanisms by Chenghao Liu, Shuncheng Jia, Hongxing Liu, Xuanle Zhao, Chengyu T. Li, Bo Xu, Tielin Zhang

    Published 2025-01-01
    “…Besides activity patterns resembling animal recordings and retained versatility to variable encoding time, TRNNs show better performance in delayed choice and spatial memory reinforcement learning tasks. Therefore, this study provides evidence supporting the transient activity theory to explain the WM mechanism from the model designing point of view.…”
    Get full text
    Article
  10. 250

    Artificial intelligence techniques applications in the wastewater: A comprehensive review by Zakur Yahya, Márquez Fausto, Al-Taie Ali, Alsaidi Saif, Alsadoon Abeer, Mirashrafi Seyed Bagher, Flaih Laith, Zakoor Yousif

    Published 2025-01-01
    “…The critical gaps and the future directions in the (AI) algorithms for the wastewater treatment, including the explain ability of the data-driven models or transfer Learning processes and reinforcement learning, are also addressed.…”
    Get full text
    Article
  11. 251

    Foot trajectory as a key factor for diverse gait patterns in quadruped robot locomotion by Shura Suzuki, Kosuke Matayoshi, Mitsuhiro Hayashibe, Dai Owaki

    Published 2025-01-01
    “…While substantial progress in robotic mobility has been achieved using reinforcement learning techniques, quadruped animals exhibit superior agility by employing fundamentally different strategies. …”
    Get full text
    Article
  12. 252

    Intentionally-underestimated value function at terminal state for temporal-difference learning with mis-designed reward by Taisuke Kobayashi

    Published 2025-03-01
    “…Robot control using reinforcement learning has become popular, but its learning process often terminates midway through an episode for safety and time-saving reasons. …”
    Get full text
    Article
  13. 253

    The Role of Cognitive Science and Big Data Technology in the Design of Business Information Management Systems by Yangfan Li

    Published 2022-01-01
    “…Based on the real-time computing technology of massive data, the label optimization scheme of collaborative filtering and reinforcement learning is used to realize the logistics distribution recommendation model and to solve the accuracy and real-time problems of logistics service distribution analysis.…”
    Get full text
    Article
  14. 254

    A Deep Q-Network-Based Collaborative Control Research for Smart Ammunition Formation by Jian Shen, Benkang Zhang, Qingyu Zhu, Pengyun Chen

    Published 2022-01-01
    “…Next, we describe the SAF collaborative control process as a Markov decision process (MDP) with the application of the reinforcement learning (RL) technique. Then, the basic framework ε-imitation action-selecting strategy and the algorithm details were developed to address the SAF control problem based on the DQN scheme. …”
    Get full text
    Article
  15. 255

    LazyAct: Lazy actor with dynamic state skip based on constrained MDP. by Hongjie Zhang, Zhenyu Chen, Hourui Deng, Chaosheng Feng

    Published 2025-01-01
    “…Deep reinforcement learning has achieved significant success in complex decision-making tasks. …”
    Get full text
    Article
  16. 256

    Multiple Unmanned Aerial Vehicle Collaborative Target Search by DRL: A DQN-Based Multi-Agent Partially Observable Method by Heng Xu, Dayong Zhu

    Published 2025-01-01
    “…In unknown environments, UAVs can significantly reduce the risk of casualties and improve the safety and covertness when performing missions. Reinforcement Learning allows agents to learn optimal policies through trials in the environment, enabling UAVs to respond autonomously according to the real-time conditions. …”
    Get full text
    Article
  17. 257

    Using EEG technology to enhance performance measurement in physical education by Zhaofeng Zhai, Zhaofeng Zhai, Lu Han, Wei Zhang

    Published 2025-02-01
    “…APEO combines biomechanical modeling, engagement prediction through recurrent neural networks, and reinforcement learning to tailor physical activity interventions. …”
    Get full text
    Article
  18. 258

    Human-Inspired Gait and Jumping Motion Generation for Bipedal Robots Using Model Predictive Control by Zhen Xu, Jianan Xie, Kenji Hashimoto

    Published 2025-01-01
    “…Currently, methods based on hybrid zero dynamics and reinforcement learning have been employed to enhance the walking and hopping capabilities of humanoid robots; however, model predictive control (MPC) presents two significant advantages: it can adapt to more complex task requirements and environmental conditions, and it allows for various walking and hopping patterns without extensive training and redesign. …”
    Get full text
    Article
  19. 259

    Smart City Traffic Flow and Signal Optimization Using STGCN-LSTM and PPO Algorithms by Tuxiang Lin, Rongliang Lin

    Published 2025-01-01
    “…Future research will explore edge computing, multi-agent reinforcement learning, and real-time data integration to further enhance scalability and adaptability.…”
    Get full text
    Article
  20. 260

    Customer segmentation in the digital marketing using a Q-learning based differential evolution algorithm integrated with K-means clustering. by Guanqun Wang

    Published 2025-01-01
    “…This paper proposes a customer segmentation framework within the realm of digital marketing, which integrates a reinforcement learning-based differential evolution algorithm with K-means clustering using dimensionality reduction techniques to address challenges in the customer segmentation process. …”
    Get full text
    Article