-
261
Loss Architecture Search for Few-Shot Object Recognition
Published 2020-01-01“…This procedure is repeated and implemented in the reinforcement learning framework for finding the best loss architecture such that the embedding network yields the highest validation accuracy. …”
Get full text
Article -
262
Optimization of the Rapid Design System for Arts and Crafts Based on Big Data and 3D Technology
Published 2021-01-01“…In the system design, the overall structure design, database design, and functional module design of the system are comprehensively elaborated, and the key issues such as 3D display and home layout generation algorithm based on reinforcement learning are analyzed and designed. In the implementation part of the system, the overall construction of the system and the composition of functional modules are introduced in detail and the main functional modules of the system are presented with interface diagrams. …”
Get full text
Article -
263
Artificial Intelligence as a Catalyst for Management System Adaptability, Agility and Resilience: Mapping the Research Agenda
Published 2025-01-01“…Likewise, its thematic and strategic evolution is characterized as a surprising one, managing to incorporate and relate concepts with a strong technical and IT character such as feature extraction, machine learning, reinforcement learning with concepts of a managerial nature as supporting customer-tailored interaction, employee skills development, company productivity, and innovation.…”
Get full text
Article -
264
Optimizing Spectrum Trading in Cognitive Mesh Network Using Machine Learning
Published 2012-01-01“…These complex contradicting objectives are embedded in our reinforcement learning (RL) model that is developed and implemented as shown in this paper. …”
Get full text
Article -
265
Agent-Based Modeling and Simulation for the Bus-Corridor Problem in a Many-to-One Mass Transit System
Published 2014-01-01“…By using multiagent modeling and the Bush-Mosteller reinforcement learning model, we simulated the day-to-day evolution of commuters’ departure time choice on a many-to-one mass transit system during the morning peak period. …”
Get full text
Article -
266
Parameterless-Growing-SOM and Its Application to a Voice Instruction Learning System
Published 2010-01-01“…The improved SOM is applied to construct a voice instruction learning system for partner robots adopting a simple reinforcement learning algorithm. User's instructions of voices are classified by the PL-G-SOM at first, then robots choose an expected action according to a stochastic policy. …”
Get full text
Article -
267
Modeling and Optimization of Multiaction Dynamic Dispatching Problem for Shared Autonomous Electric Vehicles
Published 2021-01-01“…Results show that (1) the Kuhn–Munkres algorithm ensures the computational effectiveness in the large-scale real-time application of the AMoD system; (2) the second optimization model considering long-term return can decrease average user waiting time and achieve a 2.78% increase in total revenue compared with the first model; (3) and integrating combinatorial optimization theory with reinforcement learning theory is a perfect package for solving the multiaction dynamic dispatching problem of SAEVs.…”
Get full text
Article -
268
Bridging theory and practice in peer-to-peer energy trading: market mechanisms and technological innovations
Published 2025-01-01“…As such, three market designs are discussed: centralized, decentralized, and distributed, and four pricing mechanisms, which are optimization, game theory, auction-based, and reinforcement learning. Enabling technologies discussed are Energy Internet, Internet of Things, Artificial intelligence, Blockchain, Communication networks, and battery flexibility. …”
Get full text
Article -
269
Minimizing Delay and Power Consumption at the Edge
Published 2025-01-01“…Prior work has mainly focused on two methodologies: (i) formulating non-linear optimizations that lead to NP-hard problems, which are processed via heuristics, and (ii) using AI-based formulations, such as reinforcement learning, that are then tested with simulations. …”
Get full text
Article -
270
An energy management strategy for integrated electricity-thermal energy systems using the DQN-CE algorithm
Published 2025-01-01“…To address the uncertainty and intermittency of renewable energy output in integrated electricity-thermal energy systems, a reinforcement learning method for energy management is proposed, aiming to minimize the operating costs of the system. …”
Get full text
Article -
271
Non-linear multi-objective optimization model of production planning based on fuzzy logic and machine learning
Published 2024-09-01“…This fuzzy logic is combined with machine learning algorithms such as neural networks and reinforcement learning to create an intelligent and flexible model that effectively adapts to sudden changes in dynamic environments. …”
Get full text
Article -
272
Multiple-Camera Patient Tracking Method Based on Motion-Group Parameter Reconstruction
Published 2024-12-01“…This is achieved by automated reinforcement learning and simultaneously applying the interdependences between the cameras. …”
Get full text
Article -
273
Enhancing lane detection in autonomous vehicles with multi-armed bandit ensemble learning
Published 2025-01-01“…The proposed technique optimizes the segmentation accuracy and treats the attained accuracy as a reward signal in the context of reinforcement learning by interacting with the environment through CNN model selection. …”
Get full text
Article -
274
Advanced Deep Learning Algorithms for Energy Optimization of Smart Cities
Published 2025-01-01“…These algorithms analyze real-time data from sensors and IoT devices to predict energy demand, enabling dynamic load balancing and reducing waste. Reinforcement learning models optimize power distribution by learning from historical patterns and adapting to changes in energy usage in real time. …”
Get full text
Article -
275
Delta opioid receptors affect acoustic features of song during vocal learning in zebra finches
Published 2025-01-01“…We wanted to study if they were also involved in naturally-occurring reinforcement learning behaviors such as vocal learning, using the zebra finch model system. …”
Get full text
Article -
276
Composition of Web Services Using Markov Decision Processes and Dynamic Programming
Published 2015-01-01“…Finally, a comparison with two popular reinforcement learning algorithms, sarsa and Q-learning, shows that these algorithms require one or two orders of magnitude and more time than policy iteration, iterative policy evaluation, and value iteration to handle WSC problems of the same complexity.…”
Get full text
Article -
277
Sliding Mode Control for Variable-Speed Trajectory Tracking of Underactuated Vessels with TD3 Algorithm Optimization
Published 2025-01-01“…An adaptive sliding mode controller (SMC) design with a reinforcement-learning parameter optimization method is proposed for variable-speed trajectory tracking control of underactuated vessels under scenarios involving model uncertainties and external environmental disturbances. …”
Get full text
Article -
278
A Sarsa(λ)-Based Control Model for Real-Time Traffic Light Coordination
Published 2014-01-01“…Considering dynamic characteristics of the actual traffic environment, reinforcement learning algorithm based traffic control approach can be applied to get optimal scheduling policy. …”
Get full text
Article -
279
EdgeGuard: Decentralized Medical Resource Orchestration via Blockchain-Secured Federated Learning in IoMT Networks
Published 2024-12-01“…We have made several technological advances, including a lightweight blockchain consensus mechanism designed for IoMT networks, an adaptive edge resource allocation method based on reinforcement learning, and a federated learning algorithm optimized for medical data with differential privacy. …”
Get full text
Article -
280
Model-Based Detection of Coordinated Attacks (DCA) in Distribution Systems
Published 2024-01-01“…In this paper, a novel proactive DCA strategy is proposed for early detection of CCA by establishing correlations among distinct attack events through model-based reinforcement learning that utilizes abductive reasoning to conclude the attacker goal. …”
Get full text
Article