Search Results - (structured OR (structures OR (structural OR structure))) global (convolution OR convolutional)

301

Dual-branch attention network-based stereoscopicvideo compression by TANG Shu, ZHAO Yu, YANG Shuli, XIE Xian-Zhong

Published 2025-01-01
“…First, a Local and Global Encoder-decoder Block (LGEDB) based on Transformer and channel attention was proposed, which accurately captured non-repetitive texture details in local regions and global structural information by integrating pixel-level self-attention within each local area and global attention across channels. …”

Get full text

Article

Save to List

Saved in:
302

A Dual-Stream Dental Panoramic X-Ray Image Segmentation Method Based on Transformer Heterogeneous Feature Complementation by Tian Ma, Jiahui Li, Zhenrui Dang, Yawen Li, Yuancheng Li

Published 2025-07-01
“…Furthermore, a Pooling-Cooperative Convolutional Module was designed, which enhances the model’s capability in detail extraction and boundary localization through weighted centroid features of dental structures and a latent edge extraction module. …”

Get full text

Article

Save to List

Saved in:
303

Fusion of Recurrence Plots and Gramian Angular Fields with Bayesian Optimization for Enhanced Time-Series Classification by Maria Mariani, Prince Appiah, Osei Tweneboah

Published 2025-07-01
“…Time-series classification remains a critical task across various domains, demanding models that effectively capture both local recurrence structures and global temporal dependencies. We introduce a novel framework that transforms time series into image representations by fusing recurrence plots (RPs) with both Gramian Angular Summation Fields (GASFs) and Gramian Angular Difference Fields (GADFs). …”

Get full text

Article

Save to List

Saved in:
304

Research on SeaTreasure Target Detection Technology Based on Improved YOLOv7-Tiny by Xiang Shi, Yunli Zhao, Jinrong Guo, Yan Liu, Yongqi Zhang

Published 2025-01-01
“…First, based on the YOLOv7-Tiny network, the MAFPN neck structure is used to replace the ELAN structure to achieve the multi-scale capture of semantic information of underwater sea treasures, and to enhance the UPA-YOLO model to accurately locate the targets of underwater sea treasures; second, the P2ELAN module is constructed and added to the backbone network, which makes use of the redundancy information in the feature map and dynamically adjusts the convolution kernel to adapt to data The P2ELAN module is added to the backbone network, using the redundant information in the feature map, dynamically adjusting the convolutional kernel to adapt to the lack of data, reducing the number of parameters in the model, and introducing the MSCA attention mechanism to inhibit the complex and changeable background features underwater, to improve the semantic feature extraction ability of the UPA-YOLO model for underwater targets, adding the MPDiou loss function to the improved algorithm model and completing the data validation of the detection model; finally, based on the TensorRT acceleration framework, the optimisation of the target detection Finally, based on the TensorRT acceleration framework, the target detection model is optimised, and the Jetson Nano edge device is used to complete the localisation deployment and realise the real-time target detection task of underwater sea treasures. …”

Get full text

Article

Save to List

Saved in:
305

Identification of diabetic retinopathy lesions in fundus images by integrating CNN and vision mamba models. by Zenglei Liu, Ailian Gao, Hui Sheng, Xueling Wang

Published 2025-01-01
“…The majority of deep learning techniques developed for medical image analysis rely on convolutional modules to extract the inherent structure of images within a certain local receptive field. …”

Get full text

Article

Save to List

Saved in:
306

An Mcformer encoder integrating Mamba and Cgmlp for improved acoustic feature extraction by Nurmemet Yolwas, Yongchao Li, Lixu Sun, Jian Peng, Zhiwu Sun, Yajie Wei, Yineng Cai

Published 2025-07-01
“…To address this limitation, the Mcformer encoder is introduced, which incorporates the Mamba module in parallel with multi-head attention blocks to enhance the model’s global context processing capabilities. Additionally, a Convolutional Gated Multilayer Perceptron (Cgmlp) structure is employed to improve the extraction of local features through deep convolutional layers. …”

Get full text

Article

Save to List

Saved in:
307

Efficient Image Super-Resolution With Multi-Branch Mixer Transformer by Long Zhang, Yi Wan

Published 2025-03-01
“…To address these problems, we propose a Multi-Branch Token Mixer (MBTM) to extract richer global and local information. Compared to other Transformer-based SR networks, MBTM achieves a balance between capturing global information and reducing the computational complexity of self-attention through its compact multi-branch structure. …”

Get full text

Article

Save to List

Saved in:
308

Distributed Photovoltaic Short-Term Power Prediction Based on Personalized Federated Multi-Task Learning by Wenxiang Luo, Yang Shen, Zewen Li, Fangming Deng

Published 2025-04-01
“…By improving the parallel pooling structure of a time series convolution network (TCN), an improved time series convolution network (iTCN) prediction model was established, and the channel attention mechanism CBAMANet was added to highlight the key meteorological characteristics’ information and improve the feature extraction ability of time series data in photovoltaic power prediction. …”

Get full text

Article

Save to List

Saved in:
309

Improved Asynchronous Federated Learning for Data Injection Pollution by Aiyou Li, Huoyou Li, Yanfang Liu, Guoli Ji

Published 2025-05-01
“…In our approach, the residual network is used to extract the static information of the image, the capsule network is used to extract the spatial dependence among the internal structures of the image, several layers of convolution are used to reduce the dimensions of both features, and the two extracted features are fused. …”

Get full text

Article

Save to List

Saved in:
310

Bearing fault diagnosis based on efficient cross space multiscale CNN transformer parallelism by Qi Chen, Feng Zhang, Yin Wang, Qing Yu, Genfeng Lang, Lixiong Zeng

Published 2025-04-01
“…Subsequently, parallel branches are employed to extract spatio-temporal features: the Convolutional Neural Network (CNN) branch integrates a multiscale feature extraction module, a Reversed Residual Structure (RRS), and an Efficient Multiscale Attention (EMA) mechanism to enhance local and global feature extraction capabilities; the Transformer branch combines Bidirectional Gated Recurrent Units (BiGRU) and Transformer to capture both local temporal dynamics and long-term dependencies. …”

Get full text

Article

Save to List

Saved in:
311

Infrared object detection for robot vision based on multiple focus diffusion and task interaction alignment by Jixu Zhang, Li Wang, Hung-Wei Li, Meng-Yen Hsieh, Shunxiang Zhang, Hua Wen, Meng Chen

Published 2025-07-01
“…The feature extraction module adopts a dual-stream fusion structure in the backbone network, which combines the local feature extraction of CNN with the global feature modeling of transformer. …”

Get full text

Article

Save to List

Saved in:
312

AfaMamba: Adaptive Feature Aggregation With Visual State Space Model for Remote Sensing Images Semantic Segmentation by Hongkun Chen, Huilan Luo, Chanjuan Wang

Published 2025-01-01
“…It employs a lightweight ResNet18 as the encoder, and during the decoding phase, it first utilizes a multiscale feature adaptive aggregation module to ensure that the output features from each stage of the encoder contain rich multiscale semantic information. Subsequently, the global-local Mamba structure combines the attention-optimized multiscale convolutional branches with the global branch of Mamba to facilitate effective interaction between global and local features. …”

Get full text

Article

Save to List

Saved in:
313

Vision Mamba and xLSTM-UNet for medical image segmentation by Xin Zhong, Gehao Lu, Hao Li

Published 2025-03-01
“…Abstract Deep learning-based medical image segmentation methods are generally divided into convolutional neural networks (CNNs) and Transformer-based models. …”

Get full text

Article

Save to List

Saved in:
314

YOLO-HVS: Infrared Small Target Detection Inspired by the Human Visual System by Xiaoge Wang, Yunlong Sheng, Qun Hao, Haiyuan Hou, Suzhen Nie

Published 2025-07-01
“…Meanwhile, the C2f_DWR (dilation-wise residual) module with regional-semantic dual residual structure is designed to significantly improve the efficiency of capturing multi-scale contextual information by expanding convolution and two-step feature extraction mechanism. …”

Get full text

Article

Save to List

Saved in:
315

Fine-Grained Extraction of Coastal Aquaculture Ponds From Remote Sensing Images Using an Edge-Supervised Multi-task Neural Network by Jian Qi, Min Ji, Fengxiang Jin, Jianran Xu, Hanyu Ji, Juan Wang

Published 2025-01-01
“…It notably enhances performance in complex environments and significantly boosts generalization capabilities by learning global structural features. First, a shared encoder–decoder architecture was constructed, leveraging large kernel depthwise separable convolution and residual optimization, thereby enhancing both local and global feature representations. …”

Get full text

Article

Save to List

Saved in:
316

A small object detection model in aerial images based on CPDD-YOLOv8 by Jingyang Wang, Jiayao Gao, Bo Zhang

Published 2025-01-01
“…Thirdly, a new DSC2f structure is proposed, which uses Dynamic Snake Convolution (DSConv) to take the place of the first standard Conv of Bottleneck in the C2f structure, so that the model can adapt to different inputs more effectively. …”

Get full text

Article

Save to List

Saved in:
317

A lightweight high-frequency mamba network for image super-resolution by Tao Wu, Wei Xu, Yajuan Wu

Published 2025-07-01
“…Various methods based on convolutional neural network (CNN) and Transformer structures have emerged, but few studies have mentioned how to combine these two parts of information. …”

Get full text

Article

Save to List

Saved in:
318

StomaYOLO: A Lightweight Maize Phenotypic Stomatal Cell Detector Based on Multi-Task Training by Ziqi Yang, Yiran Liao, Ziao Chen, Zhenzhen Lin, Wenyuan Huang, Yanxi Liu, Yuling Liu, Yamin Fan, Jie Xu, Lijia Xu, Jiong Mu

Published 2025-07-01
“…Maize (<i>Zea mays</i> L.), a vital global food crop, relies on its stomatal structure for regulating photosynthesis and responding to drought. …”

Get full text

Article

Save to List

Saved in:
319

ST-AGRNN: A Spatio-Temporal Attention-Gated Recurrent Neural Network for Traffic State Forecasting by Jian Yang, Jinhong Li, Lu Wei, Lei Gao, Fuqi Mao

Published 2022-01-01
“…In the proposed model, structure-based and location-based localized spatial features are obtained simultaneously by Graph Convolutional Networks (GCNs) and DeepWalk. …”

Get full text

Article

Save to List

Saved in:
320

Bitemporal Remote Sensing Change Detection With State-Space Models by Lukun Wang, Qihang Sun, Jiaming Pei, Muhammad Attique Khan, Maryam M. Al Dabel, Yasser D. Al-Otaibi, Ali Kashif Bashir

Published 2025-01-01
“…Change detection in very-high-resolution remote sensing images has gained significant attention, particularly with the rise of deep learning techniques such as convolutional neural networks and Transformers. The Mamba structure, successful in computer vision, has been applied to this domain, enhancing computational efficiency. …”

Get full text

Article

Save to List

Saved in:

[1]
Prev
11
12
13
14
15
16
17
18
19
20
21
Next
[25]

Dual-branch attention network-based stereoscopicvideo compression by TANG Shu, ZHAO Yu, YANG Shuli, XIE Xian-Zhong

A Dual-Stream Dental Panoramic X-Ray Image Segmentation Method Based on Transformer Heterogeneous Feature Complementation by Tian Ma, Jiahui Li, Zhenrui Dang, Yawen Li, Yuancheng Li

Fusion of Recurrence Plots and Gramian Angular Fields with Bayesian Optimization for Enhanced Time-Series Classification by Maria Mariani, Prince Appiah, Osei Tweneboah

Research on SeaTreasure Target Detection Technology Based on Improved YOLOv7-Tiny by Xiang Shi, Yunli Zhao, Jinrong Guo, Yan Liu, Yongqi Zhang

Identification of diabetic retinopathy lesions in fundus images by integrating CNN and vision mamba models. by Zenglei Liu, Ailian Gao, Hui Sheng, Xueling Wang

An Mcformer encoder integrating Mamba and Cgmlp for improved acoustic feature extraction by Nurmemet Yolwas, Yongchao Li, Lixu Sun, Jian Peng, Zhiwu Sun, Yajie Wei, Yineng Cai

Efficient Image Super-Resolution With Multi-Branch Mixer Transformer by Long Zhang, Yi Wan

Distributed Photovoltaic Short-Term Power Prediction Based on Personalized Federated Multi-Task Learning by Wenxiang Luo, Yang Shen, Zewen Li, Fangming Deng

Improved Asynchronous Federated Learning for Data Injection Pollution by Aiyou Li, Huoyou Li, Yanfang Liu, Guoli Ji

Bearing fault diagnosis based on efficient cross space multiscale CNN transformer parallelism by Qi Chen, Feng Zhang, Yin Wang, Qing Yu, Genfeng Lang, Lixiong Zeng

Infrared object detection for robot vision based on multiple focus diffusion and task interaction alignment by Jixu Zhang, Li Wang, Hung-Wei Li, Meng-Yen Hsieh, Shunxiang Zhang, Hua Wen, Meng Chen

AfaMamba: Adaptive Feature Aggregation With Visual State Space Model for Remote Sensing Images Semantic Segmentation by Hongkun Chen, Huilan Luo, Chanjuan Wang

Vision Mamba and xLSTM-UNet for medical image segmentation by Xin Zhong, Gehao Lu, Hao Li

YOLO-HVS: Infrared Small Target Detection Inspired by the Human Visual System by Xiaoge Wang, Yunlong Sheng, Qun Hao, Haiyuan Hou, Suzhen Nie

Fine-Grained Extraction of Coastal Aquaculture Ponds From Remote Sensing Images Using an Edge-Supervised Multi-task Neural Network by Jian Qi, Min Ji, Fengxiang Jin, Jianran Xu, Hanyu Ji, Juan Wang

A small object detection model in aerial images based on CPDD-YOLOv8 by Jingyang Wang, Jiayao Gao, Bo Zhang

A lightweight high-frequency mamba network for image super-resolution by Tao Wu, Wei Xu, Yajuan Wu

StomaYOLO: A Lightweight Maize Phenotypic Stomatal Cell Detector Based on Multi-Task Training by Ziqi Yang, Yiran Liao, Ziao Chen, Zhenzhen Lin, Wenyuan Huang, Yanxi Liu, Yuling Liu, Yamin Fan, Jie Xu, Lijia Xu, Jiong Mu

ST-AGRNN: A Spatio-Temporal Attention-Gated Recurrent Neural Network for Traffic State Forecasting by Jian Yang, Jinhong Li, Lu Wei, Lei Gao, Fuqi Mao

Bitemporal Remote Sensing Change Detection With State-Space Models by Lukun Wang, Qihang Sun, Jiaming Pei, Muhammad Attique Khan, Maryam M. Al Dabel, Yasser D. Al-Otaibi, Ali Kashif Bashir

Search Tools:

Refine Results

Institution

Format

Author

Language

Year of Publication