Showing 301 - 320 results of 481 for search '(structured OR (structures OR (structural OR structure))) global (convolution OR convolutional)', query time: 0.21s Refine Results
  1. 301

    Dual-branch attention network-based stereoscopicvideo compression by TANG Shu, ZHAO Yu, YANG Shuli, XIE Xian-Zhong

    Published 2025-01-01
    “…First, a Local and Global Encoder-decoder Block (LGEDB) based on Transformer and channel attention was proposed, which accurately captured non-repetitive texture details in local regions and global structural information by integrating pixel-level self-attention within each local area and global attention across channels. …”
    Get full text
    Article
  2. 302

    A Dual-Stream Dental Panoramic X-Ray Image Segmentation Method Based on Transformer Heterogeneous Feature Complementation by Tian Ma, Jiahui Li, Zhenrui Dang, Yawen Li, Yuancheng Li

    Published 2025-07-01
    “…Furthermore, a Pooling-Cooperative Convolutional Module was designed, which enhances the model’s capability in detail extraction and boundary localization through weighted centroid features of dental structures and a latent edge extraction module. …”
    Get full text
    Article
  3. 303

    Fusion of Recurrence Plots and Gramian Angular Fields with Bayesian Optimization for Enhanced Time-Series Classification by Maria Mariani, Prince Appiah, Osei Tweneboah

    Published 2025-07-01
    “…Time-series classification remains a critical task across various domains, demanding models that effectively capture both local recurrence structures and global temporal dependencies. We introduce a novel framework that transforms time series into image representations by fusing recurrence plots (RPs) with both Gramian Angular Summation Fields (GASFs) and Gramian Angular Difference Fields (GADFs). …”
    Get full text
    Article
  4. 304

    Research on SeaTreasure Target Detection Technology Based on Improved YOLOv7-Tiny by Xiang Shi, Yunli Zhao, Jinrong Guo, Yan Liu, Yongqi Zhang

    Published 2025-01-01
    “…First, based on the YOLOv7-Tiny network, the MAFPN neck structure is used to replace the ELAN structure to achieve the multi-scale capture of semantic information of underwater sea treasures, and to enhance the UPA-YOLO model to accurately locate the targets of underwater sea treasures; second, the P2ELAN module is constructed and added to the backbone network, which makes use of the redundancy information in the feature map and dynamically adjusts the convolution kernel to adapt to data The P2ELAN module is added to the backbone network, using the redundant information in the feature map, dynamically adjusting the convolutional kernel to adapt to the lack of data, reducing the number of parameters in the model, and introducing the MSCA attention mechanism to inhibit the complex and changeable background features underwater, to improve the semantic feature extraction ability of the UPA-YOLO model for underwater targets, adding the MPDiou loss function to the improved algorithm model and completing the data validation of the detection model; finally, based on the TensorRT acceleration framework, the optimisation of the target detection Finally, based on the TensorRT acceleration framework, the target detection model is optimised, and the Jetson Nano edge device is used to complete the localisation deployment and realise the real-time target detection task of underwater sea treasures. …”
    Get full text
    Article
  5. 305

    Identification of diabetic retinopathy lesions in fundus images by integrating CNN and vision mamba models. by Zenglei Liu, Ailian Gao, Hui Sheng, Xueling Wang

    Published 2025-01-01
    “…The majority of deep learning techniques developed for medical image analysis rely on convolutional modules to extract the inherent structure of images within a certain local receptive field. …”
    Get full text
    Article
  6. 306

    An Mcformer encoder integrating Mamba and Cgmlp for improved acoustic feature extraction by Nurmemet Yolwas, Yongchao Li, Lixu Sun, Jian Peng, Zhiwu Sun, Yajie Wei, Yineng Cai

    Published 2025-07-01
    “…To address this limitation, the Mcformer encoder is introduced, which incorporates the Mamba module in parallel with multi-head attention blocks to enhance the model’s global context processing capabilities. Additionally, a Convolutional Gated Multilayer Perceptron (Cgmlp) structure is employed to improve the extraction of local features through deep convolutional layers. …”
    Get full text
    Article
  7. 307

    Efficient Image Super-Resolution With Multi-Branch Mixer Transformer by Long Zhang, Yi Wan

    Published 2025-03-01
    “…To address these problems, we propose a Multi-Branch Token Mixer (MBTM) to extract richer global and local information. Compared to other Transformer-based SR networks, MBTM achieves a balance between capturing global information and reducing the computational complexity of self-attention through its compact multi-branch structure. …”
    Get full text
    Article
  8. 308

    Distributed Photovoltaic Short-Term Power Prediction Based on Personalized Federated Multi-Task Learning by Wenxiang Luo, Yang Shen, Zewen Li, Fangming Deng

    Published 2025-04-01
    “…By improving the parallel pooling structure of a time series convolution network (TCN), an improved time series convolution network (iTCN) prediction model was established, and the channel attention mechanism CBAMANet was added to highlight the key meteorological characteristics’ information and improve the feature extraction ability of time series data in photovoltaic power prediction. …”
    Get full text
    Article
  9. 309

    Improved Asynchronous Federated Learning for Data Injection Pollution by Aiyou Li, Huoyou Li, Yanfang Liu, Guoli Ji

    Published 2025-05-01
    “…In our approach, the residual network is used to extract the static information of the image, the capsule network is used to extract the spatial dependence among the internal structures of the image, several layers of convolution are used to reduce the dimensions of both features, and the two extracted features are fused. …”
    Get full text
    Article
  10. 310

    Bearing fault diagnosis based on efficient cross space multiscale CNN transformer parallelism by Qi Chen, Feng Zhang, Yin Wang, Qing Yu, Genfeng Lang, Lixiong Zeng

    Published 2025-04-01
    “…Subsequently, parallel branches are employed to extract spatio-temporal features: the Convolutional Neural Network (CNN) branch integrates a multiscale feature extraction module, a Reversed Residual Structure (RRS), and an Efficient Multiscale Attention (EMA) mechanism to enhance local and global feature extraction capabilities; the Transformer branch combines Bidirectional Gated Recurrent Units (BiGRU) and Transformer to capture both local temporal dynamics and long-term dependencies. …”
    Get full text
    Article
  11. 311

    Infrared object detection for robot vision based on multiple focus diffusion and task interaction alignment by Jixu Zhang, Li Wang, Hung-Wei Li, Meng-Yen Hsieh, Shunxiang Zhang, Hua Wen, Meng Chen

    Published 2025-07-01
    “…The feature extraction module adopts a dual-stream fusion structure in the backbone network, which combines the local feature extraction of CNN with the global feature modeling of transformer. …”
    Get full text
    Article
  12. 312

    AfaMamba: Adaptive Feature Aggregation With Visual State Space Model for Remote Sensing Images Semantic Segmentation by Hongkun Chen, Huilan Luo, Chanjuan Wang

    Published 2025-01-01
    “…It employs a lightweight ResNet18 as the encoder, and during the decoding phase, it first utilizes a multiscale feature adaptive aggregation module to ensure that the output features from each stage of the encoder contain rich multiscale semantic information. Subsequently, the global-local Mamba structure combines the attention-optimized multiscale convolutional branches with the global branch of Mamba to facilitate effective interaction between global and local features. …”
    Get full text
    Article
  13. 313

    Vision Mamba and xLSTM-UNet for medical image segmentation by Xin Zhong, Gehao Lu, Hao Li

    Published 2025-03-01
    “…Abstract Deep learning-based medical image segmentation methods are generally divided into convolutional neural networks (CNNs) and Transformer-based models. …”
    Get full text
    Article
  14. 314

    YOLO-HVS: Infrared Small Target Detection Inspired by the Human Visual System by Xiaoge Wang, Yunlong Sheng, Qun Hao, Haiyuan Hou, Suzhen Nie

    Published 2025-07-01
    “…Meanwhile, the C2f_DWR (dilation-wise residual) module with regional-semantic dual residual structure is designed to significantly improve the efficiency of capturing multi-scale contextual information by expanding convolution and two-step feature extraction mechanism. …”
    Get full text
    Article
  15. 315

    Fine-Grained Extraction of Coastal Aquaculture Ponds From Remote Sensing Images Using an Edge-Supervised Multi-task Neural Network by Jian Qi, Min Ji, Fengxiang Jin, Jianran Xu, Hanyu Ji, Juan Wang

    Published 2025-01-01
    “…It notably enhances performance in complex environments and significantly boosts generalization capabilities by learning global structural features. First, a shared encoder–decoder architecture was constructed, leveraging large kernel depthwise separable convolution and residual optimization, thereby enhancing both local and global feature representations. …”
    Get full text
    Article
  16. 316

    A small object detection model in aerial images based on CPDD-YOLOv8 by Jingyang Wang, Jiayao Gao, Bo Zhang

    Published 2025-01-01
    “…Thirdly, a new DSC2f structure is proposed, which uses Dynamic Snake Convolution (DSConv) to take the place of the first standard Conv of Bottleneck in the C2f structure, so that the model can adapt to different inputs more effectively. …”
    Get full text
    Article
  17. 317

    A lightweight high-frequency mamba network for image super-resolution by Tao Wu, Wei Xu, Yajuan Wu

    Published 2025-07-01
    “…Various methods based on convolutional neural network (CNN) and Transformer structures have emerged, but few studies have mentioned how to combine these two parts of information. …”
    Get full text
    Article
  18. 318

    StomaYOLO: A Lightweight Maize Phenotypic Stomatal Cell Detector Based on Multi-Task Training by Ziqi Yang, Yiran Liao, Ziao Chen, Zhenzhen Lin, Wenyuan Huang, Yanxi Liu, Yuling Liu, Yamin Fan, Jie Xu, Lijia Xu, Jiong Mu

    Published 2025-07-01
    “…Maize (<i>Zea mays</i> L.), a vital global food crop, relies on its stomatal structure for regulating photosynthesis and responding to drought. …”
    Get full text
    Article
  19. 319

    ST-AGRNN: A Spatio-Temporal Attention-Gated Recurrent Neural Network for Traffic State Forecasting by Jian Yang, Jinhong Li, Lu Wei, Lei Gao, Fuqi Mao

    Published 2022-01-01
    “…In the proposed model, structure-based and location-based localized spatial features are obtained simultaneously by Graph Convolutional Networks (GCNs) and DeepWalk. …”
    Get full text
    Article
  20. 320

    Bitemporal Remote Sensing Change Detection With State-Space Models by Lukun Wang, Qihang Sun, Jiaming Pei, Muhammad Attique Khan, Maryam M. Al Dabel, Yasser D. Al-Otaibi, Ali Kashif Bashir

    Published 2025-01-01
    “…Change detection in very-high-resolution remote sensing images has gained significant attention, particularly with the rise of deep learning techniques such as convolutional neural networks and Transformers. The Mamba structure, successful in computer vision, has been applied to this domain, enhancing computational efficiency. …”
    Get full text
    Article