IMViT: Adjacency Matrix-Based Lightweight Plain Vision Transformer
Transformers are becoming the dominant deep learning backbone for both computer vision and natural language processing. While extensive experiments prove their outstanding ability in large models, transformers with small sizes are not comparable with convolutional neural networks in various downstream t...
Saved in:
Main Authors: Qihao Chen, Yunfeng Yan, Xianbo Wang, Jishen Peng
Format: Article
Language: English
Published: IEEE, 2025-01-01
Series: IEEE Access
Online Access: https://ieeexplore.ieee.org/document/10849548/
Similar Items
- ViT-DualAtt: An efficient pornographic image classification method based on Vision Transformer with dual attention
  by: Zengyu Cai, et al.
  Published: (2024-12-01)
- Mirror Target YOLO: An Improved YOLOv8 Method With Indirect Vision for Heritage Buildings Fire Detection
  by: Jian Liang, et al.
  Published: (2025-01-01)
- Leveraging two-dimensional pre-trained vision transformers for three-dimensional model generation via masked autoencoders
  by: Muhammad Sajid, et al.
  Published: (2025-01-01)
- Transforming Alzheimer’s Disease Diagnosis: Implementing Vision Transformer (ViT) for MRI Images Classification
  by: Dian Kurniasari, et al.
  Published: (2025-01-01)
- Squeeze-and-Excitation Vision Transformer for Lung Nodule Classification
  by: Xiaozhong Xue, et al.
  Published: (2025-01-01)