CQS-Attention: Scaling Up the Standard Attention Computation for Infinitely Long Sequences

Transformer models suffer from prohibitively high memory consumption when sequences are long and standard self-attention is used. We developed a sequence parallelism scheme called CQS-Attention that breaks the limit on sequence length. A long sequence is divided into multiple overlapping subsequences.
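The abstract describes the core mechanism: partitioning a long sequence into overlapping subsequences so that each worker only ever materializes an attention score matrix of bounded size. As a rough illustration, below is a minimal NumPy sketch of that overlapping-chunk idea. It is not the paper's CQS-Attention algorithm (the exact overlap width and the scheme for recombining subsequence outputs into exact standard attention are defined in the paper); the chunk and overlap parameters and all function names here are assumptions made for this example.

    import numpy as np

    def softmax(x, axis=-1):
        # Numerically stable softmax.
        x = x - x.max(axis=axis, keepdims=True)
        e = np.exp(x)
        return e / e.sum(axis=axis, keepdims=True)

    def attention(q, k, v):
        # Standard scaled dot-product attention on one subsequence.
        d = q.shape[-1]
        scores = q @ k.T / np.sqrt(d)  # (m, w) score matrix
        return softmax(scores) @ v

    def chunked_attention(q, k, v, chunk=256, overlap=64):
        # Illustrative overlapping-chunk attention: each chunk of queries
        # attends only to keys/values in its window plus an `overlap`-wide
        # halo on either side, so the largest score matrix any worker holds
        # is chunk x (chunk + 2*overlap), independent of total length.
        # (This is a local-window approximation of full attention, not the
        # exact recombination scheme from the CQS-Attention paper.)
        n = q.shape[0]
        out = np.empty_like(v)
        for start in range(0, n, chunk):
            end = min(start + chunk, n)
            lo = max(0, start - overlap)   # left halo
            hi = min(n, end + overlap)     # right halo
            out[start:end] = attention(q[start:end], k[lo:hi], v[lo:hi])
        return out

    # Example: a 4096-token sequence with 64-dimensional heads.
    rng = np.random.default_rng(0)
    x = rng.standard_normal((4096, 64))
    y = chunked_attention(x, x, x)
    print(y.shape)  # (4096, 64)

With these illustrative parameters, an interior chunk computes a 256 x 384 score matrix rather than the 4096 x 4096 matrix full attention would require, which is the memory-scaling point the abstract makes.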

Bibliographic Details
Main Authors: Yiming Bian, Arun K. Somani
Format: Article
Language: English
Published: IEEE 2025-01-01
Series:IEEE Access
Online Access: https://ieeexplore.ieee.org/document/10900388/