STFormer: Spatio‐temporal former for hand–object interaction recognition from egocentric RGB video
Abstract In recent years, video‐based hand–object interaction has received widespread attention from researchers. However, due to the complexity and occlusion of hand movements, hand–object interaction recognition based on RGB videos remains a highly challenging task. Here, an end‐to‐end spatio‐temp...
| Main Authors: | Jiao Liang, Xihan Wang, Jiayi Yang, Quanli Gao |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Wiley, 2024-09-01 |
| Series: | Electronics Letters |
| Subjects: | |
| Online Access: | https://doi.org/10.1049/ell2.70010 |
Similar Items
- A Unified Framework for Recognizing Dynamic Hand Actions and Estimating Hand Pose from First-Person RGB Videos
  by: Jiayi Yang, et al.
  Published: (2025-06-01)
- Benchmarking 2D Egocentric Hand Pose Datasets
  by: Olga Taran, et al.
  Published: (2025-01-01)
- Visibility Aware In-Hand Object Pose Tracking in Videos With Transformers
  by: Phan Xuan Tan, et al.
  Published: (2025-01-01)
- Learning spatio-temporal context for basketball action pose estimation with a multi-stream network
  by: Zhihao Zhang, et al.
  Published: (2025-08-01)
- Empowering Efficient Spatio-Temporal Learning with a 3D CNN for Pose-Based Action Recognition
  by: Ziliang Ren, et al.
  Published: (2024-11-01)