Leveraging two-dimensional pre-trained vision transformers for three-dimensional model generation via masked autoencoders
Abstract Although the Transformer architecture has established itself as the industry standard for jobs involving natural language processing, it still has few uses in computer vision. In vision, attention is used in conjunction with convolutional networks or to replace individual convolutional netw...
Saved in:
| Main Authors: | Muhammad Sajid, Kaleem Razzaq Malik, Ateeq Ur Rehman, Tauqeer Safdar Malik, Masoud Alajmi, Ali Haider Khan, Amir Haider, Seada Hussen |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-01-01
|
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-87376-y |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Local pattern aware 3D video swin transformer with masked autoencoding for realtime augmented reality gesture interaction
by: Suli Wang
Published: (2025-07-01) -
Unsupervised Insulator Defect Detection Method Based on Masked Autoencoder
by: Yanying Song, et al.
Published: (2025-07-01) -
A hybrid steganography framework using DCT and GAN for secure data communication in the big data era
by: Kaleem Razzaq Malik, et al.
Published: (2025-06-01) -
Three-Dimensional Instance Segmentation of Rooms in Indoor Building Point Clouds Using Mask3D
by: Michael Brunklaus, et al.
Published: (2025-03-01) -
Spatial–Temporal Heatmap Masked Autoencoder for Skeleton-Based Action Recognition
by: Cunling Bian, et al.
Published: (2025-05-01)