Multiple Hierarchical Cross-Scale Transformer for Remote Sensing Scene Classification

The Transformer model can capture global contextual information but does not have an inherent inductive bias. In contrast, convolutional neural networks (CNNs) are highly praised in computer vision due to their strong inductive bias and local spatial correlation. To combine the advantages of the two...

Full description

Saved in:
Bibliographic Details
Main Authors: Dan Zhang, Wenping Ma, Licheng Jiao, Xu Liu, Yuting Yang, Fang Liu
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/17/1/42
Tags: Add Tag
No Tags, Be the first to tag this record!