TAMC: Textual Alignment and Masked Consistency for Open-Vocabulary 3D Scene Understanding

Three-dimensional (3D) scene understanding achieves environmental perception by extracting and analyzing point cloud data, with wide applications including virtual reality and robotics. Previous methods align 2D image features from a pre-trained CLIP model with 3D point cloud features for the...

Bibliographic Details
Main Authors:	Juan Wang, Zhijie Wang, Tomo Miyazaki, Yaohou Fan, Shinichiro Omachi
Format:	Article
Language:	English
Published:	MDPI AG 2024-09-01
Series:	Sensors
Subjects:	open vocabulary; 3D Scene Understanding; multi-modal learning; contrastive learning; Masked Consistency; Textual Alignment
Online Access:	https://www.mdpi.com/1424-8220/24/19/6166
