TAMC: Textual Alignment and Masked Consistency for Open-Vocabulary 3D Scene Understanding

Three-dimensional (3D) Scene Understanding achieves environmental perception by extracting and analyzing point cloud data with wide applications including virtual reality, robotics, etc. Previous methods align the 2D image feature from a pre-trained CLIP model and the 3D point cloud feature for the...

Full description

Saved in:
Bibliographic Details
Main Authors: Juan Wang, Zhijie Wang, Tomo Miyazaki, Yaohou Fan, Shinichiro Omachi
Format: Article
Language:English
Published: MDPI AG 2024-09-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/24/19/6166
Tags: Add Tag
No Tags, Be the first to tag this record!