-
1
Audio-visual event localization with dual temporal-aware scene understanding and image-text knowledge bridging
Published 2024-11-01Subjects: “…Audio-visual event localization…”
Get full text
Article -
2
Embedding-based pair generation for contrastive representation learning in audio-visual surveillance data
Published 2025-01-01Subjects: Get full text
Article