Yixuan Pan's Blog
Home
About
Tags
Categories
Archives
0%
Multimodal
Tag
2023
08-07
BEVBert: Multimodal Map Pre-training for Language-guided Navigation
08-07
Scaling Data Generation in Vision-and-Language Navigation
08-07
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
07-31
Visual Language Maps for Robot Navigation
07-29
ConceptFusion: Open-set Multimodal 3D Mapping
07-29
CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory
07-25
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts
07-24
PROTO-CLIP: Vision-Language Prototypical Network for Few-Shot Learning
07-22
Generative Prompt Model for Weakly Supervised Object Localization
07-20
HomeRobot: Open-Vocabulary Mobile Manipulation
1
2
3
4