Navigating to Objects Specfied by Images
This Image Goal Navigation task requires reasoning over the relation of objects in the scene (e.g., disambiguating between instances of similar appearance) and exploring efficiently to discover where the goal is (e.g., entering bedrooms while searching for the bed).
sub-tasks
- exploration
- goal instance re-identification
- goal localization
- local navigation
Related Work
- Image Goal Navigation(ImageNav) exits ambiguous image goals (e.g., captures of nondescript walls) and is detached from potential user applications
- instance-based ImageNav task(InstanceImageNav)
- goal images depict an object instance
- goal images are independent of agent embodiment
- limitations of end-to-end methods
- high sample complexity
- overfitting
- poor sim-to-real transfer
- skills relating to visual scene understanding, semantic exploration, and long-term memory tend to be difficult to learn end-to-end