Auditory Learning vs Visual Learning

Augmented Hierarchical Scene Prior Learning with Context-based Scene Completion Network for Visual Semantic Navigation

Abstract: The Visual Semantic Navigation(VSN) requires the agent to navigate to a target object of specified category in a previously unseen scene. To tackle this task, the agent must learn a nimble ...

IEEE

DETECLAP: Enhancing Audio-Visual Representation Learning with Object Information

Abstract: Current audio-visual representation learning can capture rough object categories (e.g., "animals" and "instruments"), but it lacks the ability to recognize fine-grained details, such as ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Augmented Hierarchical Scene Prior Learning with Context-based Scene Completion Network for Visual Semantic Navigation

DETECLAP: Enhancing Audio-Visual Representation Learning with Object Information

Trending now