Search Results for author: Hsiang-Ting Chen

Found 2 papers, 1 paper with code

WebVLN: Vision-and-Language Navigation on Websites

1 code implementation • 25 Dec 2023 • Qi Chen, Dileepa Pitawela, Chongyang Zhao, Gengze Zhou, Hsiang-Ting Chen, Qi Wu

The Vision-and-Language Navigation (VLN) task aims to enable AI agents to accurately understand and follow natural language instructions to navigate through real-world environments, ultimately reaching specific target locations.

Navigate, Vision and Language Navigation

Segment Beyond View: Handling Partially Missing Modality for Audio-Visual Semantic Segmentation

no code implementations • 14 Dec 2023 • Renjie Wu, Hu Wang, Feras Dayoub, Hsiang-Ting Chen

The model consists of a vision teacher utilising panoramic information, an auditory teacher with 8-channel audio, and an audio-visual student that takes views with limited field of view (FoV) and binaural audio as input and produces semantic segmentation for objects outside the FoV.

Segmentation, Semantic Segmentation

