Search Results for author: Hsiang-Ting Chen

Found 2 papers, 1 papers with code

WebVLN: Vision-and-Language Navigation on Websites

1 code implementation25 Dec 2023 Qi Chen, Dileepa Pitawela, Chongyang Zhao, Gengze Zhou, Hsiang-Ting Chen, Qi Wu

Vision-and-Language Navigation (VLN) task aims to enable AI agents to accurately understand and follow natural language instructions to navigate through real-world environments, ultimately reaching specific target locations.

Navigate Vision and Language Navigation

Segment Beyond View: Handling Partially Missing Modality for Audio-Visual Semantic Segmentation

no code implementations14 Dec 2023 Renjie Wu, Hu Wang, Feras Dayoub, Hsiang-Ting Chen

The model consists of a vision teacher utilising panoramic information, an auditory teacher with 8-channel audio, and an audio-visual student that takes views with limited FoV and binaural audio as input and produce semantic segmentation for objects outside FoV.

Segmentation Semantic Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.