Search Results for author: Ruipu Luo

Found 3 papers, 2 papers with code

DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning

1 code implementation • 2 Apr 2024 • Mengfei Du, Binhao Wu, Jiwen Zhang, Zhihao Fan, Zejun Li, Ruipu Luo, Xuanjing Huang, Zhongyu Wei

For task completion, the agent needs to align and integrate various navigation modalities, including instruction, observation and navigation history.

Contrastive Learning, Decision Making, +2 more


Breaking Down the Task: A Unit-Grained Hybrid Training Framework for Vision and Language Decision Making

no code implementations • 16 Jul 2023 • Ruipu Luo, Jiwen Zhang, Zhongyu Wei

Vision language decision making (VLDM) is a challenging multimodal task.

Decision Making


Valley: Video Assistant with Large Language model Enhanced abilitY

1 code implementation • 12 Jun 2023 • Ruipu Luo, Ziwang Zhao, Min Yang, Junwei Dong, Da Li, Pengcheng Lu, Tao Wang, Linmei Hu, Minghui Qiu, Zhongyu Wei

Large language models (LLMs), with their remarkable conversational capabilities, have demonstrated impressive performance across various applications and have emerged as formidable AI assistants.

Action Recognition, Instruction Following, +4 more

