Search Results for author: Sixun Dong

Found 4 papers, 4 papers with code

MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning

2 code implementations19 Jan 2024 Chenyu Wang, Weixin Luo, Qianyu Chen, Haonan Mai, Jindi Guo, Sixun Dong, Xiaohua, Xuan, Zhengxin Li, Lin Ma, Shenghua Gao

Recently, the astonishing performance of large language models (LLMs) in natural language comprehension and generation tasks triggered lots of exploration of using them as central controllers to build agent systems.

Language Modelling Large Language Model

RoomDesigner: Encoding Anchor-latents for Style-consistent and Shape-compatible Indoor Scene Generation

1 code implementation16 Oct 2023 Yiqun Zhao, Zibo Zhao, Jing Li, Sixun Dong, Shenghua Gao

Indoor scene generation aims at creating shape-compatible, style-consistent furniture arrangements within a spatially reasonable layout.

Quantization Scene Generation

Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos

1 code implementation CVPR 2023 Sixun Dong, Huazhang Hu, Dongze Lian, Weixin Luo, Yicheng Qian, Shenghua Gao

Sequential video understanding, as an emerging video understanding task, has driven lots of researchers' attention because of its goal-oriented nature.

Representation Learning Sentence +1

TransRAC: Encoding Multi-scale Temporal Correlation with Transformers for Repetitive Action Counting

1 code implementation CVPR 2022 Huazhang Hu, Sixun Dong, Yiqun Zhao, Dongze Lian, Zhengxin Li, Shenghua Gao

Existing methods focus on performing repetitive action counting in short videos, which is tough for dealing with longer videos in more realistic scenarios.

Repetitive Action Counting

Cannot find the paper you are looking for? You can Submit a new open access paper.