Search Results for author: Haoxuan Qu

Found 10 papers, 1 papers with code

LLMs are Good Action Recognizers

no code implementations • 31 Mar 2024 • Haoxuan Qu, Yujun Cai, Jun Liu

Motivated by this, we propose a novel LLM-AR framework, in which we investigate treating the Large Language Model as an Action Recognizer.

Action Recognition Language Modelling +3

Paper
Add Code

GPT-Connect: Interaction between Text-Driven Human Motion Generator and 3D Scenes in a Training-free Manner

no code implementations • 22 Mar 2024 • Haoxuan Qu, Ziyan Guo, Jun Liu

Recently, while text-driven human motion generation has received massive research attention, most existing text-driven motion generators are generally only designed to generate motion sequences in a blank background.

Paper
Add Code

Enhancing Human-Centered Dynamic Scene Understanding via Multiple LLMs Collaborated Reasoning

no code implementations • 15 Mar 2024 • Hang Zhang, Wenxiao Zhang, Haoxuan Qu, Jun Liu

Human-centered dynamic scene understanding plays a pivotal role in enhancing the capability of robotic and autonomous systems, in which Video-based Human-Object Interaction (V-HOI) detection is a crucial task in semantic scene understanding, aimed at comprehensively understanding HOI relationships within a video to benefit the behavioral decisions of mobile robots and autonomous driving systems.

Autonomous Driving Human-Object Interaction Detection +2

Paper
Add Code

6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation

no code implementations • 29 Dec 2023 • Li Xu, Haoxuan Qu, Yujun Cai, Jun Liu

Estimating the 6D object pose from a single RGB image often involves noise and indeterminacy due to challenges such as occlusions and cluttered backgrounds.

6D Pose Estimation using RGB Denoising +1

Paper
Add Code

LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition

1 code implementation • NeurIPS 2023 • Haoxuan Qu, Xiaofei Hui, Yujun Cai, Jun Liu

Open-set object recognition aims to identify if an object is from a class that has been encountered during training or not.

Object Object Recognition

Paper
Code

A Characteristic Function-Based Method for Bottom-Up Human Pose Estimation

no code implementations • CVPR 2023 • Haoxuan Qu, Yujun Cai, Lin Geng Foo, Ajay Kumar, Jun Liu

Therefore, via minimizing the distance between the two characteristic functions, we can optimize the model to provide a more accurate localization result for the body joints in different sub-regions of the predicted heatmap.

Pose Estimation

Paper
Add Code

Improving the Reliability for Confidence Estimation

no code implementations • 13 Oct 2022 • Haoxuan Qu, Yanchao Li, Lin Geng Foo, Jason Kuen, Jiuxiang Gu, Jun Liu

Confidence estimation, a task that aims to evaluate the trustworthiness of the model's prediction output during deployment, has received lots of research attention recently, due to its importance for the safe deployment of deep models.

Image Classification Meta-Learning +1

Paper
Add Code

Heatmap Distribution Matching for Human Pose Estimation

no code implementations • 3 Oct 2022 • Haoxuan Qu, Li Xu, Yujun Cai, Lin Geng Foo, Jun Liu

In this paper, we show that optimizing the heatmap prediction in such a way, the model performance of body joint localization, which is the intrinsic objective of this task, may not be consistently improved during the optimization process of the heatmap prediction.

2D Human Pose Estimation Pose Estimation

Paper
Add Code

Meta Spatio-Temporal Debiasing for Video Scene Graph Generation

no code implementations • 23 Jul 2022 • Li Xu, Haoxuan Qu, Jason Kuen, Jiuxiang Gu, Jun Liu

Video scene graph generation (VidSGG) aims to parse the video content into scene graphs, which involves modeling the spatio-temporal contextual information in the video.

Graph Generation Meta-Learning +2

Paper
Add Code

Recent Advances of Continual Learning in Computer Vision: An Overview

no code implementations • 23 Sep 2021 • Haoxuan Qu, Hossein Rahmani, Li Xu, Bryan Williams, Jun Liu

In contrast to batch learning where all training data is available at once, continual learning represents a family of methods that accumulate knowledge and learn continuously with data available in sequential order.

Continual Learning Knowledge Distillation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.