Search Results for author: Yiqun Duan

Found 14 papers, 7 papers with code

Prompting Multi-Modal Tokens to Enhance End-to-End Autonomous Driving Imitation Learning with LLMs

no code implementations7 Apr 2024 Yiqun Duan, Qiang Zhang, Renjing Xu

The utilization of Large Language Models (LLMs) within the realm of reinforcement learning, particularly as planners, has garnered a significant degree of attention in recent scholarly literature.

Autonomous Driving Imitation Learning

Decode Neural signal as Speech

2 code implementations4 Mar 2024 Yiqian Yang, Yiqun Duan, Qiang Zhang, Renjing Xu, Hui Xiong

In this paper, we explore the brain-to-text translation of MEG signals in a speech-decoding formation.

Brain Computer Interface EEG

OpenStereo: A Comprehensive Benchmark for Stereo Matching and Strong Baseline

1 code implementation1 Dec 2023 Xianda Guo, Juntao Lu, Chenming Zhang, Yiqi Wang, Yiqun Duan, Tian Yang, Zheng Zhu, Long Chen

Based on OpenStereo, we conducted experiments and have achieved or surpassed the performance metrics reported in the original paper.

Autonomous Driving Autonomous Navigation +1

BELT:Bootstrapping Electroencephalography-to-Language Decoding and Zero-Shot Sentiment Classification by Natural Language Supervision

no code implementations21 Sep 2023 Jinzhao Zhou, Yiqun Duan, Yu-Cheng Chang, Yu-Kai Wang, Chin-Teng Lin

The proposed BELT method is a generic and efficient framework that bootstraps EEG representation learning using off-the-shelf large-scale pretrained language models (LMs).

Brain Decoding Contrastive Learning +8

CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks

2 code implementations12 Apr 2023 Yi Li, Hualiang Wang, Yiqun Duan, Xiaomeng Li

Contrastive Language-Image Pre-training (CLIP) is a powerful multimodal large vision model that has demonstrated significant benefits for downstream tasks, including many zero-shot learning and text-guided vision tasks.

Interactive Segmentation Open Vocabulary Semantic Segmentation +4

DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation

1 code implementation9 Mar 2023 Yiqun Duan, Xianda Guo, Zheng Zhu

We propose DiffusionDepth, a new approach that reformulates monocular depth estimation as a denoising diffusion process.

Denoising Monocular Depth Estimation

Generalizing Multimodal Variational Methods to Sets

no code implementations19 Dec 2022 Jinzhao Zhou, Yiqun Duan, Zhihong Chen, Yu-Cheng Chang, Chin-Teng Lin

Making sense of multiple modalities can yield a more comprehensive description of real-world phenomena.

Cross Task Neural Architecture Search for EEG Signal Classifications

1 code implementation1 Oct 2022 Yiqun Duan, Zhen Wang, Yi Li, Jianhang Tang, Yu-Kai Wang, Chin-Teng Lin

Recently, various neural network approaches have been proposed to improve the accuracy of EEG signal recognition.

EEG Emotion Recognition +2

Exploring Visual Interpretability for Contrastive Language-Image Pre-training

1 code implementation15 Sep 2022 Yi Li, Hualiang Wang, Yiqun Duan, Hang Xu, Xiaomeng Li

For this problem, we propose the Explainable Contrastive Language-Image Pre-training (ECLIP), which corrects the explainability via the Masked Max Pooling.

Retrieval text similarity

Continual Learning With Lifelong Vision Transformer

no code implementations CVPR 2022 Zhen Wang, Liu Liu, Yiqun Duan, Yajing Kong, DaCheng Tao

Continual learning methods aim at training a neural network from sequential data with streaming labels, relieving catastrophic forgetting.

Continual Learning

Progressive Open-Domain Response Generation with Multiple Controllable Attributes

no code implementations7 Jun 2021 Haiqin Yang, Xiaoyuan Yao, Yiqun Duan, Jianping Shen, Jie Zhong, Kun Zhang

More specifically, PHED deploys Conditional Variational AutoEncoder (CVAE) on Transformer to include one aspect of attributes at one stage.

Attribute Response Generation

Semantic Inference Network for Few-shot Streaming Label Learning

no code implementations1 Jan 2021 Zhen Wang, Liu Liu, Yiqun Duan, DaCheng Tao

In this work, we formulate and study few-shot streaming label learning (FSLL), which models emerging new labels with only a few annotated examples by utilizing the knowledge learned from past labels.

Meta-Learning Multi-Label Classification

Learning Internal Dense But External Sparse Structures of Deep Neural Network

no code implementations ICLR 2019 Yiqun Duan

In this paper, we bridge these two by proposing a new network structure with locally dense yet externally sparse connections.

Cannot find the paper you are looking for? You can Submit a new open access paper.