Search Results for author: Yaxin Peng

Found 12 papers, 3 papers with code

Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language Models

1 code implementation • 10 Mar 2024 • Minjie Zhu, Yichen Zhu, Xin Liu, Ning Liu, Zhiyuan Xu, Chaomin Shen, Yaxin Peng, Zhicai Ou, Feifei Feng, Jian Tang

Multimodal Large Language Models (MLLMs) have showcased impressive skills in tasks related to visual understanding and reasoning.

Ranked #69 on Visual Question Answering on MM-Vet

Visual Question Answering

Language-Conditioned Robotic Manipulation with Fast and Slow Thinking

no code implementations • 8 Jan 2024 • Minjie Zhu, Yichen Zhu, Jinming Li, Junjie Wen, Zhiyuan Xu, Zhengping Che, Chaomin Shen, Yaxin Peng, Dong Liu, Feifei Feng, Jian Tang

Language-conditioned robotic manipulation aims to translate natural language instructions into executable actions, ranging from simple pick-and-place to tasks requiring intent recognition and visual reasoning.

Decision Making, Intent Recognition, +2

Object-Centric Instruction Augmentation for Robotic Manipulation

no code implementations • 5 Jan 2024 • Junjie Wen, Yichen Zhu, Minjie Zhu, Jinming Li, Zhiyuan Xu, Zhengping Che, Chaomin Shen, Yaxin Peng, Dong Liu, Feifei Feng, Jian Tang

Humans interpret scenes by recognizing both the identities and positions of objects in their observations.

Language Modelling, Large Language Model, +1

Exploring Gradient Explosion in Generative Adversarial Imitation Learning: A Probabilistic Perspective

no code implementations • 18 Dec 2023 • Wanying Wang, Yichen Zhu, Yirui Zhou, Chaomin Shen, Jian Tang, Zhiyuan Xu, Yaxin Peng, Yangchun Zhang

Generative Adversarial Imitation Learning (GAIL) stands as a cornerstone approach in imitation learning.

Imitation Learning

PMNN: Physical Model-driven Neural Network for solving time-fractional differential equations

no code implementations • 7 Oct 2023 • Zhiying Ma, Jie Hou, Wenhao Zhu, Yaxin Peng, Ying Li

PMNN establishes a temporal iteration scheme based on physical model-driven neural networks, which effectively combines deep neural networks (DNNs) with interpolation approximation of fractional derivatives.

Recognizable Information Bottleneck

1 code implementation • 28 Apr 2023 • Yilin Lyu, Xin Liu, Mingyang Song, Xinyue Wang, Yaxin Peng, Tieyong Zeng, Liping Jing

The recent PAC-Bayes IB uses information complexity instead of information compression to establish a connection with the mutual information generalization bound.

CP$^3$: Channel Pruning Plug-in for Point-based Networks

no code implementations • 23 Mar 2023 • Yaomin Huang, Ning Liu, Zhengping Che, Zhiyuan Xu, Chaomin Shen, Yaxin Peng, Guixu Zhang, Xinmei Liu, Feifei Feng, Jian Tang

CP$^3$ is elaborately designed to leverage the characteristics of point clouds and PNNs in order to enable 2D channel pruning methods for PNNs.

CP3: Channel Pruning Plug-In for Point-Based Networks

no code implementations • CVPR 2023 • Yaomin Huang, Ning Liu, Zhengping Che, Zhiyuan Xu, Chaomin Shen, Yaxin Peng, Guixu Zhang, Xinmei Liu, Feifei Feng, Jian Tang

Directly applying 2D CNN channel pruning methods to PNNs undermines the performance of PNNs because of the different representations of 2D images and 3D point clouds as well as the disparity in network architecture.

Label-Guided Auxiliary Training Improves 3D Object Detector

1 code implementation • 24 Jul 2022 • Yaomin Huang, Xinmei Liu, Yichen Zhu, Zhiyuan Xu, Chaomin Shen, Zhengping Che, Guixu Zhang, Yaxin Peng, Feifei Feng, Jian Tang

Detecting 3D objects from point clouds is a practical yet challenging task that has attracted increasing attention recently.

3D Object Detection, Object, +1

Hybrid Atlas Building with Deep Registration Priors

no code implementations • 13 Dec 2021 • Nian Wu, Jian Wang, Miaomiao Zhang, Guixu Zhang, Yaxin Peng, Chaomin Shen

Registration-based atlas building often poses computational challenges in high-dimensional image spaces.

Defending Against Adversarial Attacks by Suppressing the Largest Eigenvalue of Fisher Information Matrix

no code implementations • 13 Sep 2019 • Chaomin Shen, Yaxin Peng, Guixu Zhang, Jinsong Fan

We propose a scheme for defending against adversarial attacks by suppressing the largest eigenvalue of the Fisher information matrix (FIM).

Adversarial Defense, Traffic Sign Recognition

The Adversarial Attack and Detection under the Fisher Information Metric

no code implementations • 9 Oct 2018 • Chenxiao Zhao, P. Thomas Fletcher, Mixue Yu, Yaxin Peng, Guixu Zhang, Chaomin Shen

By considering the data space as a non-linear space with the Fisher information metric induced from a neural network, we first propose an adversarial attack algorithm termed one-step spectral attack (OSSA).

Adversarial Attack

