Search Results for author: Xiaohan Zhang

Found 38 papers, 13 papers with code

How Does the Experimental Setting Affect the Conclusions of Neural Encoding Models?

no code implementations • LREC 2022 • Xiaohan Zhang, Shaonan Wang, Chengqing Zong

Based on these results, we suggest a block-wise cross-validation training method and an adequate data size for increasing the performance of linear encoding models.

AG-NeRF: Attention-guided Neural Radiance Fields for Multi-height Large-scale Outdoor Scene Rendering

no code implementations • 18 Apr 2024 • Jingfeng Guo, Xiaohan Zhang, Baozhu Zhao, Qi Liu

Existing neural radiance fields (NeRF)-based novel view synthesis methods for large-scale outdoor scenes are mainly built on a single altitude.

Novel View Synthesis

A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded Dialogue Generation

no code implementations • 4 Apr 2024 • Jifan Yu, Xiaohan Zhang, Yifan Xu, Xuanyu Lei, Zijun Yao, Jing Zhang, Lei Hou, Juanzi Li

Recently, knowledge-grounded dialogue generation models, which intentionally invoke external knowledge resources to produce more informative responses, have also proven effective in reducing hallucination.

counterfactual Counterfactual Reasoning +2

AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent

1 code implementation • 4 Apr 2024 • Hanyu Lai, Xiao Liu, Iat Long Iong, Shuntian Yao, Yuxuan Chen, Pengbo Shen, Hao Yu, Hanchen Zhang, Xiaohan Zhang, Yuxiao Dong, Jie Tang

Large language models (LLMs) have fueled many intelligent agent tasks, such as web navigation, but most existing agents perform far from satisfactorily on real-world webpages due to three factors: (1) the versatility of actions on webpages, (2) HTML text exceeding model processing capacity, and (3) the complexity of decision-making due to the open-domain nature of the web.

Decision Making Language Modelling +1

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

1 code implementation • 3 Apr 2024 • Yifan Xu, Xiao Liu, Xinghan Liu, Zhenyu Hou, Yueyan Li, Xiaohan Zhang, Zihan Wang, Aohan Zeng, Zhengxiao Du, Wenyi Zhao, Jie Tang, Yuxiao Dong

Large language models (LLMs) have shown excellent mastery of human language, but still struggle in real-world applications that require mathematical problem-solving.

Math

ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback

no code implementations • 1 Apr 2024 • Zhenyu Hou, Yilin Niu, Zhengxiao Du, Xiaohan Zhang, Xiao Liu, Aohan Zeng, Qinkai Zheng, Minlie Huang, Hongning Wang, Jie Tang, Yuxiao Dong

The work presents our practices of aligning LLMs with human preferences, offering insights into the challenges and solutions in RLHF implementations.

MapGuide: A Simple yet Effective Method to Reconstruct Continuous Language from Brain Activities

no code implementations • 26 Mar 2024 • Xinpei Zhao, Jingyuan Sun, Shaonan Wang, Jing Ye, Xiaohan Zhang, Chengqing Zong

In contrast, we propose a simple yet effective method that guides text reconstruction by directly comparing them with the predicted text embeddings mapped from brain activities.

Text Generation

FORCE: Dataset and Method for Intuitive Physics Guided Human-object Interaction

no code implementations • 17 Mar 2024 • Xiaohan Zhang, Bharat Lal Bhatnagar, Sebastian Starke, Ilya Petrov, Vladimir Guzov, Helisa Dhamo, Eduardo Pérez-Pellitero, Gerard Pons-Moll

Our key insight is that human motion is dictated by the interrelation between the force exerted by the human and the perceived resistance.

Friction Human-Object Interaction Detection +1

PointCore: Efficient Unsupervised Point Cloud Anomaly Detector Using Local-Global Features

1 code implementation • 4 Mar 2024 • Baozhu Zhao, Qiwei Xiong, Xiaohan Zhang, Jingfeng Guo, Qi Liu, Xiaofen Xing, Xiangmin Xu

Three-dimensional point cloud anomaly detection that aims to detect anomaly data points from a training set serves as the foundation for a variety of applications, including industrial inspection and autonomous driving.

Anomaly Detection Autonomous Driving

MulCogBench: A Multi-modal Cognitive Benchmark Dataset for Evaluating Chinese and English Computational Language Models

no code implementations • 2 Mar 2024 • Yunhao Zhang, Xiaohan Zhang, Chong Li, Shaonan Wang, Chengqing Zong

Results show that language models share significant similarities with human cognitive data and the similarity patterns are modulated by the data modality and stimuli complexity.

Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition

1 code implementation • 22 Dec 2023 • Qianrui Zhou, Hua Xu, Hao Li, Hanlei Zhang, Xiaohan Zhang, Yifan Wang, Kai Gao

To establish an optimal multimodal semantic environment for text modality, we develop a modality-aware prompting module (MAP), which effectively aligns and fuses features from text, video and audio modalities with similarity-based modality alignment and cross-modality attention mechanism.

Ranked #2 on Multimodal Intent Recognition on MIntRec

Contrastive Learning Multimodal Intent Recognition

AlignBench: Benchmarking Chinese Alignment of Large Language Models

1 code implementation • 30 Nov 2023 • Xiao Liu, Xuanyu Lei, Shengyuan Wang, Yue Huang, Zhuoer Feng, Bosi Wen, Jiale Cheng, Pei Ke, Yifan Xu, Weng Lam Tam, Xiaohan Zhang, Lichao Sun, Hongning Wang, Jing Zhang, Minlie Huang, Yuxiao Dong, Jie Tang

We will provide public APIs for evaluating AlignBench with CritiqueLLM to facilitate the evaluation of LLMs' Chinese alignment.

Benchmarking

CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models

1 code implementation • 28 Nov 2023 • Jinfeng Zhou, Zhuang Chen, Dazhen Wan, Bosi Wen, Yi Song, Jifan Yu, Yongkang Huang, Libiao Peng, Jiaming Yang, Xiyao Xiao, Sahand Sabour, Xiaohan Zhang, Wenjing Hou, Yijia Zhang, Yuxiao Dong, Jie Tang, Minlie Huang

In this paper, we present CharacterGLM, a series of models built upon ChatGLM, with model sizes ranging from 6B to 66B parameters.

Dialogue Generation

Robust Tumor Segmentation with Hyperspectral Imaging and Graph Neural Networks

no code implementations • 20 Nov 2023 • Mayar Lotfy, Anna Alperovich, Tommaso Giannantonio, Bjorn Barz, Xiaohan Zhang, Felix Holm, Nassir Navab, Felix Boehm, Carolin Schwamborn, Thomas K. Hoffmann, Patrick J. Schuler

Despite the limited dataset, the GNN-based model significantly outperforms context-agnostic approaches, accurately distinguishing between healthy and tumor tissues, even in images from previously unseen patients.

Tumor Segmentation

Tuning In to Neural Encoding: Linking Human Brain and Artificial Supervised Representations of Language

no code implementations • 5 Oct 2023 • Jingyuan Sun, Xiaohan Zhang, Marie-Francine Moens

To understand the algorithm that supports the human brain's language representation, previous research has attempted to predict neural responses to linguistic stimuli using embeddings generated by artificial neural networks (ANNs), a process known as neural encoding.

Natural Language Understanding

GeoDTR+: Toward generic cross-view geolocalization via geometric disentanglement

no code implementations • 18 Aug 2023 • Xiaohan Zhang, Xingyu Li, Waqas Sultani, Chen Chen, Safwan Wshah

We attribute this deficiency to the lack of ability to extract the geometric layout of visual features and models' overfitting to low-level details.

Attribute Disentanglement

Human Emotion Recognition Based On Galvanic Skin Response signal Feature Selection and SVM

no code implementations • 4 Jul 2023 • Di Fan, Mingyang Liu, Xiaohan Zhang, Xiaopeng Gong

A novel human emotion recognition method based on automatically selected Galvanic Skin Response (GSR) signal features and SVM is proposed in this paper.

Emotion Recognition feature selection

KoLA: Carefully Benchmarking World Knowledge of Large Language Models

1 code implementation • 15 Jun 2023 • Jifan Yu, Xiaozhi Wang, Shangqing Tu, Shulin Cao, Daniel Zhang-li, Xin Lv, Hao Peng, Zijun Yao, Xiaohan Zhang, Hanming Li, Chunyang Li, Zheyuan Zhang, Yushi Bai, Yantao Liu, Amy Xin, Nianyi Lin, Kaifeng Yun, Linlu Gong, Jianhui Chen, Zhili Wu, Yunjia Qi, Weikai Li, Yong Guan, Kaisheng Zeng, Ji Qi, Hailong Jin, Jinxin Liu, Yu Gu, Yuan YAO, Ning Ding, Lei Hou, Zhiyuan Liu, Bin Xu, Jie Tang, Juanzi Li

The unprecedented performance of large language models (LLMs) necessitates improvements in evaluations.

Benchmarking Hallucination +1

Integrating Action Knowledge and LLMs for Task Planning and Situation Handling in Open Worlds

1 code implementation • 27 May 2023 • Yan Ding, Xiaohan Zhang, Saeid Amiri, Nieqing Cao, Hao Yang, Andy Kaminski, Chad Esselink, Shiqi Zhang

Each situation corresponds to a state instance wherein a robot is potentially unable to complete a task using a solution that normally works.

World Knowledge

LLM+P: Empowering Large Language Models with Optimal Planning Proficiency

1 code implementation • 22 Apr 2023 • Bo Liu, Yuqian Jiang, Xiaohan Zhang, Qiang Liu, Shiqi Zhang, Joydeep Biswas, Peter Stone

LLM+P takes in a natural language description of a planning problem, then returns a correct (or optimal) plan for solving that problem in natural language.

Zero-shot Generalization

Spatial-Language Attention Policies for Efficient Robot Learning

no code implementations • 21 Apr 2023 • Priyam Parashar, Vidhi Jain, Xiaohan Zhang, Jay Vakil, Sam Powers, Yonatan Bisk, Chris Paxton

We see a 4x improvement over the baseline in the mobile manipulation setting.

Decision Making Language Modelling +1

GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation

1 code implementation • 28 Feb 2023 • Jing Zhang, Xiaokang Zhang, Daniel Zhang-li, Jifan Yu, Zijun Yao, Zeyao Ma, Yiqi Xu, Haohua Wang, Xiaohan Zhang, Nianyi Lin, Sunrui Lu, Juanzi Li, Jie Tang

We present GLM-Dialog, a large-scale language model (LLM) with 10B parameters capable of knowledge-grounded conversation in Chinese, using a search engine to access Internet knowledge.

Dialogue Evaluation Dialogue Generation +2

Intra-operative Brain Tumor Detection with Deep Learning-Optimized Hyperspectral Imaging

no code implementations • 6 Feb 2023 • Tommaso Giannantonio, Anna Alperovich, Piercosimo Semeraro, Manfredo Atzori, Xiaohan Zhang, Christoph Hauger, Alexander Freytag, Siri Luthman, Roeland Vandebriel, Murali Jayapala, Lien Solie, Steven de Vleeschouwer

Surgery for gliomas (intrinsic brain tumors), especially when low-grade, is challenging due to the infiltrative nature of the lesion.

Time-sensitive Learning for Heterogeneous Federated Edge Intelligence

no code implementations • 26 Jan 2023 • Yong Xiao, Xiaohan Zhang, Guangming Shi, Marwan Krunz, Diep N. Nguyen, Dinh Thai Hoang

A joint optimization algorithm is proposed to minimize the overall time consumption of model training by selecting participating edge servers, local epoch number.

Decision Making Edge-computing +1

Cross-view Geo-localization via Learning Disentangled Geometric Layout Correspondence

1 code implementation • 8 Dec 2022 • Xiaohan Zhang, Xingyu Li, Waqas Sultani, Yi Zhou, Safwan Wshah

We attribute this deficiency to the lack of ability to extract the spatial configuration of visual feature layouts and models' overfitting on low-level details from the training set.

Attribute counterfactual

Cross-View Image Sequence Geo-localization

1 code implementation • 25 Oct 2022 • Xiaohan Zhang, Waqas Sultani, Safwan Wshah

In this paper, we present the first cross-view geo-localization method that works on a sequence of limited FOV images.

Robotic Table Wiping via Reinforcement Learning and Whole-body Trajectory Optimization

no code implementations • 19 Oct 2022 • Thomas Lew, Sumeet Singh, Mario Prats, Jeffrey Bingham, Jonathan Weisz, Benjie Holson, Xiaohan Zhang, Vikas Sindhwani, Yao Lu, Fei Xia, Peng Xu, Tingnan Zhang, Jie Tan, Montserrat Gonzalez

This problem is challenging, as it requires planning wiping actions while reasoning over uncertain latent dynamics of crumbs and spills captured via high-dimensional visual observations.

reinforcement-learning Reinforcement Learning (RL)

Revisiting Optimal Convergence Rate for Smooth and Non-convex Stochastic Decentralized Optimization

no code implementations • 14 Oct 2022 • Kun Yuan, Xinmeng Huang, Yiming Chen, Xiaohan Zhang, Yingya Zhang, Pan Pan

While (Lu and Sa, 2021) have recently provided an optimal rate for non-convex stochastic decentralized optimization with weight matrices defined over linear graphs, the optimal rate with general weight matrices remains unclear.

Robot Task Planning and Situation Handling in Open Worlds

no code implementations • 4 Oct 2022 • Yan Ding, Xiaohan Zhang, Saeid Amiri, Nieqing Cao, Hao Yang, Chad Esselink, Shiqi Zhang

This paper introduces a novel algorithm (COWP) for open-world task planning and situation handling that dynamically augments the robot's action knowledge with task-oriented common sense.

Common Sense Reasoning Robot Task Planning +1

COUCH: Towards Controllable Human-Chair Interactions

no code implementations • 1 May 2022 • Xiaohan Zhang, Bharat Lal Bhatnagar, Vladimir Guzov, Sebastian Starke, Gerard Pons-Moll

In this work, we study the problem of synthesizing scene interactions conditioned on different contact positions on the object.

Human-Object Interaction Detection Object

Deep Class Incremental Learning from Decentralized Data

no code implementations • 11 Mar 2022 • Xiaohan Zhang, Songlin Dong, Jinjie Chen, Qi Tian, Yihong Gong, Xiaopeng Hong

In this paper, we focus on a new and challenging decentralized machine learning paradigm in which there are continuous inflows of data to be addressed and the data are stored in multiple repositories.

Class Incremental Learning Incremental Learning +1

Visual and Object Geo-localization: A Comprehensive Survey

no code implementations • 30 Dec 2021 • Daniel Wilson, Xiaohan Zhang, Waqas Sultani, Safwan Wshah

The concept of geo-localization refers to the process of determining where on earth some 'entity' is located, typically using Global Positioning System (GPS) coordinates.

3D Reconstruction Object

Actor-Critic Algorithm for High-dimensional PDEs

no code implementations • NeurIPS Workshop DLDE 2021 • Xiaohan Zhang

Our model advances the state-of-the-art machine learning PDE solvers in a few aspects: 1) the trainable parameters are reduced by $N$ times, where $N$ is the number of steps to discretize the PDE in time, 2) the model convergence rate is an order of magnitude faster, 3) our model has fewer tuning hyperparameters.

BIG-bench Machine Learning Vocal Bursts Intensity Prediction

Object Tracking and Geo-localization from Street Images

no code implementations • 13 Jul 2021 • Daniel Wilson, Thayer Alshaabi, Colin Van Oort, Xiaohan Zhang, Jonathan Nelson, Safwan Wshah

Geo-localizing static objects from street images is challenging but also very important for road asset mapping and autonomous driving.

Autonomous Driving Object +1

Episodic memory governs choices: An RNN-based reinforcement learning model for decision-making task

no code implementations • 24 Jan 2021 • Xiaohan Zhang, Lu Liu, Guodong Long, Jing Jiang, Shenquan Liu

Typical methods to study cognitive function are to record the electrical activities of animal neurons during the training of animals performing behavioral tasks.

Decision Making Hippocampus +3

Efficient Golf Ball Detection and Tracking Based on Convolutional Neural Networks and Kalman Filter

1 code implementation • 17 Dec 2020 • Tianxiao Zhang, Xiaohan Zhang, Yiju Yang, Zongbo Wang, Guanghui Wang

The detection is performed on small image patches instead of the entire image to increase the performance of small ball detection.

Object object-detection +1

Actor-Critic Algorithm for High-dimensional Partial Differential Equations

no code implementations • 7 Oct 2020 • Xiaohan Zhang

We develop a deep learning model to effectively solve high-dimensional nonlinear parabolic partial differential equations (PDE).

reinforcement-learning Reinforcement Learning (RL) +1

Scale Calibrated Training: Improving Generalization of Deep Networks via Scale-Specific Normalization

no code implementations • 31 Aug 2019 • Zhuoran Yu, Aojun Zhou, Yukun Ma, Yudian Li, Xiaohan Zhang, Ping Luo

Experimental results show that SCT improves the accuracy of a single ResNet-50 on ImageNet by 1.7% and 11.5% when testing on image sizes of 224 and 128, respectively.

Data Augmentation Image Classification +1
