Search Results for author: Xiaohan Zhang

Found 38 papers, 13 papers with code

How Does the Experimental Setting Affect the Conclusions of Neural Encoding Models?

no code implementations LREC 2022 Xiaohan Zhang, Shaonan Wang, Chengqing Zong

Based on these results, we suggest a block-wise cross-validation training method and an adequate data size for increasing the performance of linear encoding models.

AG-NeRF: Attention-guided Neural Radiance Fields for Multi-height Large-scale Outdoor Scene Rendering

no code implementations18 Apr 2024 Jingfeng Guo, Xiaohan Zhang, Baozhu Zhao, Qi Liu

Existing neural radiance fields (NeRF)-based novel view synthesis methods for large-scale outdoor scenes are mainly built on a single altitude.

Novel View Synthesis

A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded Dialogue Generation

no code implementations4 Apr 2024 Jifan Yu, Xiaohan Zhang, Yifan Xu, Xuanyu Lei, Zijun Yao, Jing Zhang, Lei Hou, Juanzi Li

Recently, knowledge-grounded dialogue generation models, that intentionally invoke external knowledge resources to more informative responses, are also proven to be effective in reducing hallucination.

counterfactual Counterfactual Reasoning +2

AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent

1 code implementation4 Apr 2024 Hanyu Lai, Xiao Liu, Iat Long Iong, Shuntian Yao, Yuxuan Chen, Pengbo Shen, Hao Yu, Hanchen Zhang, Xiaohan Zhang, Yuxiao Dong, Jie Tang

Large language models (LLMs) have fueled many intelligent agent tasks, such as web navigation -- but most existing agents perform far from satisfying in real-world webpages due to three factors: (1) the versatility of actions on webpages, (2) HTML text exceeding model processing capacity, and (3) the complexity of decision-making due to the open-domain nature of web.

Decision Making Language Modelling +1

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

1 code implementation3 Apr 2024 Yifan Xu, Xiao Liu, Xinghan Liu, Zhenyu Hou, Yueyan Li, Xiaohan Zhang, Zihan Wang, Aohan Zeng, Zhengxiao Du, Wenyi Zhao, Jie Tang, Yuxiao Dong

Large language models (LLMs) have shown excellent mastering of human language, but still struggle in real-world applications that require mathematical problem-solving.

Math

ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback

no code implementations1 Apr 2024 Zhenyu Hou, Yilin Niu, Zhengxiao Du, Xiaohan Zhang, Xiao Liu, Aohan Zeng, Qinkai Zheng, Minlie Huang, Hongning Wang, Jie Tang, Yuxiao Dong

The work presents our practices of aligning LLMs with human preferences, offering insights into the challenges and solutions in RLHF implementations.

MapGuide: A Simple yet Effective Method to Reconstruct Continuous Language from Brain Activities

no code implementations26 Mar 2024 Xinpei Zhao, Jingyuan Sun, Shaonan Wang, Jing Ye, Xiaohan Zhang, Chengqing Zong

In contrast, we propose a simple yet effective method that guides text reconstruction by directly comparing them with the predicted text embeddings mapped from brain activities.

Text Generation

PointCore: Efficient Unsupervised Point Cloud Anomaly Detector Using Local-Global Features

1 code implementation4 Mar 2024 Baozhu Zhao, Qiwei Xiong, Xiaohan Zhang, Jingfeng Guo, Qi Liu, Xiaofen Xing, Xiangmin Xu

Three-dimensional point cloud anomaly detection that aims to detect anomaly data points from a training set serves as the foundation for a variety of applications, including industrial inspection and autonomous driving.

Anomaly Detection Autonomous Driving

MulCogBench: A Multi-modal Cognitive Benchmark Dataset for Evaluating Chinese and English Computational Language Models

no code implementations2 Mar 2024 Yunhao Zhang, Xiaohan Zhang, Chong Li, Shaonan Wang, Chengqing Zong

Results show that language models share significant similarities with human cognitive data and the similarity patterns are modulated by the data modality and stimuli complexity.

Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition

1 code implementation22 Dec 2023 Qianrui Zhou, Hua Xu, Hao Li, Hanlei Zhang, Xiaohan Zhang, Yifan Wang, Kai Gao

To establish an optimal multimodal semantic environment for text modality, we develop a modality-aware prompting module (MAP), which effectively aligns and fuses features from text, video and audio modalities with similarity-based modality alignment and cross-modality attention mechanism.

Contrastive Learning Multimodal Intent Recognition

Robust Tumor Segmentation with Hyperspectral Imaging and Graph Neural Networks

no code implementations20 Nov 2023 Mayar Lotfy, Anna Alperovich, Tommaso Giannantonio, Bjorn Barz, Xiaohan Zhang, Felix Holm, Nassir Navab, Felix Boehm, Carolin Schwamborn, Thomas K. Hoffmann, Patrick J. Schuler

Despite the limited dataset, the GNN-based model significantly outperforms context-agnostic approaches, accurately distinguishing between healthy and tumor tissues, even in images from previously unseen patients.

Tumor Segmentation

Tuning In to Neural Encoding: Linking Human Brain and Artificial Supervised Representations of Language

no code implementations5 Oct 2023 Jingyuan Sun, Xiaohan Zhang, Marie-Francine Moens

To understand the algorithm that supports the human brain's language representation, previous research has attempted to predict neural responses to linguistic stimuli using embeddings generated by artificial neural networks (ANNs), a process known as neural encoding.

Natural Language Understanding

GeoDTR+: Toward generic cross-view geolocalization via geometric disentanglement

no code implementations18 Aug 2023 Xiaohan Zhang, Xingyu Li, Waqas Sultani, Chen Chen, Safwan Wshah

We attribute this deficiency to the lack of ability to extract the geometric layout of visual features and models' overfitting to low-level details.

Attribute Disentanglement

Human Emotion Recognition Based On Galvanic Skin Response signal Feature Selection and SVM

no code implementations4 Jul 2023 Di Fan, Mingyang Liu, Xiaohan Zhang, Xiaopeng Gong

A novel human emotion recognition method based on automatically selected Galvanic Skin Response (GSR) signal features and SVM is proposed in this paper.

Emotion Recognition feature selection

Integrating Action Knowledge and LLMs for Task Planning and Situation Handling in Open Worlds

1 code implementation27 May 2023 Yan Ding, Xiaohan Zhang, Saeid Amiri, Nieqing Cao, Hao Yang, Andy Kaminski, Chad Esselink, Shiqi Zhang

Each situation corresponds to a state instance wherein a robot is potentially unable to complete a task using a solution that normally works.

World Knowledge

LLM+P: Empowering Large Language Models with Optimal Planning Proficiency

1 code implementation22 Apr 2023 Bo Liu, Yuqian Jiang, Xiaohan Zhang, Qiang Liu, Shiqi Zhang, Joydeep Biswas, Peter Stone

LLM+P takes in a natural language description of a planning problem, then returns a correct (or optimal) plan for solving that problem in natural language.

Zero-shot Generalization

GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation

1 code implementation28 Feb 2023 Jing Zhang, Xiaokang Zhang, Daniel Zhang-li, Jifan Yu, Zijun Yao, Zeyao Ma, Yiqi Xu, Haohua Wang, Xiaohan Zhang, Nianyi Lin, Sunrui Lu, Juanzi Li, Jie Tang

We present GLM-Dialog, a large-scale language model (LLM) with 10B parameters capable of knowledge-grounded conversation in Chinese using a search engine to access the Internet knowledge.

Dialogue Evaluation Dialogue Generation +2

Time-sensitive Learning for Heterogeneous Federated Edge Intelligence

no code implementations26 Jan 2023 Yong Xiao, Xiaohan Zhang, Guangming Shi, Marwan Krunz, Diep N. Nguyen, Dinh Thai Hoang

A joint optimization algorithm is proposed to minimize the overall time consumption of model training by selecting participating edge servers, local epoch number.

Decision Making Edge-computing +1

Cross-view Geo-localization via Learning Disentangled Geometric Layout Correspondence

1 code implementation8 Dec 2022 Xiaohan Zhang, Xingyu Li, Waqas Sultani, Yi Zhou, Safwan Wshah

We attribute this deficiency to the lack of ability to extract the spatial configuration of visual feature layouts and models' overfitting on low-level details from the training set.

Attribute counterfactual

Cross-View Image Sequence Geo-localization

1 code implementation25 Oct 2022 Xiaohan Zhang, Waqas Sultani, Safwan Wshah

In this paper, we present the first cross-view geo-localization method that works on a sequence of limited FOV images.

Robotic Table Wiping via Reinforcement Learning and Whole-body Trajectory Optimization

no code implementations19 Oct 2022 Thomas Lew, Sumeet Singh, Mario Prats, Jeffrey Bingham, Jonathan Weisz, Benjie Holson, Xiaohan Zhang, Vikas Sindhwani, Yao Lu, Fei Xia, Peng Xu, Tingnan Zhang, Jie Tan, Montserrat Gonzalez

This problem is challenging, as it requires planning wiping actions while reasoning over uncertain latent dynamics of crumbs and spills captured via high-dimensional visual observations.

reinforcement-learning Reinforcement Learning (RL)

Revisiting Optimal Convergence Rate for Smooth and Non-convex Stochastic Decentralized Optimization

no code implementations14 Oct 2022 Kun Yuan, Xinmeng Huang, Yiming Chen, Xiaohan Zhang, Yingya Zhang, Pan Pan

While (Lu and Sa, 2021) have recently provided an optimal rate for non-convex stochastic decentralized optimization with weight matrices defined over linear graphs, the optimal rate with general weight matrices remains unclear.

Robot Task Planning and Situation Handling in Open Worlds

no code implementations4 Oct 2022 Yan Ding, Xiaohan Zhang, Saeid Amiri, Nieqing Cao, Hao Yang, Chad Esselink, Shiqi Zhang

This paper introduces a novel algorithm (COWP) for open-world task planning and situation handling that dynamically augments the robot's action knowledge with task-oriented common sense.

Common Sense Reasoning Robot Task Planning +1

COUCH: Towards Controllable Human-Chair Interactions

no code implementations1 May 2022 Xiaohan Zhang, Bharat Lal Bhatnagar, Vladimir Guzov, Sebastian Starke, Gerard Pons-Moll

In this work, we study the problem of synthesizing scene interactions conditioned on different contact positions on the object.

Human-Object Interaction Detection Object

Deep Class Incremental Learning from Decentralized Data

no code implementations11 Mar 2022 Xiaohan Zhang, Songlin Dong, Jinjie Chen, Qi Tian, Yihong Gong, Xiaopeng Hong

In this paper, we focus on a new and challenging decentralized machine learning paradigm in which there are continuous inflows of data to be addressed and the data are stored in multiple repositories.

Class Incremental Learning Incremental Learning +1

Visual and Object Geo-localization: A Comprehensive Survey

no code implementations30 Dec 2021 Daniel Wilson, Xiaohan Zhang, Waqas Sultani, Safwan Wshah

The concept of geo-localization refers to the process of determining where on earth some `entity' is located, typically using Global Positioning System (GPS) coordinates.

3D Reconstruction Object

Actor-Critic Algorithm for High-dimensional PDEs

no code implementations NeurIPS Workshop DLDE 2021 Xiaohan Zhang

Our model advances the state-of-the-art machine learning PDE solvers in a few aspects: 1) the trainable parameters are reduced by $N$ times, where $N$ is the number of steps to discretize the PDE in time, 2) the model convergence rate is an order of magnitude faster, 3) our model has fewer tuning hyperparameters.

BIG-bench Machine Learning Vocal Bursts Intensity Prediction

Object Tracking and Geo-localization from Street Images

no code implementations13 Jul 2021 Daniel Wilson, Thayer Alshaabi, Colin Van Oort, Xiaohan Zhang, Jonathan Nelson, Safwan Wshah

Geo-localizing static objects from street images is challenging but also very important for road asset mapping and autonomous driving.

Autonomous Driving Object +1

Episodic memory governs choices: An RNN-based reinforcement learning model for decision-making task

no code implementations24 Jan 2021 Xiaohan Zhang, Lu Liu, Guodong Long, Jing Jiang, Shenquan Liu

Typical methods to study cognitive function are to record the electrical activities of animal neurons during the training of animals performing behavioral tasks.

Decision Making Hippocampus +3

Efficient Golf Ball Detection and Tracking Based on Convolutional Neural Networks and Kalman Filter

1 code implementation17 Dec 2020 Tianxiao Zhang, Xiaohan Zhang, Yiju Yang, Zongbo Wang, Guanghui Wang

The detection is performed on small image patches instead of the entire image to increase the performance of small ball detection.

Object object-detection +1

Actor-Critic Algorithm for High-dimensional Partial Differential Equations

no code implementations7 Oct 2020 Xiaohan Zhang

We develop a deep learning model to effectively solve high-dimensional nonlinear parabolic partial differential equations (PDE).

reinforcement-learning Reinforcement Learning (RL) +1

Scale Calibrated Training: Improving Generalization of Deep Networks via Scale-Specific Normalization

no code implementations31 Aug 2019 Zhuoran Yu, Aojun Zhou, Yukun Ma, Yudian Li, Xiaohan Zhang, Ping Luo

Experiment results show that SCT improves accuracy of single Resnet-50 on ImageNet by 1. 7% and 11. 5% accuracy when testing on image sizes of 224 and 128 respectively.

Data Augmentation Image Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.