Search Results for author: Zihan Zhang

Found 55 papers, 16 papers with code

AI WALKUP: A Computer-Vision Approach to Quantifying MDS-UPDRS in Parkinson's Disease

no code implementations • 2 Apr 2024 • Xiang Xiang, Zihan Zhang, Jing Ma, Yao Deng

Parkinson's Disease (PD) is the second most common neurodegenerative disorder.

Paper
Add Code

Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning

no code implementations • 29 Mar 2024 • Qinhao Zhou, Zihan Zhang, Xiang Xiang, Ke Wang, Yuchuan Wu, Yongbin Li

As intelligent agents, LLMs need to have the capabilities of task planning, long-term memory, and the ability to leverage external tools to achieve satisfactory performance.

Hallucination

Paper
Add Code

Horizon-Free Regret for Linear Markov Decision Processes

no code implementations • 15 Mar 2024 • Zihan Zhang, Jason D. Lee, Yuxin Chen, Simon S. Du

A recent line of works showed regret bounds in reinforcement learning (RL) can be (nearly) independent of planning horizon, a. k. a.~the horizon-free bounds.

LEMMA Reinforcement Learning (RL)

Paper
Add Code

ResLoRA: Identity Residual Mapping in Low-Rank Adaption

1 code implementation • 28 Feb 2024 • Shuhua Shi, Shaohan Huang, Minghui Song, Zhoujun Li, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

As one of the most popular parameter-efficient fine-tuning (PEFT) methods, low-rank adaptation (LoRA) is commonly applied to fine-tune large language models (LLMs).

3,180

Paper
Code

RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering

1 code implementation • 26 Feb 2024 • Zihan Zhang, Meng Fang, Ling Chen

Based on our findings, we propose Time-Aware Adaptive Retrieval (TA-ARE), a simple yet effective method that helps LLMs assess the necessity of retrieval without calibration or additional training.

Open-Domain Question Answering Retrieval

Paper
Code

HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition

no code implementations • 24 Feb 2024 • Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

Large language models (LLMs) have emerged as a promising alternative to expensive human evaluations.

Language Modelling Large Language Model

Paper
Add Code

We Choose to Go to Space: Agent-driven Human and Multi-Robot Collaboration in Microgravity

no code implementations • 22 Feb 2024 • Miao Xin, Zhongrui You, Zihan Zhang, Taoran Jiang, Tingjia Xu, Haotian Liang, Guojing Ge, Yuchen Ji, Shentong Mo, Jian Cheng

We present SpaceAgents-1, a system for learning human and multi-robot collaboration (HMRC) strategies under microgravity conditions.

Decision Making

Paper
Add Code

Text Diffusion with Reinforced Conditioning

no code implementations • 19 Feb 2024 • Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

Diffusion models have demonstrated exceptional capability in generating high-quality images, videos, and audio.

Paper
Add Code

Improving Domain Adaptation through Extended-Text Reading Comprehension

1 code implementation • 14 Jan 2024 • Ting Jiang, Shaohan Huang, Shengyue Luo, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang, Deqing Wang, Fuzhen Zhuang

To enhance the domain-specific capabilities of large language models, continued pre-training on a domain-specific corpus is a prevalent method.

Clustering Domain Adaptation +1

3,180

Paper
Code

BS-PLCNet: Band-split Packet Loss Concealment Network with Multi-task Learning Framework and Multi-discriminators

no code implementations • 8 Jan 2024 • Zihan Zhang, Jiayao Sun, Xianjun Xia, Chuanzeng Huang, Yijian Xiao, Lei Xie

Packet loss is a common and unavoidable problem in voice over internet phone (VoIP) systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

SELM: Speech Enhancement Using Discrete Tokens and Language Models

no code implementations • 15 Dec 2023 • Ziqian Wang, Xinfa Zhu, Zihan Zhang, YuanJun Lv, Ning Jiang, Guoqing Zhao, Lei Xie

Given the intrinsic similarity between speech generation and speech enhancement, harnessing semantic information holds potential advantages for speech enhancement tasks.

Self-Supervised Learning Speech Enhancement

Paper
Add Code

Optimal Multi-Distribution Learning

no code implementations • 8 Dec 2023 • Zihan Zhang, Wenhao Zhan, Yuxin Chen, Simon S. Du, Jason D. Lee

Focusing on a hypothesis class of Vapnik-Chervonenkis (VC) dimension $d$, we propose a novel algorithm that yields an $varepsilon$-optimal randomized hypothesis with a sample complexity on the order of $(d+k)/\varepsilon^2$ (modulo some logarithmic factor), matching the best-known lower bound.

Fairness

Paper
Add Code

OSM vs HD Maps: Map Representations for Trajectory Prediction

no code implementations • 4 Nov 2023 • Jing-Yan Liao, Parth Doshi, Zihan Zhang, David Paz, Henrik Christensen

While High Definition (HD) Maps have long been favored for their precise depictions of static road elements, their accessibility constraints and susceptibility to rapid environmental changes impede the widespread deployment of autonomous driving, especially in the motion forecasting task.

Motion Forecasting Trajectory Prediction

Paper
Add Code

Turn-Level Active Learning for Dialogue State Tracking

1 code implementation • 23 Oct 2023 • Zihan Zhang, Meng Fang, Fanghua Ye, Ling Chen, Mohammad-Reza Namazi-Rad

Dialogue state tracking (DST) plays an important role in task-oriented dialogue systems.

Active Learning Dialogue State Tracking +1

Paper
Code

CITB: A Benchmark for Continual Instruction Tuning

1 code implementation • 23 Oct 2023 • Zihan Zhang, Meng Fang, Ling Chen, Mohammad-Reza Namazi-Rad

In this work, we establish a CIT benchmark consisting of learning and evaluation protocols.

Continual Learning

Paper
Code

Democratizing Reasoning Ability: Tailored Learning from Large Language Model

1 code implementation • 20 Oct 2023 • Zhaoyang Wang, Shaohan Huang, Yuxuan Liu, Jiahai Wang, Minghui Song, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

In this paper, we propose a tailored learning approach to distill such reasoning ability to smaller LMs to facilitate the democratization of the exclusive reasoning ability.

Instruction Following Language Modelling +1

Paper
Code

Auto Search Indexer for End-to-End Document Retrieval

no code implementations • 19 Oct 2023 • Tianchi Yang, Minghui Song, Zihan Zhang, Haizhen Huang, Weiwei Deng, Feng Sun, Qi Zhang

Generative retrieval, which is a new advanced paradigm for document retrieval, has recently attracted research interests, since it encodes all documents into the model and directly generates the retrieved documents.

Retrieval

Paper
Add Code

How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances

1 code implementation • 11 Oct 2023 • Zihan Zhang, Meng Fang, Ling Chen, Mohammad-Reza Namazi-Rad, Jun Wang

Although large language models (LLMs) are impressive in solving various tasks, they can quickly be outdated after deployment.

World Knowledge

110

Paper
Code

An Exploration of Task-decoupling on Two-stage Neural Post Filter for Real-time Personalized Acoustic Echo Cancellation

no code implementations • 7 Oct 2023 • Zihan Zhang, Jiayao Sun, Xianjun Xia, Ziqian Wang, Xiaopeng Yan, Yijian Xiao, Lei Xie

Utilization of speaker representation has extended the frontier of AEC, thus attracting many researchers' interest in personalized acoustic echo cancellation (PAEC).

Acoustic echo cancellation Speech Enhancement

Paper
Add Code

Calibrating LLM-Based Evaluator

no code implementations • 23 Sep 2023 • Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

Recent advancements in large language models (LLMs) on language modeling and emergent capabilities make them a promising reference-free evaluator of natural language generation quality, and a competent alternative to human evaluation.

In-Context Learning Language Modelling +1

Paper
Add Code

Universal scaling relation and criticality in metabolism and growth of Escherichia coli

no code implementations • 9 Aug 2023 • Shaohua Guan, Zhichao Zhang, Zihan Zhang, Hualin Shi

The metabolic network plays a crucial role in regulating bacterial metabolism and growth, but it is subject to inherent molecular stochasticity.

Relation

Paper
Add Code

Classification with Deep Neural Networks and Logistic Loss

no code implementations • 31 Jul 2023 • Zihan Zhang, Lei Shi, Ding-Xuan Zhou

In this paper, we aim to fill this gap by establishing a novel and elegant oracle-type inequality, which enables us to deal with the boundedness restriction of the target function, and using it to derive sharp convergence rates for fully connected ReLU DNN classifiers trained with logistic loss.

Binary Classification Classification +1

Paper
Add Code

TEDi: Temporally-Entangled Diffusion for Long-Term Motion Synthesis

no code implementations • 27 Jul 2023 • Zihan Zhang, Richard Liu, Kfir Aberman, Rana Hanocka

The gradual nature of a diffusion process that synthesizes samples in small increments constitutes a key ingredient of Denoising Diffusion Probabilistic Models (DDPM), which have presented unprecedented quality in image synthesis and been recently explored in the motion domain.

Denoising Image Generation +1

Paper
Add Code

Settling the Sample Complexity of Online Reinforcement Learning

no code implementations • 25 Jul 2023 • Zihan Zhang, Yuxin Chen, Jason D. Lee, Simon S. Du

While a number of recent works achieved asymptotically minimal regret in online RL, the optimality of these results is only guaranteed in a ``large-sample'' regime, imposing enormous burn-in cost in order for their algorithms to operate optimally.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Sharper Model-free Reinforcement Learning for Average-reward Markov Decision Processes

no code implementations • 28 Jun 2023 • Zihan Zhang, Qiaomin Xie

In the online setting, we propose model-free RL algorithms based on reference-advantage decomposition.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Dual-Alignment Pre-training for Cross-lingual Sentence Embedding

1 code implementation • 16 May 2023 • Ziheng Li, Shaohan Huang, Zihan Zhang, Zhi-Hong Deng, Qiang Lou, Haizhen Huang, Jian Jiao, Furu Wei, Weiwei Deng, Qi Zhang

Recent studies have shown that dual encoder models trained with the sentence-level translation ranking task are effective methods for cross-lingual sentence embedding.

Language Modelling Sentence +3

Paper
Code

Pre-training Language Model as a Multi-perspective Course Learner

no code implementations • 6 May 2023 • Beiduo Chen, Shaohan Huang, Zihan Zhang, Wu Guo, ZhenHua Ling, Haizhen Huang, Furu Wei, Weiwei Deng, Qi Zhang

Besides, two self-correction courses are proposed to bridge the chasm between the two encoders by creating a "correction notebook" for secondary-supervision.

Language Modelling Masked Language Modeling

Paper
Add Code

Two-stage Neural Network for ICASSP 2023 Speech Signal Improvement Challenge

no code implementations • 14 Mar 2023 • Mingshuai Liu, Shubo Lv, Zihan Zhang, Runduo Han, Xiang Hao, Xianjun Xia, Li Chen, Yijian Xiao, Lei Xie

Achieving 0. 446 in the final score and 0. 517 in the P. 835 score, our system ranks 4th in the non-real-time track.

Vocal Bursts Valence Prediction

Paper
Add Code

Two-step Band-split Neural Network Approach for Full-band Residual Echo Suppression

no code implementations • 13 Mar 2023 • Zihan Zhang, Shimin Zhang, Mingshuai Liu, Yanhong Leng, Zhe Han, Li Chen, Lei Xie

This paper describes a Two-step Band-split Neural Network (TBNN) approach for full-band acoustic echo cancellation.

Acoustic echo cancellation

Paper
Add Code

Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments

no code implementations • 31 Jan 2023 • Runlong Zhou, Zihan Zhang, Simon S. Du

We further initiate the study on model-free algorithms with variance-dependent regret bounds by designing a reference-function-based algorithm with a novel capped-doubling reference update schedule.

Paper
Add Code

Decoupling MaxLogit for Out-of-Distribution Detection

no code implementations • CVPR 2023 • Zihan Zhang, Xiang Xiang

We demonstrate the effectiveness of our logit-based OOD detection methods on CIFAR-10, CIFAR-100 and ImageNet and establish state-of-the-art performance.

Ranked #13 on Out-of-Distribution Detection on ImageNet-1k vs Curated OODs (avg.)

Out-of-Distribution Detection

Paper
Add Code

RoChBert: Towards Robust BERT Fine-tuning for Chinese

1 code implementation • 28 Oct 2022 • Zihan Zhang, Jinfeng Li, Ning Shi, Bo Yuan, Xiangyu Liu, Rong Zhang, Hui Xue, Donghong Sun, Chao Zhang

Despite of the superb performance on a wide range of tasks, pre-trained language models (e. g., BERT) have been proved vulnerable to adversarial texts.

Data Augmentation Language Modelling

Paper
Code

Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning

no code implementations • 15 Oct 2022 • Zihan Zhang, Yuhang Jiang, Yuan Zhou, Xiangyang Ji

Meanwhile, we show that to achieve $\tilde{O}(\mathrm{poly}(S, A, H)\sqrt{K})$ regret, the number of batches is at least $\Omega\left(H/\log_A(K)+ \log_2\log_2(K) \right)$, which matches our upper bound up to logarithmic terms.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Predicting Blossom Date of Cherry Tree With Support Vector Machine and Recurrent Neural Network

1 code implementation • 10 Oct 2022 • Hongyi Zheng, Yanyu Chen, Zihan Zhang

Our project probes the relationship between temperatures and the blossom date of cherry trees.

Paper
Code

Hybrid Supervised and Reinforcement Learning for the Design and Optimization of Nanophotonic Structures

no code implementations • 8 Sep 2022 • Christopher Yeung, Benjamin Pham, Zihan Zhang, Katherine T. Fountaine, Aaswath P. Raman

From higher computational efficiency to enabling the discovery of novel and complex structures, deep learning has emerged as a powerful framework for the design and optimization of nanophotonic circuits and components.

Computational Efficiency reinforcement-learning +1

Paper
Add Code

A Pilot Study of Relating MYCN-Gene Amplification with Neuroblastoma-Patient CT Scans

no code implementations • 21 May 2022 • Zihan Zhang, Xiang Xiang, Xuehua Peng, Jianbo Shao

Neuroblastoma is one of the most common cancers in infants, and the initial diagnosis of this disease is difficult.

Paper
Add Code

GANimator: Neural Motion Synthesis from a Single Sequence

1 code implementation • 5 May 2022 • Peizhuo Li, Kfir Aberman, Zihan Zhang, Rana Hanocka, Olga Sorkine-Hornung

We present GANimator, a generative model that learns to synthesize novel motions from a single, short motion sequence.

Motion Synthesis Style Transfer

373

Paper
Code

Is Neural Topic Modelling Better than Clustering? An Empirical Study on Clustering with Contextual Embeddings for Topics

1 code implementation • NAACL 2022 • Zihan Zhang, Meng Fang, Ling Chen, Mohammad-Reza Namazi-Rad

Recent work incorporates pre-trained word embeddings such as BERT embeddings into Neural Topic Models (NTMs), generating highly coherent topics.

Clustering Sentence +3

Paper
Code

Horizon-Free Reinforcement Learning in Polynomial Time: the Power of Stationary Policies

no code implementations • 24 Mar 2022 • Zihan Zhang, Xiangyang Ji, Simon S. Du

This paper gives the first polynomial-time algorithm for tabular Markov Decision Processes (MDP) that enjoys a regret bound \emph{independent on the planning horizon}.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Long-Tailed Classification with Gradual Balanced Loss and Adaptive Feature Generation

no code implementations • 28 Feb 2022 • Zihan Zhang, Xiang Xiang

The real-world data distribution is essentially long-tailed, which poses great challenge to the deep model.

Ranked #14 on Long-tail Learning on CIFAR-100-LT (ρ=10)

Long-tail Learning

Paper
Add Code

PromptBERT: Improving BERT Sentence Embeddings with Prompts

1 code implementation • 12 Jan 2022 • Ting Jiang, Jian Jiao, Shaohan Huang, Zihan Zhang, Deqing Wang, Fuzhen Zhuang, Furu Wei, Haizhen Huang, Denvy Deng, Qi Zhang

We propose PromptBERT, a novel contrastive learning method for learning better sentence representation.

Contrastive Learning Denoising +6

316

Paper
Code

Callee: Recovering Call Graphs for Binaries with Transfer and Contrastive Learning

1 code implementation • 2 Nov 2021 • Wenyu Zhu, Zhiyao Feng, Zihan Zhang, Jianjun Chen, Zhijian Ou, Min Yang, Chao Zhang

Recovering binary programs' call graphs is crucial for inter-procedural analysis tasks and applications based on them. transfer One of the core challenges is recognizing targets of indirect calls (i. e., indirect callees).

Contrastive Learning Question Answering +1

Paper
Code

Improving Non-autoregressive Generation with Mixup Training

1 code implementation • 21 Oct 2021 • Ting Jiang, Shaohan Huang, Zihan Zhang, Deqing Wang, Fuzhen Zhuang, Furu Wei, Haizhen Huang, Liangjie Zhang, Qi Zhang

While pre-trained language models have achieved great success on various natural language understanding tasks, how to effectively leverage them into non-autoregressive generation tasks remains a challenge.

Natural Language Understanding Paraphrase Generation +2

Paper
Code

Almost Optimal Batch-Regret Tradeoff for Batch Linear Contextual Bandits

no code implementations • 15 Oct 2021 • Zihan Zhang, Xiangyang Ji, Yuan Zhou

We study the optimal batch-regret tradeoff for batch linear contextual bandits.

Multi-Armed Bandits

Paper
Add Code

Learning from My Friends: Few-Shot Personalized Conversation Systems via Social Networks

no code implementations • 21 May 2021 • Zhiliang Tian, Wei Bi, Zihan Zhang, Dongkyu Lee, Yiping Song, Nevin L. Zhang

The task requires models to generate personalized responses for a speaker given a few conversations from the speaker and a social network.

Meta-Learning

Paper
Add Code

A New Metric on Symmetric Group and Applications to Block Permutation Codes

no code implementations • 9 Mar 2021 • Chaoping Xing, Zihan Zhang

In this paper, by introducing a novel metric closely related to the block permutation metric, we build a bridge between some advanced algebraic methods and codes in the block permutation metric.

Information Theory Combinatorics Information Theory

Paper
Add Code

Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP

no code implementations • NeurIPS 2021 • Zihan Zhang, Jiaqi Yang, Xiangyang Ji, Simon S. Du

With the new confidence sets, we obtain the follow regret bounds: For linear bandits, we obtain an $\tilde{O}(poly(d)\sqrt{1 + \sum_{k=1}^{K}\sigma_k^2})$ data-dependent regret bound, where $d$ is the feature dimension, $K$ is the number of rounds, and $\sigma_k^2$ is the \emph{unknown} variance of the reward at the $k$-th round.

LEMMA

Paper
Add Code

Almost Optimal Model-Free Reinforcement Learningvia Reference-Advantage Decomposition

no code implementations • NeurIPS 2020 • Zihan Zhang, Yuan Zhou, Xiangyang Ji

We study the reinforcement learning problem in the setting of finite-horizon1episodic Markov Decision Processes (MDPs) with S states, A actions, and episode length H. We propose a model-free algorithm UCB-ADVANTAGE and prove that it achieves \tilde{O}(\sqrt{H^2 SAT}) regret where T=KH and K is the number of episodes to play.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Nearly Minimax Optimal Reward-free Reinforcement Learning

no code implementations • 12 Oct 2020 • Zihan Zhang, Simon S. Du, Xiangyang Ji

In the planning phase, the agent needs to return a near-optimal policy for arbitrary reward functions.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon

no code implementations • 28 Sep 2020 • Zihan Zhang, Xiangyang Ji, Simon S. Du

Episodic reinforcement learning generalizes contextual bandits and is often perceived to be more difficult due to long planning horizon and unknown state-dependent transitions.

Decision Making Multi-Armed Bandits +2

Paper
Add Code

Model-Free Reinforcement Learning: from Clipped Pseudo-Regret to Sample Complexity

no code implementations • 6 Jun 2020 • Zihan Zhang, Yuan Zhou, Xiangyang Ji

In this paper we consider the problem of learning an $\epsilon$-optimal policy for a discounted Markov Decision Process (MDP).

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage Decomposition

no code implementations • 21 Apr 2020 • Zihan Zhang, Yuan Zhou, Xiangyang Ji

We study the reinforcement learning problem in the setting of finite-horizon episodic Markov Decision Processes (MDPs) with $S$ states, $A$ actions, and episode length $H$.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Regret Minimization for Reinforcement Learning by Evaluating the Optimal Bias Function

no code implementations • NeurIPS 2019 • Zihan Zhang, Xiangyang Ji

We present an algorithm based on the \emph{Optimism in the Face of Uncertainty} (OFU) principle which is able to learn Reinforcement Learning (RL) modeled by Markov decision process (MDP) with finite state-action space efficiently.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

HAXMLNet: Hierarchical Attention Network for Extreme Multi-Label Text Classification

no code implementations • 24 Mar 2019 • Ronghui You, Zihan Zhang, Suyang Dai, Shanfeng Zhu

Extreme multi-label text classification (XMTC) addresses the problem of tagging each text with the most relevant labels from an extreme-scale label set.

General Classification Multi Label Text Classification +2

Paper
Add Code

AttentionXML: Label Tree-based Attention-Aware Deep Model for High-Performance Extreme Multi-Label Text Classification

3 code implementations • NeurIPS 2019 • Ronghui You, Zihan Zhang, Ziye Wang, Suyang Dai, Hiroshi Mamitsuka, Shanfeng Zhu

We propose a new label tree-based deep learning model for XMTC, called AttentionXML, with two unique features: 1) a multi-label attention mechanism with raw text as input, which allows to capture the most relevant part of text to each label; and 2) a shallow and wide probabilistic label tree (PLT), which allows to handle millions of labels, especially for "tail labels".

General Classification Multi-Label Text Classification +3

237

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.