Search Results for author: Kaixuan Huang

Found 21 papers, 5 papers with code

CRISPR-GPT: An LLM Agent for Automated Design of Gene-Editing Experiments

no code implementations • 27 Apr 2024 • Kaixuan Huang, Yuanhao Qu, Henry Cousins, William A. Johnson, Di Yin, Mihir Shah, Denny Zhou, Russ Altman, Mengdi Wang, Le Cong

We showcase the potential of CRISPR-GPT for assisting non-expert researchers with gene-editing experiments from scratch and validate the agent's effectiveness in a real-world use case.

Diffusion Model for Data-Driven Black-Box Optimization

no code implementations • 20 Mar 2024 • Zihao Li, Hui Yuan, Kaixuan Huang, Chengzhuo Ni, Yinyu Ye, Minshuo Chen, Mengdi Wang

In this paper, we focus on diffusion models, a powerful generative AI technology, and investigate their potential for black-box optimization over complex structured variables.

Embodied LLM Agents Learn to Cooperate in Organized Teams

1 code implementation • 19 Mar 2024 • Xudong Guo, Kaixuan Huang, Jiale Liu, Wenhui Fan, Natalia Vélez, Qingyun Wu, Huazheng Wang, Thomas L. Griffiths, Mengdi Wang

Large Language Models (LLMs) have emerged as integral tools for reasoning, planning, and decision-making, drawing upon their extensive world knowledge and proficiency in language-related tasks.

Decision Making • World Knowledge

Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

no code implementations • 7 Feb 2024 • Boyi Wei, Kaixuan Huang, Yangsibo Huang, Tinghao Xie, Xiangyu Qi, Mengzhou Xia, Prateek Mittal, Mengdi Wang, Peter Henderson

We develop methods to identify critical regions that are vital for safety guardrails, and that are disentangled from utility-relevant regions at both the neuron and rank levels.

A 5' UTR Language Model for Decoding Untranslated Regions of mRNA and Function Predictions

no code implementations • 5 Oct 2023 • Yanyi Chu, Dan Yu, Yupeng Li, Kaixuan Huang, Yue Shen, Le Cong, Jason Zhang, Mengdi Wang

The model outperformed the best-known benchmark by up to 42% for predicting the Mean Ribosome Loading, and by up to 60% for predicting the Translation Efficiency and the mRNA Expression Level.

Language Modelling • Translation

A Variational Auto-Encoder Enabled Multi-Band Channel Prediction Scheme for Indoor Localization

no code implementations • 19 Sep 2023 • Ruihao Yuan, Kaixuan Huang, Pan Yang, Shunqing Zhang

Indoor localization is in increasing demand for various cutting-edge technologies, such as virtual/augmented reality and smart homes.

Indoor Localization

Deep Reinforcement Learning for Efficient and Fair Allocation of Health Care Resources

no code implementations • 15 Sep 2023 • Yikuan Li, Chengsheng Mao, Kaixuan Huang, Hanyin Wang, Zheng Yu, Mengdi Wang, Yuan Luo

Scarcity of health care resources could result in the unavoidable consequence of rationing.

Fairness • reinforcement-learning

Scaling In-Context Demonstrations with Structured Attention

no code implementations • 5 Jul 2023 • Tianle Cai, Kaixuan Huang, Jason D. Lee, Mengdi Wang

However, their capabilities of in-context learning are limited by the model architecture: 1) the use of demonstrations is constrained by a maximum sentence length due to positional embeddings; 2) the quadratic complexity of attention hinders users from using more demonstrations efficiently; 3) LLMs are shown to be sensitive to the order of the demonstrations.

In-Context Learning • Sentence

Visual Adversarial Examples Jailbreak Aligned Large Language Models

1 code implementation • 22 Jun 2023 • Xiangyu Qi, Kaixuan Huang, Ashwinee Panda, Peter Henderson, Mengdi Wang, Prateek Mittal

Recently, there has been a surge of interest in integrating vision into Large Language Models (LLMs), exemplified by Visual Language Models (VLMs) such as Flamingo and GPT-4.

GPT-4

Deep Reinforcement Learning for Cost-Effective Medical Diagnosis

1 code implementation • 20 Feb 2023 • Zheng Yu, Yikuan Li, Joseph Kim, Kaixuan Huang, Yuan Luo, Mengdi Wang

In this work, we use reinforcement learning (RL) to find a dynamic policy that selects lab test panels sequentially based on previous observations, ensuring accurate testing at a low cost.

Anomaly Detection • Medical Diagnosis • +3

Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data

no code implementations • 14 Feb 2023 • Minshuo Chen, Kaixuan Huang, Tuo Zhao, Mengdi Wang

Furthermore, the generated distribution based on the estimated score function captures the data geometric structures and converges to a close vicinity of the data distribution.

Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization

no code implementations • 29 Jun 2022 • Kaixuan Huang, Yu Wu, Xuezhou Zhang, Shenyinying Tu, Qingyun Wu, Mengdi Wang, Huazheng Wang

Online influence maximization aims to maximize the spread of content in a social network with an unknown network model by selecting a few seed nodes.

Model-based Reinforcement Learning • reinforcement-learning • +1

High Precision Indoor Localization with Dummy Antennas -- An Experimental Study

no code implementations • 24 Aug 2021 • Kaixuan Huang, Chenlu Xiang, Shunqing Zhang, Shugong Xu, Xianfeng Ma, Qinglong Xian, Hua Yang

With the rising demand for indoor localization, high-precision fingerprint-based techniques have become increasingly important.

Indoor Localization • Vocal Bursts Intensity Prediction

Going Beyond Linear RL: Sample Efficient Neural Function Approximation

no code implementations • NeurIPS 2021 • Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang

While the theory of RL has traditionally focused on linear function approximation (or eluder dimension) approaches, little is known about nonlinear RL with neural net approximations of the Q functions.

Reinforcement Learning (RL)

Optimal Gradient-based Algorithms for Non-concave Bandit Optimization

no code implementations • NeurIPS 2021 • Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang

This work considers a large family of bandit problems where the unknown underlying reward function is non-concave, including the low-rank generalized linear bandit problems and two-layer neural network with polynomial activation bandit problem.

A Short Note on the Relationship of Information Gain and Eluder Dimension

no code implementations • 6 Jul 2021 • Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei

Eluder dimension and information gain are two widely used complexity measures in bandit and reinforcement learning.

LEMMA • reinforcement-learning • +1

Fast Federated Learning in the Presence of Arbitrary Device Unavailability

1 code implementation • NeurIPS 2021 • Xinran Gu, Kaixuan Huang, Jingzhao Zhang, Longbo Huang

In this case, the convergence of popular FL algorithms such as FedAvg is severely influenced by the straggling devices.

Federated Learning

Why Do Deep Residual Networks Generalize Better than Deep Feedforward Networks? --- A Neural Tangent Kernel Perspective

no code implementations • NeurIPS 2020 • Kaixuan Huang, Yuqing Wang, Molei Tao, Tuo Zhao

We then compare the kernel of deep ResNets with that of deep FFNets and discover that the class of functions induced by the kernel of FFNets is asymptotically not learnable, as the depth goes to infinity.

Why Do Deep Residual Networks Generalize Better than Deep Feedforward Networks? -- A Neural Tangent Kernel Perspective

no code implementations • 14 Feb 2020 • Kaixuan Huang, Yuqing Wang, Molei Tao, Tuo Zhao

On the Convergence of FedAvg on Non-IID Data

2 code implementations • ICLR 2020 • Xiang Li, Kaixuan Huang, Wenhao Yang, Shusen Wang, Zhihua Zhang

In this paper, we analyze the convergence of \texttt{FedAvg} on non-iid data and establish a convergence rate of $\mathcal{O}(\frac{1}{T})$ for strongly convex and smooth problems, where $T$ is the number of SGDs.

Edge-computing • Federated Learning

Distributionally Robust Optimization Leads to Better Generalization: on SGD and Beyond

no code implementations • ICLR 2019 • Jikai Hou, Kaixuan Huang, Zhihua Zhang

In this paper, we adopt distributionally robust optimization (DRO) (Ben-Tal et al., 2013) in the hope of achieving better generalization in deep learning tasks.
