Search Results for author: Wei Xiong

Found 51 papers, 15 papers with code

ZhichunRoad at SemEval-2022 Task 2: Adversarial Training and Contrastive Learning for Multiword Representations

no code implementations • SemEval (NAACL) 2022 • Xuange Cui, Wei Xiong, Songlin Wang

This paper presents our contribution to the SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding. We explore the impact of three different pre-trained multilingual language models in the SubTaskA. By enhancing the model generalization and robustness, we use the exponential moving average (EMA) method and the adversarial attack strategy. In SubTaskB, we add an effective cross-attention module for modeling the relationships of two sentences. We jointly train the model with a contrastive learning objective and employ a momentum contrast to enlarge the number of negative pairs. Additionally, we use the alignment and uniformity properties to measure the quality of sentence embeddings. Our approach obtained competitive results in both subtasks.

Adversarial Attack Contrastive Learning +2

Paper
Add Code

A Multi-modal one-class generative adversarial network for anomaly detection in manufacturing

no code implementations • ICLR 2019 • Shuhui Qu, Janghwan Lee, Wei Xiong, Wonhyouk Jang, Jie Wang

Since the generated samples simulate the low density area for each modal, the discriminator could directly detect anomalies from normal data.

Generative Adversarial Network One-Class Classification

Paper
Add Code

Example-Guided Image Synthesis using Masked Spatial-Channel Attention and Self-Supervision

no code implementations • ECCV 2020 • Haitian Zheng, Haofu Liao, Lele Chen, Wei Xiong, Tianlang Chen, Jiebo Luo

Example-guided image synthesis has recently been attempted to synthesize an image from a semantic label map and an exemplary image.

Image Generation Spatial Interpolation

Paper
Add Code

SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing

no code implementations • 8 Apr 2024 • Jing Gu, Yilin Wang, Nanxuan Zhao, Wei Xiong, Qing Liu, Zhifei Zhang, He Zhang, Jianming Zhang, HyunJoon Jung, Xin Eric Wang

Compared with existing methods for personalized subject swapping, SwapAnything has three unique advantages: (1) precise control of arbitrary objects and parts rather than the main subject, (2) more faithful preservation of context pixels, (3) better adaptation of the personalized concept to the image.

Image Generation Object

Paper
Add Code

IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation

no code implementations • 15 Mar 2024 • Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, He Zhang, Wei Xiong, Daniel Aliaga

Generative object compositing emerges as a promising new avenue for compositional image editing.

Object

Paper
Add Code

Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization

no code implementations • 13 Mar 2024 • Renjie Pi, Tianyang Han, Wei Xiong, Jipeng Zhang, Runtao Liu, Rui Pan, Tong Zhang

To mitigate this issue, we propose Bootstrapped Preference Optimization (BPO), which conducts preference learning with datasets containing negative responses bootstrapped from the model itself.

Ranked #29 on Visual Question Answering on MM-Vet

Language Modelling Large Language Model +1

Paper
Add Code

Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards

1 code implementation • 28 Feb 2024 • Haoxiang Wang, Yong Lin, Wei Xiong, Rui Yang, Shizhe Diao, Shuang Qiu, Han Zhao, Tong Zhang

Additionally, DPA models user preferences as directions (i. e., unit vectors) in the reward space to achieve user-dependent preference control.

Paper
Code

Diffusion Model-Based Image Editing: A Survey

1 code implementation • 27 Feb 2024 • Yi Huang, Jiancheng Huang, Yifan Liu, Mingfu Yan, Jiaxi Lv, Jianzhuang Liu, Wei Xiong, He Zhang, Shifeng Chen, Liangliang Cao

In this survey, we provide an exhaustive overview of existing methods using diffusion models for image editing, covering both theoretical and practical aspects in the field.

Denoising Image Inpainting +1

239

Paper
Code

A Theoretical Analysis of Nash Learning from Human Feedback under General KL-Regularized Preference

no code implementations • 11 Feb 2024 • Chenlu Ye, Wei Xiong, Yuheng Zhang, Nan Jiang, Tong Zhang

In this work, we provide theoretical insights for a recently proposed learning paradigm, Nash learning from human feedback (NLHF), which considered a general preference model and formulated the alignment process as a game between two competitive LLMs.

Paper
Add Code

Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint

no code implementations • 18 Dec 2023 • Wei Xiong, Hanze Dong, Chenlu Ye, Ziqi Wang, Han Zhong, Heng Ji, Nan Jiang, Tong Zhang

This includes an iterative version of the Direct Preference Optimization (DPO) algorithm for online settings, and a multi-step rejection sampling strategy for offline scenarios.

Language Modelling Large Language Model

Paper
Add Code

Earthfarseer: Versatile Spatio-Temporal Dynamical Systems Modeling in One Model

no code implementations • 13 Dec 2023 • Hao Wu, Shilong Wang, Yuxuan Liang, Zhengyang Zhou, Wei Huang, Wei Xiong, Kun Wang

Efficiently modeling spatio-temporal (ST) physical processes and observations presents a challenging problem for the deep learning community.

Paper
Add Code

Relightful Harmonization: Lighting-aware Portrait Background Replacement

no code implementations • 11 Dec 2023 • Mengwei Ren, Wei Xiong, Jae Shin Yoon, Zhixin Shu, Jianming Zhang, HyunJoon Jung, Guido Gerig, He Zhang

Portrait harmonization aims to composite a subject into a new background, adjusting its lighting and color to ensure harmony with the background scene.

Paper
Add Code

Windformer:Bi-Directional Long-Distance Spatio-Temporal Network For Wind Speed Prediction

1 code implementation • 24 Nov 2023 • XueWei Li, Zewen Shang, Zhiqiang Liu, Jian Yu, Wei Xiong, Mei Yu

History and future time information includes the trend of airflow changes, whether this dynamic information can be utilized will also affect the prediction effect.

Management Time Series

Paper
Code

Mitigating the Alignment Tax of RLHF

no code implementations • 12 Sep 2023 • Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan YAO, Tong Zhang

Building on the analysis and the observation that averaging different layers of the transformer leads to significantly different reward-tax trade-offs, we propose Adaptive Model Averaging (AMA) to adaptively find various combination ratios of model layers.

Common Sense Reasoning Continual Learning

Paper
Add Code

AI-GOMS: Large AI-Driven Global Ocean Modeling System

no code implementations • 6 Aug 2023 • Wei Xiong, Yanfei Xiang, Hao Wu, Shuyi Zhou, Yuze Sun, Muyuan Ma, Xiaomeng Huang

Here, we present AI-GOMS, a large AI-driven global ocean modeling system, for accurate and efficient global ocean daily prediction.

Paper
Add Code

LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models

1 code implementation • 21 Jun 2023 • Shizhe Diao, Rui Pan, Hanze Dong, Ka Shun Shum, Jipeng Zhang, Wei Xiong, Tong Zhang

As the number of available models and specialized tasks keeps growing, the job of general finetuning becomes highly nontrivial.

7,973

Paper
Code

Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources

no code implementations • 14 Jun 2023 • Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

Then, a novel HetPEVI algorithm is proposed, which simultaneously considers the sample uncertainties from a finite number of data samples per data source and the source uncertainties due to a finite number of available data sources.

Offline RL reinforcement-learning +1

Paper
Add Code

Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration

1 code implementation • NeurIPS 2023 • Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, Zhaoran Wang

To achieve this, existing sample-efficient online RL algorithms typically consist of three components: estimation, planning, and exploration.

Paper
Code

PastNet: Introducing Physical Inductive Biases for Spatio-temporal Video Prediction

1 code implementation • 19 May 2023 • Hao Wu, Wei Xiong, Fan Xu, Xiao Luo, Chong Chen, Xian-Sheng Hua, Haixin Wang

In this paper, we investigate the challenge of spatio-temporal video prediction, which involves generating future videos based on historical data streams.

Video Prediction

Paper
Code

Reward Teaching for Federated Multi-armed Bandits

no code implementations • 3 May 2023 • Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

Rigorous analyses demonstrate that when facing clients with UCB1, TWL outperforms TAL in terms of the dependencies on sub-optimality gaps thanks to its adaptive design.

Multi-Armed Bandits

Paper
Add Code

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment

1 code implementation • 13 Apr 2023 • Hanze Dong, Wei Xiong, Deepanshu Goyal, Yihan Zhang, Winnie Chow, Rui Pan, Shizhe Diao, Jipeng Zhang, Kashun Shum, Tong Zhang

Utilizing a reward model and a sufficient number of samples, our approach selects the high-quality samples, discarding those that exhibit undesired behavior, and subsequently enhancing the model by fine-tuning on these filtered samples.

Ethics

7,973

Paper
Code

InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning

no code implementations • 6 Apr 2023 • Jing Shi, Wei Xiong, Zhe Lin, Hyun Joon Jung

First, we learn the general concept of the input images by converting them to a textual token with a learnable image encoder.

Diffusion Personalization Tuning Free Text-to-Image Generation

Paper
Add Code

ZhichunRoad at Amazon KDD Cup 2022: MultiTask Pre-Training for E-Commerce Product Search

1 code implementation • 31 Jan 2023 • Xuange Cui, Wei Xiong, Songlin Wang

In this paper, we propose a robust multilingual model to improve the quality of search results.

Contrastive Learning

Paper
Code

Koopman neural operator as a mesh-free solver of non-linear partial differential equations

1 code implementation • 24 Jan 2023 • Wei Xiong, Xiaomeng Huang, Ziyang Zhang, Ruixuan Deng, Pei Sun, Yang Tian

In machine learning, numerous latest advances of solver designs are accomplished in developing neural operators, a kind of mesh-free approximators of the infinite-dimensional operators that map between different parameterization spaces of equation solutions.

180

Paper
Code

KoopmanLab: machine learning for solving complex physics equations

1 code implementation • 3 Jan 2023 • Wei Xiong, Muyuan Ma, Xiaomeng Huang, Ziyang Zhang, Pei Sun, Yang Tian

To overcome this challenge, we present KoopmanLab, an efficient module of the Koopman neural operator family, for learning PDEs without analytic solutions or closed forms.

180

Paper
Code

Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes

no code implementations • 12 Dec 2022 • Chenlu Ye, Wei Xiong, Quanquan Gu, Tong Zhang

In this paper, we consider the contextual bandit with general function approximation and propose a computationally efficient algorithm to achieve a regret of $\tilde{O}(\sqrt{T}+\zeta)$.

Multi-Armed Bandits Reinforcement Learning (RL)

Paper
Add Code

GEC: A Unified Framework for Interactive Decision Making in MDP, POMDP, and Beyond

no code implementations • 3 Nov 2022 • Han Zhong, Wei Xiong, Sirui Zheng, LiWei Wang, Zhaoran Wang, Zhuoran Yang, Tong Zhang

The proposed algorithm modifies the standard posterior sampling algorithm in two aspects: (i) we use an optimistic prior distribution that biases towards hypotheses with higher values and (ii) a loglikelihood function is set to be the empirical loss evaluated on the historical data, where the choice of loss function supports both model-free and model-based learning.

Decision Making Reinforcement Learning (RL)

Paper
Add Code

A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games

no code implementations • 4 Oct 2022 • Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, Tong Zhang

Existing studies on provably efficient algorithms for Markov games (MGs) almost exclusively build on the "optimism in the face of uncertainty" (OFU) principle.

Paper
Add Code

Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game

no code implementations • 31 May 2022 • Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, LiWei Wang, Tong Zhang

We also extend our techniques to the two-player zero-sum Markov games (MGs), and establish a new performance lower bound for MGs, which tightens the existing result, and verifies the nearly minimax optimality of the proposed algorithm.

Offline RL Reinforcement Learning (RL)

Paper
Add Code

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

2 code implementations • 11 May 2022 • Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu1, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gaoand Dengwen Zhouand Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang

The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29. 00dB on DIV2K validation set.

Image Super-Resolution

116

Paper
Code

Breast Cancer Induced Bone Osteolysis Prediction Using Temporal Variational Auto-Encoders

no code implementations • 20 Mar 2022 • Wei Xiong, Neil Yeung, Shubo Wang, Haofu Liao, Liyun Wang, Jiebo Luo

Its ability of predicting the development of bone lesions in cancer-invading bones can assist in assessing the risk of impending fractures and choosing proper treatments in breast cancer bone metastasis.

Computed Tomography (CT)

Paper
Add Code

Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets

no code implementations • 15 Feb 2022 • Han Zhong, Wei Xiong, Jiyuan Tan, LiWei Wang, Tong Zhang, Zhaoran Wang, Zhuoran Yang

When the dataset does not have uniform coverage over all policy pairs, finding an approximate NE involves challenges in three aspects: (i) distributional shift between the behavior policy and the optimal policy, (ii) function approximation to handle large state space, and (iii) minimax optimization for equilibrium solving.

Paper
Add Code

Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization

1 code implementation • NeurIPS 2021 • Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

In this paper, we propose BEACON -- Batched Exploration with Adaptive COmmunicatioN -- that closes this gap.

Efficient Exploration Multi-Armed Bandits

Paper
Code

(Almost) Free Incentivized Exploration from Decentralized Learning Agents

1 code implementation • NeurIPS 2021 • Chengshuai Shi, Haifeng Xu, Wei Xiong, Cong Shen

In this work, we break this barrier and study incentivized exploration with multiple and long-term strategic agents, who have more complicated behaviors that often appear in real-world applications.

Multi-Armed Bandits

Paper
Code

Distributional Reinforcement Learning for Multi-Dimensional Reward Functions

1 code implementation • NeurIPS 2021 • Pushi Zhang, Xiaoyu Chen, Li Zhao, Wei Xiong, Tao Qin, Tie-Yan Liu

To fully inherit the benefits of distributional RL and hybrid reward architectures, we introduce Multi-Dimensional Distributional DQN (MD3QN), which extends distributional RL to model the joint return distribution from multiple reward sources.

Distributional Reinforcement Learning reinforcement-learning +1

Paper
Code

GANet: Glyph-Attention Network for Few-Shot Font Generation

no code implementations • 29 Sep 2021 • Mingtao Guo, Wei Xiong, Zheng Wang, Yong Tang, Ting Wu

Font generation is a valuable but challenging task, it is time consuming and costly to design font libraries which cover all glyphs with various styles.

Font Generation

Paper
Add Code

Heterogeneous Network Embedding for Deep Semantic Relevance Match in E-commerce Search

no code implementations • 13 Jan 2021 • Ziyang Liu, Zhaomeng Cheng, Yunjiang Jiang, Yue Shang, Wei Xiong, Sulong Xu, Bo Long, Di Jin

We propose in this paper a novel Second-order Relevance, which is fundamentally different from the previous First-order Relevance, to improve result relevance prediction.

Network Embedding

Paper
Add Code

PMGT-VR: A decentralized proximal-gradient algorithmic framework with variance reduction

no code implementations • 30 Dec 2020 • Haishan Ye, Wei Xiong, Tong Zhang

This paper considers the decentralized composite optimization problem.

Paper
Add Code

BERT2DNN: BERT Distillation with Massive Unlabeled Data for Online E-Commerce Search

no code implementations • 20 Oct 2020 • Yunjiang Jiang, Yue Shang, Ziyang Liu, Hongwei Shen, Yun Xiao, Wei Xiong, Sulong Xu, Weipeng Yan, Di Jin

Relevance has significant impact on user experience and business profit for e-commerce search platform.

Sentiment Analysis

Paper
Add Code

Image Sentiment Transfer

no code implementations • 19 Jun 2020 • Tianlang Chen, Wei Xiong, Haitian Zheng, Jiebo Luo

In this paper, we propose an effective and flexible framework that performs image sentiment transfer at the object level.

Disentanglement Image-to-Image Translation +2

Paper
Add Code

LinksIQ: Robust and Efficient Modulation Recognition with Imperfect Spectrum Scans

no code implementations • 7 May 2020 • Wei Xiong, Karyn Doke, Petko Bogdanov, Mariya Zheleva

While critical for the practical progress of spectrum sharing, modulation recognition has so far been investigated under unrealistic assumptions: (i) a transmitter's bandwidth must be scanned alone and in full, (ii) prior knowledge of the technology must be available and (iii) a transmitter must be trustworthy.

Paper
Add Code

Unsupervised Low-light Image Enhancement with Decoupled Networks

no code implementations • 6 May 2020 • Wei Xiong, Ding Liu, Xiaohui Shen, Chen Fang, Jiebo Luo

In this paper, we tackle the problem of enhancing real-world low-light images with significant noise in an unsupervised fashion.

Image-to-Image Translation Low-Light Image Enhancement

Paper
Add Code

Example-Guided Image Synthesis across Arbitrary Scenes using Masked Spatial-Channel Attention and Self-Supervision

no code implementations • 18 Apr 2020 • Haitian Zheng, Haofu Liao, Lele Chen, Wei Xiong, Tianlang Chen, Jiebo Luo

Example-guided image synthesis has recently been attempted to synthesize an image from a semantic label map and an exemplary image.

Image Generation Spatial Interpolation

Paper
Add Code

Decentralized Multi-player Multi-armed Bandits with No Collision Information

no code implementations • 29 Feb 2020 • Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

The decentralized stochastic multi-player multi-armed bandit (MP-MAB) problem, where the collision information is not available to the players, is studied in this paper.

Multi-Armed Bandits

Paper
Add Code

Fine-grained Image-to-Image Transformation towards Visual Recognition

no code implementations • CVPR 2020 • Wei Xiong, Yutong He, Yixuan Zhang, Wenhan Luo, Lin Ma, Jiebo Luo

In this paper, we aim at transforming an image with a fine-grained category to synthesize new images that preserve the identity of the input image, which can thereby benefit the subsequent fine-grained image recognition and few-shot learning tasks.

Few-Shot Learning Fine-Grained Image Recognition

Paper
Add Code

Example-Guided Scene Image Synthesis using Masked Spatial-Channel Attention and Patch-Based Self-Supervision

no code implementations • 27 Nov 2019 • Haitian Zheng, Haofu Liao, Lele Chen, Wei Xiong, Tianlang Chen, Jiebo Luo

Example-guided image synthesis has been recently attempted to synthesize an image from a semantic label map and an exemplary image.

Image Generation Spatial Interpolation

Paper
Add Code

Foreground-aware Image Inpainting

no code implementations • CVPR 2019 • Wei Xiong, Jiahui Yu, Zhe Lin, Jimei Yang, Xin Lu, Connelly Barnes, Jiebo Luo

We show that by such disentanglement, the contour completion model predicts reasonable contours of objects, and further substantially improves the performance of image inpainting.

Disentanglement Image Inpainting

Paper
Add Code

CariGAN: Caricature Generation through Weakly Paired Adversarial Learning

no code implementations • 1 Nov 2018 • Wenbin Li, Wei Xiong, Haofu Liao, Jing Huo, Yang Gao, Jiebo Luo

Furthermore, an attention mechanism is introduced to encourage our model to focus on the key facial parts so that more vivid details in these regions can be generated.

Caricature