Search Results for author: Ying Fan

Found 17 papers, 11 papers with code

Coevolution of Resource and Strategies in Common-Pool Resource Dilemmas: A Coupled Human-Environmental System Model

no code implementations • 20 Jan 2024 • Chengyi Tu, Renfei Chen, Ying Fan, YongLiang Yang

The users' strategies evolve according to different processes that capture effects of payoff, resource, and noise.

Paper
Add Code

Impact of resource availability and conformity effect on sustainability of common-pool resources

no code implementations • 11 Oct 2023 • Chengyi Tu, Renfei Chen, Ying Fan, Xuwei Pan

However, there is still a lack of a novel and comprehensive framework for modelling extraction of common-pool resources and cooperation of human agents that can account for different factors that shape the system behavior and outcomes.

Paper
Add Code

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

2 code implementations • 25 May 2023 • Ying Fan, Olivia Watkins, Yuqing Du, Hao liu, MoonKyung Ryu, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, Kimin Lee

We focus on diffusion models, defining the fine-tuning task as an RL problem, and updating the pre-trained text-to-image diffusion models using policy gradient to maximize the feedback-trained reward.

reinforcement-learning Reinforcement Learning (RL)

32,801

Paper
Code

Domain Generalization via Nuclear Norm Regularization

1 code implementation • 13 Mar 2023 • Zhenmei Shi, Yifei Ming, Ying Fan, Frederic Sala, YIngyu Liang

In this paper, we propose a simple and effective regularization method based on the nuclear norm of the learned features for domain generalization.

Domain Generalization

Paper
Code

Optimizing DDPM Sampling with Shortcut Fine-Tuning

1 code implementation • 31 Jan 2023 • Ying Fan, Kangwook Lee

In this study, we propose Shortcut Fine-Tuning (SFT), a new approach for addressing the challenge of fast sampling of pretrained Denoising Diffusion Probabilistic Models (DDPMs).

Denoising Reinforcement Learning (RL)

Paper
Code

Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance

1 code implementation • 13 Dec 2022 • Dohyun Kwon, Ying Fan, Kangwook Lee

Specifically, we prove that the Wasserstein distance is upper bounded by the square root of the objective function up to multiplicative constants and a fixed constant offset.

Audio Synthesis Image Generation

Paper
Code

POEM: Out-of-Distribution Detection with Posterior Sampling

1 code implementation • 28 Jun 2022 • Yifei Ming, Ying Fan, Yixuan Li

In this work, we propose a novel posterior sampling-based outlier mining framework, POEM, which facilitates efficient use of outlier data and promotes learning a compact decision boundary between ID and OOD data for improved detection.

Out-of-Distribution Detection Out of Distribution (OOD) Detection

Paper
Code

Towards Unknown-aware Deep Q-Learning

no code implementations • 29 Sep 2021 • Ying Fan, Sharon Li

Furthermore, we provide theoretical guarantees that our method can improve OOD uncertainty estimation while ensuring the convergence performance of the in-distribution environment.

Out of Distribution (OOD) Detection Q-Learning +1

Paper
Add Code

Real Negatives Matter: Continuous Training with Real Negatives for Delayed Feedback Modeling

1 code implementation • 29 Apr 2021 • Siyu Gu, Xiang-Rong Sheng, Ying Fan, Guorui Zhou, Xiaoqiang Zhu

If conversion happens outside the waiting window, this sample will be duplicated and ingested into the training pipeline with a positive label.

Paper
Code

Model-based Reinforcement Learning for Continuous Control with Posterior Sampling

1 code implementation • 20 Nov 2020 • Ying Fan, Yifei Ming

In this paper, we study model-based posterior sampling for reinforcement learning (PSRL) in continuous state-action spaces theoretically and empirically.

Continuous Control Efficient Exploration +6

Paper
Code

Efficient Exploration for Model-based Reinforcement Learning with Continuous States and Actions

no code implementations • 28 Sep 2020 • Ying Fan, Yifei Ming

Our bound can be extended to nonlinear cases as well: using linear kernels on the feature representation $\phi$, the Bayesian regret bound becomes $\tilde{O}(H^{3/2}d_{\phi}\sqrt{T})$, where $d_\phi$ is the dimension of the representation space.

Efficient Exploration Gaussian Processes +4

Paper
Add Code

Search-based User Interest Modeling with Lifelong Sequential Behavior Data for Click-Through Rate Prediction

1 code implementation • 10 Jun 2020 • Pi Qi, Xiaoqiang Zhu, Guorui Zhou, Yujing Zhang, Zhe Wang, Lejian Ren, Ying Fan, Kun Gai

Serving the main traffic in our real system now, SIM models user behavior data with maximum length reaching up to 54000, pushing SOTA to 54x.

Click-Through Rate Prediction Recommendation Systems

310

Paper
Code

A hyperbolic Embedding Model for Directed Networks

no code implementations • 9 Jun 2019 • Zongning Wu, Zengru Di, Ying Fan

Here, we discuss how to multiplex node information as an embedding foundation through identifying the bipartite structure of directed networks; and we proposed the generally mapping framework which hybrids the topological structure of complex networks, directed links and the hidden metrics space.

Physics and Society

Paper
Add Code

Efficient Model-Free Reinforcement Learning Using Gaussian Process

no code implementations • 11 Dec 2018 • Ying Fan, Letian Chen, Yizhou Wang

Efficient Reinforcement Learning usually takes advantage of demonstration or good exploration strategy.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Deep Interest Evolution Network for Click-Through Rate Prediction

15 code implementations • 11 Sep 2018 • Guorui Zhou, Na Mou, Ying Fan, Qi Pi, Weijie Bian, Chang Zhou, Xiaoqiang Zhu, Kun Gai

Easy-to-use, Modular and Extendible package of deep-learning based CTR models. DeepFM, DeepInterestNetwork(DIN), DeepInterestEvolutionNetwork(DIEN), DeepCrossNetwork(DCN), AttentionalFactorizationMachine(AFM), Neural Factorization Machine(NFM), AutoInt

Ranked #1 on Click-Through Rate Prediction on Amazon Dataset