Search Results for author: Ying Fan

Found 17 papers, 11 papers with code

Coevolution of Resource and Strategies in Common-Pool Resource Dilemmas: A Coupled Human-Environmental System Model

no code implementations20 Jan 2024 Chengyi Tu, Renfei Chen, Ying Fan, YongLiang Yang

The users' strategies evolve according to different processes that capture effects of payoff, resource, and noise.

Impact of resource availability and conformity effect on sustainability of common-pool resources

no code implementations11 Oct 2023 Chengyi Tu, Renfei Chen, Ying Fan, Xuwei Pan

However, there is still a lack of a novel and comprehensive framework for modelling extraction of common-pool resources and cooperation of human agents that can account for different factors that shape the system behavior and outcomes.

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

2 code implementations25 May 2023 Ying Fan, Olivia Watkins, Yuqing Du, Hao liu, MoonKyung Ryu, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, Kimin Lee

We focus on diffusion models, defining the fine-tuning task as an RL problem, and updating the pre-trained text-to-image diffusion models using policy gradient to maximize the feedback-trained reward.

reinforcement-learning Reinforcement Learning (RL)

Domain Generalization via Nuclear Norm Regularization

1 code implementation13 Mar 2023 Zhenmei Shi, Yifei Ming, Ying Fan, Frederic Sala, YIngyu Liang

In this paper, we propose a simple and effective regularization method based on the nuclear norm of the learned features for domain generalization.

Domain Generalization

Optimizing DDPM Sampling with Shortcut Fine-Tuning

1 code implementation31 Jan 2023 Ying Fan, Kangwook Lee

In this study, we propose Shortcut Fine-Tuning (SFT), a new approach for addressing the challenge of fast sampling of pretrained Denoising Diffusion Probabilistic Models (DDPMs).

Denoising Reinforcement Learning (RL)

Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance

1 code implementation13 Dec 2022 Dohyun Kwon, Ying Fan, Kangwook Lee

Specifically, we prove that the Wasserstein distance is upper bounded by the square root of the objective function up to multiplicative constants and a fixed constant offset.

Audio Synthesis Image Generation

POEM: Out-of-Distribution Detection with Posterior Sampling

1 code implementation28 Jun 2022 Yifei Ming, Ying Fan, Yixuan Li

In this work, we propose a novel posterior sampling-based outlier mining framework, POEM, which facilitates efficient use of outlier data and promotes learning a compact decision boundary between ID and OOD data for improved detection.

Out-of-Distribution Detection Out of Distribution (OOD) Detection

Towards Unknown-aware Deep Q-Learning

no code implementations29 Sep 2021 Ying Fan, Sharon Li

Furthermore, we provide theoretical guarantees that our method can improve OOD uncertainty estimation while ensuring the convergence performance of the in-distribution environment.

Out of Distribution (OOD) Detection Q-Learning +1

Real Negatives Matter: Continuous Training with Real Negatives for Delayed Feedback Modeling

1 code implementation29 Apr 2021 Siyu Gu, Xiang-Rong Sheng, Ying Fan, Guorui Zhou, Xiaoqiang Zhu

If conversion happens outside the waiting window, this sample will be duplicated and ingested into the training pipeline with a positive label.

Model-based Reinforcement Learning for Continuous Control with Posterior Sampling

1 code implementation20 Nov 2020 Ying Fan, Yifei Ming

In this paper, we study model-based posterior sampling for reinforcement learning (PSRL) in continuous state-action spaces theoretically and empirically.

Continuous Control Efficient Exploration +6

Efficient Exploration for Model-based Reinforcement Learning with Continuous States and Actions

no code implementations28 Sep 2020 Ying Fan, Yifei Ming

Our bound can be extended to nonlinear cases as well: using linear kernels on the feature representation $\phi$, the Bayesian regret bound becomes $\tilde{O}(H^{3/2}d_{\phi}\sqrt{T})$, where $d_\phi$ is the dimension of the representation space.

Efficient Exploration Gaussian Processes +4

A hyperbolic Embedding Model for Directed Networks

no code implementations9 Jun 2019 Zongning Wu, Zengru Di, Ying Fan

Here, we discuss how to multiplex node information as an embedding foundation through identifying the bipartite structure of directed networks; and we proposed the generally mapping framework which hybrids the topological structure of complex networks, directed links and the hidden metrics space.

Physics and Society

Efficient Model-Free Reinforcement Learning Using Gaussian Process

no code implementations11 Dec 2018 Ying Fan, Letian Chen, Yizhou Wang

Efficient Reinforcement Learning usually takes advantage of demonstration or good exploration strategy.

reinforcement-learning Reinforcement Learning (RL)

Deep Interest Evolution Network for Click-Through Rate Prediction

15 code implementations11 Sep 2018 Guorui Zhou, Na Mou, Ying Fan, Qi Pi, Weijie Bian, Chang Zhou, Xiaoqiang Zhu, Kun Gai

Easy-to-use, Modular and Extendible package of deep-learning based CTR models. DeepFM, DeepInterestNetwork(DIN), DeepInterestEvolutionNetwork(DIEN), DeepCrossNetwork(DCN), AttentionalFactorizationMachine(AFM), Neural Factorization Machine(NFM), AutoInt

Click-Through Rate Prediction

Rocket Launching: A Universal and Efficient Framework for Training Well-performing Light Net

2 code implementations14 Aug 2017 Guorui Zhou, Ying Fan, Runpeng Cui, Weijie Bian, Xiaoqiang Zhu, Kun Gai

Models applied on real time response task, like click-through rate (CTR) prediction model, require high accuracy and rigorous response time.

Click-Through Rate Prediction

Deep Interest Network for Click-Through Rate Prediction

17 code implementations21 Jun 2017 Guorui Zhou, Chengru Song, Xiaoqiang Zhu, Ying Fan, Han Zhu, Xiao Ma, Yanghui Yan, Junqi Jin, Han Li, Kun Gai

In this way, user features are compressed into a fixed-length representation vector, in regardless of what candidate ads are.

Click-Through Rate Prediction

Cannot find the paper you are looking for? You can Submit a new open access paper.