no code implementations • 20 Jan 2024 • Chengyi Tu, Renfei Chen, Ying Fan, YongLiang Yang
The users' strategies evolve according to different processes that capture effects of payoff, resource, and noise.
no code implementations • 11 Oct 2023 • Chengyi Tu, Renfei Chen, Ying Fan, Xuwei Pan
However, there is still a lack of a novel and comprehensive framework for modelling extraction of common-pool resources and cooperation of human agents that can account for different factors that shape the system behavior and outcomes.
2 code implementations • 25 May 2023 • Ying Fan, Olivia Watkins, Yuqing Du, Hao liu, MoonKyung Ryu, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, Kimin Lee
We focus on diffusion models, defining the fine-tuning task as an RL problem, and updating the pre-trained text-to-image diffusion models using policy gradient to maximize the feedback-trained reward.
1 code implementation • 13 Mar 2023 • Zhenmei Shi, Yifei Ming, Ying Fan, Frederic Sala, YIngyu Liang
In this paper, we propose a simple and effective regularization method based on the nuclear norm of the learned features for domain generalization.
1 code implementation • 31 Jan 2023 • Ying Fan, Kangwook Lee
In this study, we propose Shortcut Fine-Tuning (SFT), a new approach for addressing the challenge of fast sampling of pretrained Denoising Diffusion Probabilistic Models (DDPMs).
1 code implementation • 13 Dec 2022 • Dohyun Kwon, Ying Fan, Kangwook Lee
Specifically, we prove that the Wasserstein distance is upper bounded by the square root of the objective function up to multiplicative constants and a fixed constant offset.
1 code implementation • 28 Jun 2022 • Yifei Ming, Ying Fan, Yixuan Li
In this work, we propose a novel posterior sampling-based outlier mining framework, POEM, which facilitates efficient use of outlier data and promotes learning a compact decision boundary between ID and OOD data for improved detection.
Out-of-Distribution Detection Out of Distribution (OOD) Detection
no code implementations • 29 Sep 2021 • Ying Fan, Sharon Li
Furthermore, we provide theoretical guarantees that our method can improve OOD uncertainty estimation while ensuring the convergence performance of the in-distribution environment.
1 code implementation • 29 Apr 2021 • Siyu Gu, Xiang-Rong Sheng, Ying Fan, Guorui Zhou, Xiaoqiang Zhu
If conversion happens outside the waiting window, this sample will be duplicated and ingested into the training pipeline with a positive label.
1 code implementation • 20 Nov 2020 • Ying Fan, Yifei Ming
In this paper, we study model-based posterior sampling for reinforcement learning (PSRL) in continuous state-action spaces theoretically and empirically.
no code implementations • 28 Sep 2020 • Ying Fan, Yifei Ming
Our bound can be extended to nonlinear cases as well: using linear kernels on the feature representation $\phi$, the Bayesian regret bound becomes $\tilde{O}(H^{3/2}d_{\phi}\sqrt{T})$, where $d_\phi$ is the dimension of the representation space.
1 code implementation • 10 Jun 2020 • Pi Qi, Xiaoqiang Zhu, Guorui Zhou, Yujing Zhang, Zhe Wang, Lejian Ren, Ying Fan, Kun Gai
Serving the main traffic in our real system now, SIM models user behavior data with maximum length reaching up to 54000, pushing SOTA to 54x.
no code implementations • 9 Jun 2019 • Zongning Wu, Zengru Di, Ying Fan
Here, we discuss how to multiplex node information as an embedding foundation through identifying the bipartite structure of directed networks; and we proposed the generally mapping framework which hybrids the topological structure of complex networks, directed links and the hidden metrics space.
Physics and Society
no code implementations • 11 Dec 2018 • Ying Fan, Letian Chen, Yizhou Wang
Efficient Reinforcement Learning usually takes advantage of demonstration or good exploration strategy.
15 code implementations • 11 Sep 2018 • Guorui Zhou, Na Mou, Ying Fan, Qi Pi, Weijie Bian, Chang Zhou, Xiaoqiang Zhu, Kun Gai
Easy-to-use, Modular and Extendible package of deep-learning based CTR models. DeepFM, DeepInterestNetwork(DIN), DeepInterestEvolutionNetwork(DIEN), DeepCrossNetwork(DCN), AttentionalFactorizationMachine(AFM), Neural Factorization Machine(NFM), AutoInt
Ranked #1 on Click-Through Rate Prediction on Amazon Dataset
2 code implementations • 14 Aug 2017 • Guorui Zhou, Ying Fan, Runpeng Cui, Weijie Bian, Xiaoqiang Zhu, Kun Gai
Models applied on real time response task, like click-through rate (CTR) prediction model, require high accuracy and rigorous response time.
17 code implementations • 21 Jun 2017 • Guorui Zhou, Chengru Song, Xiaoqiang Zhu, Ying Fan, Han Zhu, Xiao Ma, Yanghui Yan, Junqi Jin, Han Li, Kun Gai
In this way, user features are compressed into a fixed-length representation vector, in regardless of what candidate ads are.
Ranked #1 on Click-Through Rate Prediction on Amazon