Search Results for author: Yinxiao Li

Found 10 papers, 5 papers with code

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

no code implementations11 Jan 2024 Seung Hyun Lee, Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu, Qifei Wang, Fei Deng, Glenn Entis, Junfeng He, Gang Li, Sangpil Kim, Irfan Essa, Feng Yang

Additionally, Parrot employs a joint optimization approach for the T2I model and the prompt expansion network, facilitating the generation of quality-aware text prompts, thus further enhancing the final image quality.

Reinforcement Learning (RL) Text-to-Image Generation

SVDiff: Compact Parameter Space for Diffusion Fine-Tuning

1 code implementation ICCV 2023 Ligong Han, Yinxiao Li, Han Zhang, Peyman Milanfar, Dimitris Metaxas, Feng Yang

Diffusion models have achieved remarkable success in text-to-image generation, enabling the creation of high-quality images from text prompts or other modalities.

Data Augmentation Efficient Diffusion Personalization +1

MaxViT: Multi-Axis Vision Transformer

14 code implementations4 Apr 2022 Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Peyman Milanfar, Alan Bovik, Yinxiao Li

We also show that our proposed model expresses strong generative modeling capability on ImageNet, demonstrating the superior potential of MaxViT blocks as a universal vision module.

Image Classification object-detection +1

MAXIM: Multi-Axis MLP for Image Processing

1 code implementation CVPR 2022 Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Peyman Milanfar, Alan Bovik, Yinxiao Li

In this work, we present a multi-axis MLP based architecture called MAXIM, that can serve as an efficient and flexible general-purpose vision backbone for image processing tasks.

Deblurring Image Deblurring +6

COMISR: Compression-Informed Video Super-Resolution

2 code implementations ICCV 2021 Yinxiao Li, Pengchong Jin, Feng Yang, Ce Liu, Ming-Hsuan Yang, Peyman Milanfar

Most video super-resolution methods focus on restoring high-resolution video frames from low-resolution videos without taking into account compression.

Video Super-Resolution

PERF-Net: Pose Empowered RGB-Flow Net

no code implementations28 Sep 2020 Yinxiao Li, Zhichao Lu, Xuehan Xiong, Jonathan Huang

In recent years, many works in the video action recognition literature have shown that two stream models (combining spatial and temporal input streams) are necessary for achieving state of the art performance.

Action Classification Action Recognition +1

Handling Position Bias for Unbiased Learning to Rank in Hotels Search

no code implementations28 Feb 2020 Yinxiao Li

The online A/B test results show that this method leads to an improved search ranking model.

Learning-To-Rank Position +1

Model-Driven Feed-Forward Prediction for Manipulation of Deformable Objects

no code implementations15 Jul 2016 Yinxiao Li, Yan Wang, Yonghao Yue, Danfei Xu, Michael Case, Shih-Fu Chang, Eitan Grinspun, Peter Allen

A fully featured 3D model of the garment is constructed in real-time and volumetric features are then used to obtain the most similar model in the database to predict the object category and pose.

Object Pose Estimation +1

Articulated Pose Estimation Using Hierarchical Exemplar-Based Models

no code implementations13 Dec 2015 Jiongxin Liu, Yinxiao Li, Peter Allen, Peter Belhumeur

Exemplar-based models have achieved great success on localizing the parts of semi-rigid objects.

Pose Estimation

Cannot find the paper you are looking for? You can Submit a new open access paper.