Search Results for author: Yinxiao Li

Found 10 papers, 5 papers with code

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

no code implementations • 11 Jan 2024 • Seung Hyun Lee, Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu, Qifei Wang, Fei Deng, Glenn Entis, Junfeng He, Gang Li, Sangpil Kim, Irfan Essa, Feng Yang

Additionally, Parrot employs a joint optimization approach for the T2I model and the prompt expansion network, facilitating the generation of quality-aware text prompts, thus further enhancing the final image quality.

Reinforcement Learning (RL) • Text-to-Image Generation

SVDiff: Compact Parameter Space for Diffusion Fine-Tuning

1 code implementation • ICCV 2023 • Ligong Han, Yinxiao Li, Han Zhang, Peyman Milanfar, Dimitris Metaxas, Feng Yang

Diffusion models have achieved remarkable success in text-to-image generation, enabling the creation of high-quality images from text prompts or other modalities.

Data Augmentation • Efficient Diffusion Personalization • +1

354 GitHub stars

MaxViT: Multi-Axis Vision Transformer

14 code implementations • 4 Apr 2022 • Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Peyman Milanfar, Alan Bovik, Yinxiao Li

We also show that our proposed model expresses strong generative modeling capability on ImageNet, demonstrating the superior potential of MaxViT blocks as a universal vision module.

Ranked #1 on Object Detection on COCO 2017

Image Classification • object-detection • +1

29,676 GitHub stars

MAXIM: Multi-Axis MLP for Image Processing

1 code implementation • CVPR 2022 • Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Peyman Milanfar, Alan Bovik, Yinxiao Li

In this work, we present a multi-axis MLP-based architecture called MAXIM that can serve as an efficient and flexible general-purpose vision backbone for image processing tasks.

Ranked #1 on Deblurring on HIDE (trained on GOPRO)

Deblurring • Image Deblurring • +6

937 GitHub stars

COMISR: Compression-Informed Video Super-Resolution

2 code implementations • ICCV 2021 • Yinxiao Li, Pengchong Jin, Feng Yang, Ce Liu, Ming-Hsuan Yang, Peyman Milanfar

Most video super-resolution methods focus on restoring high-resolution video frames from low-resolution videos without taking into account compression.

Ranked #6 on Video Super-Resolution on MSU Super-Resolution for Video Compression

Video Super-Resolution

32,753 GitHub stars

PERF-Net: Pose Empowered RGB-Flow Net

no code implementations • 28 Sep 2020 • Yinxiao Li, Zhichao Lu, Xuehan Xiong, Jonathan Huang

In recent years, many works in the video action recognition literature have shown that two-stream models (combining spatial and temporal input streams) are necessary for achieving state-of-the-art performance.

Ranked #5 on Action Recognition on UCF101

Action Classification • Action Recognition • +1

Handling Position Bias for Unbiased Learning to Rank in Hotels Search

no code implementations • 28 Feb 2020 • Yinxiao Li

The online A/B test results show that this method leads to an improved search ranking model.

Learning-To-Rank • Position • +1

Looking Fast and Slow: Memory-Guided Mobile Video Object Detection

2 code implementations • 25 Mar 2019 • Mason Liu, Menglong Zhu, Marie White, Yinxiao Li, Dmitry Kalenichenko

Models and examples built with TensorFlow

Ranked #30 on Video Object Detection on ImageNet VID (using extra training data)

object-detection • Object Recognition • +2

76,582 GitHub stars

Model-Driven Feed-Forward Prediction for Manipulation of Deformable Objects

no code implementations • 15 Jul 2016 • Yinxiao Li, Yan Wang, Yonghao Yue, Danfei Xu, Michael Case, Shih-Fu Chang, Eitan Grinspun, Peter Allen

A fully featured 3D model of the garment is constructed in real time, and volumetric features are then used to retrieve the most similar model in the database and so predict the object category and pose.

Object Pose Estimation +1

Articulated Pose Estimation Using Hierarchical Exemplar-Based Models

no code implementations • 13 Dec 2015 • Jiongxin Liu, Yinxiao Li, Peter Allen, Peter Belhumeur

Exemplar-based models have achieved great success on localizing the parts of semi-rigid objects.

Pose Estimation
