Search Results for author: Ziheng Wu

Found 10 papers, 4 papers with code

BeautifulPrompt: Towards Automatic Prompt Engineering for Text-to-Image Synthesis

no code implementations • 12 Nov 2023 • Tingfeng Cao, Chengyu Wang, Bingyan Liu, Ziheng Wu, Jinhui Zhu, Jun Huang

Then, to ensure that our generated prompts can generate more beautiful images, we further propose a Reinforcement Learning with Visual AI Feedback technique to fine-tune our model to maximize the reward values of the generated prompts, where the reward values are calculated based on the PickScore and the Aesthetic Scores.

Prompt Engineering Text-to-Image Generation

Paper
Add Code

Hierarchical Side-Tuning for Vision Transformers

no code implementations • 9 Oct 2023 • Weifeng Lin, Ziheng Wu, Jiayu Chen, Wentao Yang, Mingxin Huang, Jun Huang, Lianwen Jin

Fine-tuning pre-trained Vision Transformers (ViT) has consistently demonstrated promising performance in the realm of visual recognition.

Instance Segmentation object-detection +4

Paper
Add Code

EasyPhoto: Your Smart AI Photo Generator

2 code implementations • 7 Oct 2023 • Ziheng Wu, Jiaqi Xu, Xinyi Zou, Kunzhe Huang, Xing Shi, Jun Huang

By training a digital doppelganger of a specific user ID using 5 to 20 relevant images, the finetuned model (according to the trained LoRA model) allows for the generation of AI photos using arbitrary templates.

4,569

Paper
Code

DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion

no code implementations • 21 Sep 2023 • Zhenzhen Chu, Jiayu Chen, Cen Chen, Chengyu Wang, Ziheng Wu, Jun Huang, Weining Qian

Position-aware global tokens also contain the position information of the image, which makes our model better for vision tasks.

Image Classification object-detection +3

Paper
Add Code

FaceChain: A Playground for Human-centric Artificial Intelligence Generated Content

1 code implementation • 28 Aug 2023 • Yang Liu, Cheng Yu, Lei Shang, Yongyi He, Ziheng Wu, Xingjun Wang, Chao Xu, Haoyu Xie, Weida Wang, Yuze Zhao, Lin Zhu, Chen Cheng, Weitao Chen, Yuan YAO, Wenmeng Zhou, Jiaqi Xu, Qiang Wang, Yingda Chen, Xuansong Xie, Baigui Sun

In this paper, we present FaceChain, a personalized portrait generation framework that combines a series of customized image-generation model and a rich set of face-related perceptual understanding models (\eg, face detection, deep face embedding extraction, and facial attribute recognition), to tackle aforementioned challenges and to generate truthful personalized portraits, with only a handful of portrait images as input.

Attribute Potrait Generation +1

8,349

Paper
Code

DiffSynth: Latent In-Iteration Deflickering for Realistic Video Synthesis

no code implementations • 7 Aug 2023 • Zhongjie Duan, Lizhou You, Chengyu Wang, Cen Chen, Ziheng Wu, Weining Qian, Jun Huang

In recent years, diffusion models have emerged as the most powerful approach in image synthesis.

Image Generation

Paper
Add Code

Scale-Aware Modulation Meet Transformer

1 code implementation • ICCV 2023 • Weifeng Lin, Ziheng Wu, Jiayu Chen, Jun Huang, Lianwen Jin

Specifically, SMT with 11. 5M / 2. 4GFLOPs and 32M / 7. 7GFLOPs can achieve 82. 2% and 84. 3% top-1 accuracy on ImageNet-1K, respectively.

object-detection Object Detection +1

167

Paper
Code

SC-ML: Self-supervised Counterfactual Metric Learning for Debiased Visual Question Answering

no code implementations • 4 Apr 2023 • Xinyao Shu, ShiYang Yan, Xu Yang, Ziheng Wu, Zhongfeng Chen, Zhenyu Lu

Unfortunately, language bias is a common problem in VQA, which refers to the model generating answers only by associating with the questions while ignoring the visual content, resulting in biased results.

counterfactual Metric Learning +2

Paper
Add Code

YOLOX-PAI: An Improved YOLOX, Stronger and Faster than YOLOv6

3 code implementations • 27 Aug 2022 • Ziheng Wu, Xinyi Zou, Wenmeng Zhou, Jun Huang

We develop an all-in-one computer vision toolbox named EasyCV to facilitate the use of various SOTA computer vision methods.

object-detection Object Detection

6,085

Paper
Code

Elastic-Link for Binarized Neural Network

no code implementations • 19 Dec 2021 • Jie Hu, Ziheng Wu, Vince Tan, Zhilin Lu, Mengze Zeng, Enhua Wu

For example, we raise the top-1 accuracy of binarized ResNet26 from 57. 9% to 64. 0%.

Binarization

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.