Search Results for author: Sheng Zheng

Found 9 papers, 1 papers with code

Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation

no code implementations • 8 Mar 2024 • Joseph Cho, Fachrina Dewi Puspitasari, Sheng Zheng, Jingyao Zheng, Lik-Hang Lee, Tae-Ho Kim, Choong Seon Hong, Chaoning Zhang

Text-to-video generation marks a significant frontier in the rapidly evolving domain of generative AI, integrating advancements in text-to-image synthesis, video captioning, and text-guided editing.

Hallucination Image Generation +3

Paper
Add Code

MobileSAMv2: Faster Segment Anything to Everything

1 code implementation • 15 Dec 2023 • Chaoning Zhang, Dongshen Han, Sheng Zheng, Jinwoo Choi, Tae-Ho Kim, Choong Seon Hong

The efficiency bottleneck of SegEvery with SAM, however, lies in its mask decoder because it needs to first generate numerous masks with redundant grid-search prompts and then perform filtering to obtain the final valid masks.

Knowledge Distillation Object Discovery +1

4,289

Paper
Code

Segment Anything Meets Universal Adversarial Perturbation

no code implementations • 19 Oct 2023 • Dongshen Han, Sheng Zheng, Chaoning Zhang

On top of the ablation study to understand various components in our proposed method, we shed light on the roles of positive and negative samples in making the generated UAP effective for attacking SAM.

Adversarial Attack Adversarial Robustness +1

Paper
Add Code

Black-box Targeted Adversarial Attack on Segment Anything (SAM)

no code implementations • 16 Oct 2023 • Sheng Zheng, Chaoning Zhang, Xinhong Hao

The task of TAA on SAM has been realized in a recent arXiv work in the white-box setup by assuming access to prompt and model, which is thus less practical.

Adversarial Attack

Paper
Add Code

Understanding Segment Anything Model: SAM is Biased Towards Texture Rather than Shape

no code implementations • 3 Jun 2023 • Chaoning Zhang, Yu Qiao, Shehbaz Tariq, Sheng Zheng, Chenshuang Zhang, Chenghao Li, Hyundong Shin, Choong Seon Hong

Different from label-oriented recognition tasks, the SAM is trained to predict a mask for covering the object shape based on a promt.

Image Segmentation Semantic Segmentation

Paper
Add Code

A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering

no code implementations • 12 May 2023 • Chaoning Zhang, Fachrina Dewi Puspitasari, Sheng Zheng, Chenghao Li, Yu Qiao, Taegoo Kang, Xinru Shan, Chenshuang Zhang, Caiyan Qin, Francois Rameau, Lik-Hang Lee, Sung-Ho Bae, Choong Seon Hong

This is an ongoing project and we intend to update the manuscript on a regular basis.

Edge Detection Prompt Engineering

Paper
Add Code

One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era

no code implementations • 4 Apr 2023 • Chaoning Zhang, Chenshuang Zhang, Chenghao Li, Yu Qiao, Sheng Zheng, Sumit Kumar Dam, Mengchun Zhang, Jung Uk Kim, Seong Tae Kim, Jinwoo Choi, Gyeong-Moon Park, Sung-Ho Bae, Lik-Hang Lee, Pan Hui, In So Kweon, Choong Seon Hong

Overall, this work is the first to survey ChatGPT with a comprehensive review of its underlying technology, applications, and challenges.

Paper
Add Code

A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI

no code implementations • 23 Mar 2023 • Chenshuang Zhang, Chaoning Zhang, Sheng Zheng, Mengchun Zhang, Maryam Qamar, Sung-Ho Bae, In So Kweon

This work conducts a survey on audio diffusion model, which is complementary to existing surveys that either lack the recent progress of diffusion-based speech synthesis or highlight an overall picture of applying diffusion model in multiple fields.

Speech Enhancement Speech Synthesis +1

Paper
Add Code

A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?

no code implementations • 21 Mar 2023 • Chaoning Zhang, Chenshuang Zhang, Sheng Zheng, Yu Qiao, Chenghao Li, Mengchun Zhang, Sumit Kumar Dam, Chu Myaet Thwal, Ye Lin Tun, Le Luang Huy, Donguk Kim, Sung-Ho Bae, Lik-Hang Lee, Yang Yang, Heng Tao Shen, In So Kweon, Choong Seon Hong

As ChatGPT goes viral, generative AI (AIGC, a. k. a AI-generated content) has made headlines everywhere because of its ability to analyze and create text, images, and beyond.

Language Modelling

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.