Search Results for author: Mingjian Zhu

Found 7 papers, 5 papers with code

GenDet: Towards Good Generalizations for AI-Generated Image Detection

1 code implementation 12 Dec 2023 Mingjian Zhu, Hanting Chen, Mouxiao Huang, Wei Li, Hailin Hu, Jie Hu, Yunhe Wang

The misuse of AI imagery can have harmful societal effects, prompting the creation of detectors to combat issues like the spread of fake news.

Anomaly Detection

Dynamic Resolution Network

3 code implementations NeurIPS 2021 Mingjian Zhu, Kai Han, Enhua Wu, Qiulin Zhang, Ying Nie, Zhenzhong Lan, Yunhe Wang

To this end, we propose a novel dynamic-resolution network (DRNet) in which the input resolution is determined dynamically based on each input sample.

Vision Transformer Pruning

2 code implementations 17 Apr 2021 Mingjian Zhu, Yehui Tang, Kai Han

Vision transformers have achieved competitive performance on a variety of computer vision applications.

Video Captioning in Compressed Video

no code implementations 2 Jan 2021 Mingjian Zhu, Chenrui Duan, Changbin Yu

We propose a video captioning method which operates directly on the stored compressed videos.

Caption Generation, Video Captioning

Dynamic Feature Pyramid Networks for Object Detection

1 code implementation 1 Dec 2020 Mingjian Zhu, Kai Han, Changbin Yu, Yunhe Wang

One way to enhance the FPN is to enrich its spatial information by expanding the receptive fields, which promises to substantially improve detection accuracy.

Object, object-detection, +1 more

Crowd Video Captioning

no code implementations 13 Nov 2019 Liqi Yan, Mingjian Zhu, Changbin Yu

Because deploying reporters at entrances and exits costs considerable manpower, automatically describing the behavior of a crowd of off-site spectators is an important and still open problem.

Video Captioning

Attribute-Aware Attention Model for Fine-grained Representation Learning

1 code implementation 2 Jan 2019 Kai Han, Jianyuan Guo, Chao Zhang, Mingjian Zhu

Based on the considerations above, we propose a novel Attribute-Aware Attention Model ($A^3M$), which can learn local attribute representation and global category representation simultaneously in an end-to-end manner.

Attribute, Fine-Grained Image Classification, +4 more
