2 code implementations • 8 Dec 2023 • Zigeng Chen, Gongfan Fang, Xinyin Ma, Xinchao Wang
To address this challenging trade-off, we introduce SlimSAM, a novel data-efficient SAM compression method that achieves superior performance with far less training data.
2 code implementations • 1 Dec 2023 • Xinyin Ma, Gongfan Fang, Xinchao Wang
Diffusion models have recently gained unprecedented attention in the field of image synthesis due to their remarkable generative capabilities.
1 code implementation • NeurIPS 2023 • Xinyin Ma, Gongfan Fang, Xinchao Wang
With LLMs serving as general-purpose task solvers, we explore their compression in a task-agnostic manner, aiming to preserve the multi-task solving and language generation abilities of the original LLM.
1 code implementation • NeurIPS 2023 • Gongfan Fang, Xinyin Ma, Xinchao Wang
Generative modeling has recently undergone remarkable advancements, primarily propelled by the transformative implications of Diffusion Probabilistic Models (DPMs).
1 code implementation • CVPR 2023 • Gongfan Fang, Xinyin Ma, Mingli Song, Michael Bi Mi, Xinchao Wang
Structural pruning enables model acceleration by removing structurally grouped parameters from neural networks.
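To illustrate what "structurally grouped" means here, the sketch below prunes hidden units of a tiny two-layer MLP: removing an output row of the first weight matrix forces removing the matching input column of the second, so both must be pruned as one group. The L2-norm importance score and the toy network are illustrative assumptions, not the criterion or architecture from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
W1 = rng.standard_normal((8, 4))   # layer 1: 4 inputs -> 8 hidden units
W2 = rng.standard_normal((3, 8))   # layer 2: 8 hidden -> 3 outputs

def prune_hidden_units(W1, W2, keep):
    """Keep the `keep` hidden units with the largest grouped norm.

    Each hidden unit couples a row of W1 with a column of W2, so the
    importance score (and the removal) spans both layers at once.
    """
    score = np.linalg.norm(W1, axis=1) + np.linalg.norm(W2, axis=0)
    idx = np.sort(np.argsort(score)[-keep:])
    return W1[idx, :], W2[:, idx]

W1p, W2p = prune_hidden_units(W1, W2, keep=5)
x = rng.standard_normal(4)
y = W2p @ np.maximum(W1p @ x, 0)   # pruned network still runs end to end
print(W1p.shape, W2p.shape, y.shape)
```

Because entire rows and columns are removed, the pruned matrices are genuinely smaller and faster to multiply, unlike unstructured (element-wise) sparsity, which leaves the dense shapes intact.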
1 code implementation • 27 Jul 2022 • Donglin Xie, Ruonan Yu, Gongfan Fang, Jie Song, Zunlei Feng, Xinchao Wang, Li Sun, Mingli Song
The goal of FedSA is to train a student model for a new task with the help of several decentralized teachers, whose pre-training tasks and data are different and agnostic.
no code implementations • 16 May 2022 • Xinyin Ma, Xinchao Wang, Gongfan Fang, Yongliang Shen, Weiming Lu
Data-free knowledge distillation (DFKD) conducts knowledge distillation by eliminating the dependence on the original training data, and has recently achieved impressive results in accelerating pre-trained language models.
1 code implementation • 7 Mar 2022 • Haofei Zhang, Feng Mao, Mengqi Xue, Gongfan Fang, Zunlei Feng, Jie Song, Mingli Song
Moreover, the transformer-based students excel at learning amalgamated knowledge: they rapidly master heterogeneous detection tasks and achieve performance superior or at least comparable to that of the teachers in their specializations.
2 code implementations • 12 Dec 2021 • Gongfan Fang, Kanya Mo, Xinchao Wang, Jie Song, Shitao Bei, Haofei Zhang, Mingli Song
At the heart of our approach is a novel strategy to reuse the shared common features in training data so as to synthesize different data instances.
2 code implementations • NeurIPS 2021 • Gongfan Fang, Yifan Bao, Jie Song, Xinchao Wang, Donglin Xie, Chengchao Shen, Mingli Song
Knowledge distillation (KD) aims to craft a compact student model that imitates the behavior of a pre-trained teacher in a target domain.
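As a minimal sketch of the imitation objective underlying KD, the function below computes the standard temperature-softened KL divergence between teacher and student logits (Hinton et al.'s formulation); the temperature value and NumPy implementation are illustrative assumptions, not the specific loss used in this paper.

```python
import numpy as np

def softmax(z, T=1.0):
    """Numerically stable softmax with temperature T."""
    z = z / T
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def kd_loss(student_logits, teacher_logits, T=4.0):
    """KL(teacher || student) on temperature-softened distributions.

    The T^2 factor keeps gradient magnitudes comparable across
    temperatures, as in the classic distillation formulation.
    """
    p = softmax(teacher_logits, T)
    q = softmax(student_logits, T)
    return T * T * np.sum(p * (np.log(p) - np.log(q)))

teacher = np.array([3.0, 1.0, 0.2])
student = np.array([2.5, 1.4, 0.1])
print(kd_loss(student, teacher))          # small positive penalty
print(kd_loss(teacher, teacher))          # 0.0 -- perfect imitation
```

A higher temperature softens both distributions, exposing the teacher's relative preferences among non-target classes ("dark knowledge") that a hard one-hot label would discard.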
3 code implementations • 18 May 2021 • Gongfan Fang, Jie Song, Xinchao Wang, Chengchao Shen, Xingen Wang, Mingli Song
In this paper, we propose Contrastive Model Inversion (CMI), where the data diversity is explicitly modeled as an optimizable objective, to alleviate the mode collapse issue.
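To show what "diversity as an optimizable objective" can look like, the sketch below scores a batch of synthesized samples' features by their mean pairwise cosine similarity; minimizing it pushes samples apart and so counteracts mode collapse. The feature space and the exact contrastive form used in CMI differ; this is only the general idea.

```python
import numpy as np

def diversity_loss(feats):
    """Mean pairwise cosine similarity of a batch of feature vectors.

    High values mean the synthesized samples have collapsed onto the
    same mode; gradient descent on this term spreads them apart.
    """
    f = feats / np.linalg.norm(feats, axis=1, keepdims=True)
    sim = f @ f.T                      # pairwise cosine similarities
    n = len(f)
    return sim[~np.eye(n, dtype=bool)].mean()  # exclude self-similarity

collapsed = np.ones((4, 8))            # every sample identical
spread = np.eye(4, 8)                  # mutually orthogonal samples
print(diversity_loss(collapsed))       # ~1.0: total mode collapse
print(diversity_loss(spread))          # 0.0: fully diverse batch
```

In practice such a term is added to the usual inversion losses (e.g. matching the teacher's batch-norm statistics and class predictions), so the optimizer must produce samples that are both realistic to the teacher and distinct from one another.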
no code implementations • EMNLP 2020 • Xinyin Ma, Yongliang Shen, Gongfan Fang, Chen Chen, Chenghao Jia, Weiming Lu
To the best of our knowledge, our framework is the first data-free distillation framework designed for NLP tasks.
no code implementations • 10 Jul 2020 • Gongfan Fang, Xinchao Wang, Haofei Zhang, Jie Song, Mingli Song
This network is referred to as the Template Network because its filters will be used as templates to reconstruct images from the impression.
3 code implementations • 23 Dec 2019 • Gongfan Fang, Jie Song, Chengchao Shen, Xinchao Wang, Da Chen, Mingli Song
Knowledge Distillation (KD) has made remarkable progress in the last few years and become a popular paradigm for model compression and knowledge transfer.
2 code implementations • 24 Jun 2019 • Sihui Luo, Xinchao Wang, Gongfan Fang, Yao Hu, Dapeng Tao, Mingli Song
An increasing number of well-trained deep networks have been released online by researchers and developers, enabling the community to reuse them in a plug-and-play way without accessing the training annotations.