Search Results for author: Zhiwei Deng

Found 25 papers, 11 papers with code

Distributional Dataset Distillation with Subtask Decomposition

1 code implementation1 Mar 2024 Tian Qin, Zhiwei Deng, David Alvarez-Melis

What does a neural network learn when training from a task-specific dataset?

Perceptual Group Tokenizer: Building Perception with Iterative Grouping

no code implementations30 Nov 2023 Zhiwei Deng, Ting Chen, Yang Li

In this paper, we propose the Perceptual Group Tokenizer, a model that entirely relies on grouping operations to extract visual features and perform self-supervised representation learning, where a series of grouping operations are used to iteratively hypothesize the context for pixels or superpixels to refine feature representations.

Representation Learning Self-Supervised Image Classification +2

Unseen Image Synthesis with Diffusion Models

no code implementations13 Oct 2023 Ye Zhu, Yu Wu, Zhiwei Deng, Olga Russakovsky, Yan Yan

While the current trend in the generative field is scaling up towards larger models and more training data for generalized domain representations, we go the opposite direction in this work by synthesizing unseen domain images without additional training.

Denoising Image Generation

A Zero-Shot Language Agent for Computer Control with Structured Reflection

no code implementations12 Oct 2023 Tao Li, Gang Li, Zhiwei Deng, Bryan Wang, Yang Li

To perform a task, recent works often require a model to learn from trace examples of the task via either supervised learning or few/many-shot prompting.

Management

Vision-Language Dataset Distillation

2 code implementations15 Aug 2023 Xindi Wu, Byron Zhang, Zhiwei Deng, Olga Russakovsky

In this work, we design the first vision-language dataset distillation method, building on the idea of trajectory matching.

Image Classification Image-to-Text Retrieval +2

Remember the Past: Distilling Datasets into Addressable Memories for Neural Networks

2 code implementations6 Jun 2022 Zhiwei Deng, Olga Russakovsky

We propose an algorithm that compresses the critical information of a large dataset into compact addressable memories.

Continual Learning

Adaptive Appearance Rendering

1 code implementation24 Apr 2021 Mengyao Zhai, Ruizhi Deng, Jiacheng Chen, Lei Chen, Zhiwei Deng, Greg Mori

Hence, we develop an approach based on intermediate representations of poses and appearance: our pose-guided appearance rendering network firstly encodes the targets' poses using an encoder-decoder neural network.

Video Generation

BabyWalk: Going Farther in Vision-and-Language Navigation by Taking Baby Steps

1 code implementation ACL 2020 Wang Zhu, Hexiang Hu, Jiacheng Chen, Zhiwei Deng, Vihan Jain, Eugene Ie, Fei Sha

To this end, we propose BabyWalk, a new VLN agent that is learned to navigate by decomposing long instructions into shorter ones (BabySteps) and completing them sequentially.

Imitation Learning Navigate +1

Take the Scenic Route: Improving Generalization in Vision-and-Language Navigation

no code implementations31 Mar 2020 Felix Yu, Zhiwei Deng, Karthik Narasimhan, Olga Russakovsky

In the Vision-and-Language Navigation (VLN) task, an agent with egocentric vision navigates to a destination given natural language instructions.

Vision and Language Navigation

Policy Message Passing: A New Algorithm for Probabilistic Graph Inference

no code implementations ICLR 2020 Zhiwei Deng, Greg Mori

A general graph-structured neural network architecture operates on graphs through two core components: (1) complex enough message functions; (2) a fixed information aggregation process.

Continuous Graph Flow

no code implementations7 Aug 2019 Zhiwei Deng, Megha Nawhal, Lili Meng, Greg Mori

In this paper, we propose Continuous Graph Flow, a generative continuous flow based method that aims to model complex distributions of graph-structured data.

Density Estimation Graph Generation

Structured Label Inference for Visual Understanding

1 code implementation18 Feb 2018 Nelson Nauata, Hexiang Hu, Guang-Tong Zhou, Zhiwei Deng, Zicheng Liao, Greg Mori

In this paper, we exploit this rich structure for performing graph-based inference in label space for a number of tasks: multi-label image and video classification and action detection in untrimmed videos.

Action Detection General Classification +3

Sparsely Aggregated Convolutional Networks

2 code implementations ECCV 2018 Ligeng Zhu, Ruizhi Deng, Michael Maire, Zhiwei Deng, Greg Mori, Ping Tan

We explore a key architectural aspect of deep convolutional neural networks: the pattern of internal skip connections used to aggregate outputs of earlier layers for consumption by deeper layers.

Factorized Variational Autoencoders for Modeling Audience Reactions to Movies

no code implementations CVPR 2017 Zhiwei Deng, Rajitha Navarathna, Peter Carr, Stephan Mandt, Yisong Yue, Iain Matthews, Greg Mori

Matrix and tensor factorization methods are often used for finding underlying low-dimensional patterns from noisy data.

Active Learning for Structured Prediction from Partially Labeled Data

no code implementations7 Jun 2017 Mehran Khodabandeh, Zhiwei Deng, Mostafa S. Ibrahim, Shinichi Satoh, Greg Mori

We propose a general purpose active learning algorithm for structured prediction, gathering labeled data for training a model that outputs a set of related labels for an image or video.

Active Learning Structured Prediction

Generic Tubelet Proposals for Action Localization

no code implementations30 May 2017 Jiawei He, Mostafa S. Ibrahim, Zhiwei Deng, Greg Mori

Our class-independent TPN outperforms other tubelet generation methods, and our unified temporal deep network achieves state-of-the-art localization results on all three datasets.

Action Classification Action Localization +1

LabelBank: Revisiting Global Perspectives for Semantic Segmentation

1 code implementation29 Mar 2017 Hexiang Hu, Zhiwei Deng, Guang-Tong Zhou, Fei Sha, Greg Mori

We advocate that holistic inference of image concepts provides valuable information for detailed pixel labeling.

Segmentation Semantic Segmentation

Recalling Holistic Information for Semantic Segmentation

no code implementations24 Nov 2016 Hexiang Hu, Zhiwei Deng, Guang-Tong Zhou, Fei Sha, Greg Mori

We advocate that high-recall holistic inference of image concepts provides valuable information for detailed pixel labeling.

Segmentation Semantic Segmentation

Hierarchical Deep Temporal Models for Group Activity Recognition

1 code implementation9 Jul 2016 Mostafa S. Ibrahim, Srikanth Muralidharan, Zhiwei Deng, Arash Vahdat, Greg Mori

In order to model both person-level and group-level dynamics, we present a 2-stage deep temporal model for the group activity recognition problem.

Group Activity Recognition

A Hierarchical Deep Temporal Model for Group Activity Recognition

1 code implementation CVPR 2016 Moustafa Ibrahim, Srikanth Muralidharan, Zhiwei Deng, Arash Vahdat, Greg Mori

In group activity recognition, the temporal dynamics of the whole activity can be inferred based on the dynamics of the individual people representing the activity.

Group Activity Recognition

Deep Structured Models For Group Activity Recognition

no code implementations12 Jun 2015 Zhiwei Deng, Mengyao Zhai, Lei Chen, Yuhao Liu, Srikanth Muralidharan, Mehrsan Javan Roshtkhari, Greg Mori

This paper presents a deep neural-network-based hierarchical graphical model for individual and group activity recognition in surveillance scenes.

Group Activity Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.