Search Results for author: Bir Bhanu

Found 14 papers, 3 papers with code

RepSGG: Novel Representations of Entities and Relationships for Scene Graph Generation

no code implementations6 Sep 2023 Hengyue Liu, Bir Bhanu

As each representation's cardinality has different trade-offs between performance and computation overhead, extracting highly representative features efficiently and dynamically is both challenging and crucial for SGG.

Graph Generation Scene Graph Generation

RECLIP: Resource-efficient CLIP by Training with Small Images

no code implementations12 Apr 2023 Runze Li, Dahun Kim, Bir Bhanu, Weicheng Kuo

We present RECLIP (Resource-efficient CLIP), a simple method that minimizes computational resource footprint for CLIP (Contrastive Language Image Pretraining).

Contrastive Learning Retrieval +3

MonoIndoor++:Towards Better Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments

no code implementations18 Jul 2022 Runze Li, Pan Ji, Yi Xu, Bir Bhanu

As compared to outdoor environments, estimating depth of monocular videos for indoor environments, using self-supervised methods, results in two additional challenges: (i) the depth range of indoor video sequences varies a lot across different frames, making it difficult for the depth network to induce consistent depth cues for training; (ii) the indoor sequences recorded with handheld devices often contain much more rotational motions, which cause difficulties for the pose network to predict accurate relative camera poses.

Depth Prediction Monocular Depth Estimation +1

Dite-HRNet: Dynamic Lightweight High-Resolution Network for Human Pose Estimation

1 code implementation22 Apr 2022 Qun Li, Ziyi Zhang, Fu Xiao, Feng Zhang, Bir Bhanu

A high-resolution network exhibits remarkable capability in extracting multi-scale features for human pose estimation, but fails to capture long-range interactions between joints and has high computational complexity.

Pose Estimation Vocal Bursts Intensity Prediction

Ada-VSR: Adaptive Video Super-Resolution with Meta-Learning

no code implementations5 Aug 2021 Akash Gupta, Padmaja Jonnalagedda, Bir Bhanu, Amit K. Roy-Chowdhury

Specifically, meta-learning is employed to obtain adaptive parameters, using a large-scale external dataset, that can adapt quickly to the novel condition (degradation model) of the given test video during the internal learning task, thereby exploiting external and internal information of a video for super-resolution.

Meta-Learning Transfer Learning +1

Learning Local Recurrent Models for Human Mesh Recovery

no code implementations27 Jul 2021 Runze Li, Srikrishna Karanam, Ren Li, Terrence Chen, Bir Bhanu, Ziyan Wu

We conduct a variety of experiments on standard video mesh recovery benchmark datasets such as Human3. 6M, MPI-INF-3DHP, and 3DPW, demonstrating the efficacy of our design of modeling local dynamics as well as establishing state-of-the-art results based on standard evaluation metrics.

3D Human Pose Estimation 3D Human Shape Estimation +1

MonoIndoor: Towards Good Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments

no code implementations ICCV 2021 Pan Ji, Runze Li, Bir Bhanu, Yi Xu

The effectiveness of each module is shown through a carefully conducted ablation study and the demonstration of the state-of-the-art performance on three indoor datasets, \ie, EuRoC, NYUv2, and 7-scenes.

Monocular Depth Estimation Pose Estimation

Fully Convolutional Scene Graph Generation

1 code implementation CVPR 2021 Hengyue Liu, Ning Yan, Masood S. Mortazavi, Bir Bhanu

This paper presents a fully convolutional scene graph generation (FCSGG) model that detects objects and relations simultaneously.

Graph Generation Scene Graph Generation

Dynamically Throttleable Neural Networks (TNN)

no code implementations1 Nov 2020 Hengyue Liu, Samyak Parajuli, Jesse Hostetler, Sek Chai, Bir Bhanu

Conditional computation for Deep Neural Networks (DNNs) reduce overall computational load and improve model accuracy by running a subset of the network.

SAGE: Sequential Attribute Generator for Analyzing Glioblastomas using Limited Dataset

no code implementations14 May 2020 Padmaja Jonnalagedda, Brent Weinberg, Jason Allen, Taejin L. Min, Shiv Bhanu, Bir Bhanu

While deep learning approaches have shown remarkable performance in many imaging tasks, most of these methods rely on availability of large quantities of data.

Attribute

Towards Visually Explaining Variational Autoencoders

2 code implementations CVPR 2020 Wenqian Liu, Runze Li, Meng Zheng, Srikrishna Karanam, Ziyan Wu, Bir Bhanu, Richard J. Radke, Octavia Camps

We present methods to generate visual attention from the learned latent space, and also demonstrate such attention explanations serve more than just explaining VAE predictions.

Disentanglement

An Online Learned Elementary Grouping Model for Multi-target Tracking

no code implementations CVPR 2014 Xiaojing Chen, Zhen Qin, Le An, Bir Bhanu

We introduce an online approach to learn possible elementary groups (groups that contain only two targets) for inferring high level context that can be used to improve multi-target tracking in a data-association based framework.

Cannot find the paper you are looking for? You can Submit a new open access paper.