Search Results for author: Bir Bhanu

Found 14 papers, 3 papers with code

RepSGG: Novel Representations of Entities and Relationships for Scene Graph Generation

no code implementations • 6 Sep 2023 • Hengyue Liu, Bir Bhanu

As each representation's cardinality has different trade-offs between performance and computation overhead, extracting highly representative features efficiently and dynamically is both challenging and crucial for SGG.

Graph Generation Scene Graph Generation

Paper
Add Code

RECLIP: Resource-efficient CLIP by Training with Small Images

no code implementations • 12 Apr 2023 • Runze Li, Dahun Kim, Bir Bhanu, Weicheng Kuo

We present RECLIP (Resource-efficient CLIP), a simple method that minimizes computational resource footprint for CLIP (Contrastive Language Image Pretraining).

Contrastive Learning Retrieval +3

Paper
Add Code

MonoIndoor++:Towards Better Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments

no code implementations • 18 Jul 2022 • Runze Li, Pan Ji, Yi Xu, Bir Bhanu

As compared to outdoor environments, estimating depth of monocular videos for indoor environments, using self-supervised methods, results in two additional challenges: (i) the depth range of indoor video sequences varies a lot across different frames, making it difficult for the depth network to induce consistent depth cues for training; (ii) the indoor sequences recorded with handheld devices often contain much more rotational motions, which cause difficulties for the pose network to predict accurate relative camera poses.

Depth Prediction Monocular Depth Estimation +1

Paper
Add Code

Dite-HRNet: Dynamic Lightweight High-Resolution Network for Human Pose Estimation

1 code implementation • 22 Apr 2022 • Qun Li, Ziyi Zhang, Fu Xiao, Feng Zhang, Bir Bhanu

A high-resolution network exhibits remarkable capability in extracting multi-scale features for human pose estimation, but fails to capture long-range interactions between joints and has high computational complexity.

Ranked #30 on Pose Estimation on COCO test-dev

Pose Estimation Vocal Bursts Intensity Prediction

Paper
Code

Transferring Knowledge with Attention Distillation for Multi-Domain Image-to-Image Translation

no code implementations • 17 Aug 2021 • Runze Li, Tomaso Fontanini, Luca Donati, Andrea Prati, Bir Bhanu

Gradient-based attention modeling has been used widely as a way to visualize and understand convolutional neural networks.

Image-to-Image Translation Translation

Paper
Add Code

Ada-VSR: Adaptive Video Super-Resolution with Meta-Learning

no code implementations • 5 Aug 2021 • Akash Gupta, Padmaja Jonnalagedda, Bir Bhanu, Amit K. Roy-Chowdhury

Specifically, meta-learning is employed to obtain adaptive parameters, using a large-scale external dataset, that can adapt quickly to the novel condition (degradation model) of the given test video during the internal learning task, thereby exploiting external and internal information of a video for super-resolution.

Meta-Learning Transfer Learning +1

Paper
Add Code

Learning Local Recurrent Models for Human Mesh Recovery

no code implementations • 27 Jul 2021 • Runze Li, Srikrishna Karanam, Ren Li, Terrence Chen, Bir Bhanu, Ziyan Wu

We conduct a variety of experiments on standard video mesh recovery benchmark datasets such as Human3. 6M, MPI-INF-3DHP, and 3DPW, demonstrating the efficacy of our design of modeling local dynamics as well as establishing state-of-the-art results based on standard evaluation metrics.

Ranked #44 on 3D Human Pose Estimation on 3DPW

3D Human Pose Estimation 3D Human Shape Estimation +1

Paper
Add Code

MonoIndoor: Towards Good Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments

no code implementations • ICCV 2021 • Pan Ji, Runze Li, Bir Bhanu, Yi Xu

The effectiveness of each module is shown through a carefully conducted ablation study and the demonstration of the state-of-the-art performance on three indoor datasets, \ie, EuRoC, NYUv2, and 7-scenes.

Ranked #3 on Monocular Depth Estimation on NYU-Depth V2 self-supervised

Monocular Depth Estimation Pose Estimation

Paper
Add Code

Fully Convolutional Scene Graph Generation

1 code implementation • CVPR 2021 • Hengyue Liu, Ning Yan, Masood S. Mortazavi, Bir Bhanu

This paper presents a fully convolutional scene graph generation (FCSGG) model that detects objects and relations simultaneously.

Graph Generation Scene Graph Generation

Paper
Code

Dynamically Throttleable Neural Networks (TNN)

no code implementations • 1 Nov 2020 • Hengyue Liu, Samyak Parajuli, Jesse Hostetler, Sek Chai, Bir Bhanu

Conditional computation for Deep Neural Networks (DNNs) reduce overall computational load and improve model accuracy by running a subset of the network.

Paper
Add Code

SAGE: Sequential Attribute Generator for Analyzing Glioblastomas using Limited Dataset

no code implementations • 14 May 2020 • Padmaja Jonnalagedda, Brent Weinberg, Jason Allen, Taejin L. Min, Shiv Bhanu, Bir Bhanu

While deep learning approaches have shown remarkable performance in many imaging tasks, most of these methods rely on availability of large quantities of data.

Attribute

Paper
Add Code

Feature Disentanglement to Aid Imaging Biomarker Characterization for Genetic Mutations

no code implementations • MIDL 2019 • Padmaja Jonnalagedda, Brent Weinberg, Jason Allen, Bir Bhanu

Various mutations have been shown to correlate with prognosis of High-Grade Glioma (Glioblastoma).

Disentanglement Generative Adversarial Network

Paper
Add Code

Towards Visually Explaining Variational Autoencoders

2 code implementations • CVPR 2020 • Wenqian Liu, Runze Li, Meng Zheng, Srikrishna Karanam, Ziyan Wu, Bir Bhanu, Richard J. Radke, Octavia Camps

We present methods to generate visual attention from the learned latent space, and also demonstrate such attention explanations serve more than just explaining VAE predictions.

Disentanglement

Paper
Code

An Online Learned Elementary Grouping Model for Multi-target Tracking

no code implementations • CVPR 2014 • Xiaojing Chen, Zhen Qin, Le An, Bir Bhanu

We introduce an online approach to learn possible elementary groups (groups that contain only two targets) for inferring high level context that can be used to improve multi-target tracking in a data-association based framework.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.