Search Results for author: Rakesh Ranjan

Found 28 papers, 8 papers with code

UnSAMFlow: Unsupervised Optical Flow Guided by Segment Anything Model

1 code implementation • 4 May 2024 • Shuai Yuan, Lei Luo, Zhuo Hui, Can Pu, Xiaoyu Xiang, Rakesh Ranjan, Denis Demandolx

Traditional unsupervised optical flow methods are vulnerable to occlusions and motion boundaries due to lack of object-level information.

Object Optical Flow Estimation

Paper
Code

CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians

no code implementations • 28 Mar 2024 • Avinash Paliwal, Wei Ye, Jinhui Xiong, Dmytro Kotovenko, Rakesh Ranjan, Vikas Chandra, Nima Khademi Kalantari

The field of 3D reconstruction from images has rapidly evolved in the past few years, first with the introduction of Neural Radiance Field (NeRF) and more recently with 3D Gaussian Splatting (3DGS).

3D Reconstruction Decoder +1

Paper
Add Code

Towards Image Ambient Lighting Normalization

no code implementations • 27 Mar 2024 • Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu, Rakesh Ranjan, Radu Timofte

However, existing works often simplify this task within the context of shadow removal, limiting the light sources to one and oversimplifying the scene, thus excluding complex self-shadows and restricting surface classes to smooth ones.

Benchmarking Image Restoration +1

Paper
Add Code

Garment3DGen: 3D Garment Stylization and Texture Generation

no code implementations • 27 Mar 2024 • Nikolaos Sarafianos, Tuur Stuyck, Xiaoyu Xiang, Yilei Li, Jovan Popovic, Rakesh Ranjan

We present a plethora of quantitative and qualitative comparisons on various assets both real and generated and provide use-cases of how one can generate simulation-ready 3D garments.

Image to 3D Texture Synthesis

Paper
Add Code

MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction

no code implementations • 20 Feb 2024 • Shitao Tang, Jiacheng Chen, Dilin Wang, Chengzhou Tang, Fuyang Zhang, Yuchen Fan, Vikas Chandra, Yasutaka Furukawa, Rakesh Ranjan

MVDiffusion++ achieves superior flexibility and scalability with two surprisingly simple ideas: 1) A ``pose-free architecture'' where standard self-attention among 2D latent features learns 3D consistency across an arbitrary number of conditional and generation views without explicitly using camera pose information; and 2) A ``view dropout strategy'' that discards a substantial number of output views during training, which reduces the training-time memory footprint and enables dense and high-resolution view synthesis at test time.

3D Object Reconstruction 3D Reconstruction +2

Paper
Add Code

Key-Graph Transformer for Image Restoration

no code implementations • 4 Feb 2024 • Bin Ren, Yawei Li, Jingyun Liang, Rakesh Ranjan, Mengyuan Liu, Rita Cucchiara, Luc van Gool, Nicu Sebe

While it is crucial to capture global information for effective image restoration (IR), integrating such cues into transformer-based methods becomes computationally expensive, especially with high input resolution.

Graph Attention Image Restoration

Paper
Add Code

Geometry Transfer for Stylizing Radiance Fields

no code implementations • 1 Feb 2024 • Hyunyoung Jung, Seonghyeon Nam, Nikolaos Sarafianos, Sungjoo Yoo, Alexander Sorkine-Hornung, Rakesh Ranjan

Shape and geometric patterns are essential in defining stylistic identity.

Style Transfer

Paper
Add Code

SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity

no code implementations • 31 Dec 2023 • Peihao Wang, Zhiwen Fan, Dejia Xu, Dilin Wang, Sreyas Mohan, Forrest Iandola, Rakesh Ranjan, Yilei Li, Qiang Liu, Zhangyang Wang, Vikas Chandra

In this paper, we reveal that the gradient estimation in score distillation is inherent to high variance.

Text to 3D

Paper
Add Code

Taming Mode Collapse in Score Distillation for Text-to-3D Generation

no code implementations • 31 Dec 2023 • Peihao Wang, Dejia Xu, Zhiwen Fan, Dilin Wang, Sreyas Mohan, Forrest Iandola, Rakesh Ranjan, Yilei Li, Qiang Liu, Zhangyang Wang, Vikas Chandra

In this paper, we reveal that the existing score distillation-based text-to-3D generation frameworks degenerate to maximal likelihood seeking on each view independently and thus suffer from the mode collapse problem, manifesting as the Janus artifact in practice.

3D Generation Prompt Engineering +1

Paper
Add Code

PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar

no code implementations • 21 Dec 2023 • Tzofi Klinghoffer, Xiaoyu Xiang, Siddharth Somasundaram, Yuchen Fan, Christian Richardt, Ramesh Raskar, Rakesh Ranjan

3D reconstruction from a single-view is challenging because of the ambiguity from monocular cues and lack of information about occluded regions.

3D Reconstruction Single-View 3D Reconstruction

Paper
Add Code

Training Neural Networks on RAW and HDR Images for Restoration Tasks

1 code implementation • 6 Dec 2023 • Lei Luo, ALEXANDRE CHAPIRO, Xiaoyu Xiang, Yuchen Fan, Rakesh Ranjan, Rafal Mantiuk

Our results indicate that neural networks train significantly better on HDR and RAW images represented in display-encoded color spaces, which offer better perceptual uniformity than linear spaces.

Deblurring Denoising +2

Paper
Code

MoVideo: Motion-Aware Video Generation with Diffusion Models

no code implementations • 19 Nov 2023 • Jingyun Liang, Yuchen Fan, Kai Zhang, Radu Timofte, Luc van Gool, Rakesh Ranjan

While recent years have witnessed great progress on using diffusion models for video generation, most of them are simple extensions of image generation frameworks, which fail to explicitly consider one of the key differences between videos and images, i. e., motion.

Image Generation Image to Video Generation +1

Paper
Add Code

NOVIS: A Case for End-to-End Near-Online Video Instance Segmentation

no code implementations • 29 Aug 2023 • Tim Meinhardt, Matt Feiszli, Yuchen Fan, Laura Leal-Taixe, Rakesh Ranjan

Until recently, the Video Instance Segmentation (VIS) community operated under the common belief that offline methods are generally superior to a frame by frame online processing.

Ranked #5 on Video Instance Segmentation on YouTube-VIS validation (using extra training data)

Instance Segmentation Segmentation +2

Paper
Add Code

MMG-Ego4D: Multi-Modal Generalization in Egocentric Action Recognition

no code implementations • 12 May 2023 • Xinyu Gong, Sreyas Mohan, Naina Dhingra, Jean-Charles Bazin, Yilei Li, Zhangyang Wang, Rakesh Ranjan

In this paper, we study a novel problem in egocentric action recognition, which we term as "Multimodal Generalization" (MMG).

Action Recognition Egocentric Activity Recognition +1

Paper
Add Code

Learning Neural Duplex Radiance Fields for Real-Time View Synthesis

no code implementations • CVPR 2023 • Ziyu Wan, Christian Richardt, Aljaž Božič, Chao Li, Vijay Rengarajan, Seonghyeon Nam, Xiaoyu Xiang, Tuotuo Li, Bo Zhu, Rakesh Ranjan, Jing Liao

Neural radiance fields (NeRFs) enable novel view synthesis with unprecedented visual quality.

Novel View Synthesis Surface Reconstruction

Paper
Add Code

AnyFlow: Arbitrary Scale Optical Flow with Implicit Neural Representation

no code implementations • CVPR 2023 • Hyunyoung Jung, Zhuo Hui, Lei Luo, Haitao Yang, Feng Liu, Sungjoo Yoo, Rakesh Ranjan, Denis Demandolx

To apply optical flow in practice, it is often necessary to resize the input to smaller dimensions in order to reduce computational costs.

Optical Flow Estimation

Paper
Add Code

Efficient and Explicit Modelling of Image Hierarchies for Image Restoration

1 code implementation • CVPR 2023 • Yawei Li, Yuchen Fan, Xiaoyu Xiang, Denis Demandolx, Rakesh Ranjan, Radu Timofte, Luc van Gool

The aim of this paper is to propose a mechanism to efficiently and explicitly model image hierarchies in the global, regional, and local range for image restoration.

Ranked #1 on Image Defocus Deblurring on DPD (Dual-view)

Image Deblurring Image Defocus Deblurring +1

336

Paper
Code

MMG-Ego4D: Multimodal Generalization in Egocentric Action Recognition

no code implementations • CVPR 2023 • Xinyu Gong, Sreyas Mohan, Naina Dhingra, Jean-Charles Bazin, Yilei Li, Zhangyang Wang, Rakesh Ranjan

In this paper, we study a novel problem in egocentric action recognition, which we term as "Multimodal Generalization" (MMG).

Action Recognition Zero-shot Generalization

Paper
Add Code

FSID: Fully Synthetic Image Denoising via Procedural Scene Generation

1 code implementation • 7 Dec 2022 • Gyeongmin Choe, Beibei Du, Seonghyeon Nam, Xiaoyu Xiang, Bo Zhu, Rakesh Ranjan

To address this, we have developed a procedural synthetic data generation pipeline and dataset tailored to low-level vision tasks.

Image Denoising Scene Generation +1

2,022

Paper
Code

Fast Point Cloud Generation with Straight Flows

1 code implementation • CVPR 2023 • Lemeng Wu, Dilin Wang, Chengyue Gong, Xingchao Liu, Yunyang Xiong, Rakesh Ranjan, Raghuraman Krishnamoorthi, Vikas Chandra, Qiang Liu

We perform evaluations on multiple 3D tasks and find that our PSF performs comparably to the standard diffusion model, outperforming other efficient 3D point cloud generation methods.

Point Cloud Completion

Paper
Code

Consistent Direct Time-of-Flight Video Depth Super-Resolution

1 code implementation • CVPR 2023 • Zhanghao Sun, Wei Ye, Jinhui Xiong, Gyeongmin Choe, Jialiang Wang, Shuochen Su, Rakesh Ranjan

We believe the methods and dataset are beneficial to a broad community as dToF depth sensing is becoming mainstream on mobile devices.

Super-Resolution

Paper
Code

Recurrent Video Restoration Transformer with Guided Deformable Attention

3 code implementations • 5 Jun 2022 • Jingyun Liang, Yuchen Fan, Xiaoyu Xiang, Rakesh Ranjan, Eddy Ilg, Simon Green, JieZhang Cao, Kai Zhang, Radu Timofte, Luc van Gool

Specifically, RVRT divides the video into multiple clips and uses the previously inferred clip feature to estimate the subsequent clip feature.

Ranked #1 on Video Super-Resolution on Vid4 - 4x upscaling - BD degradation

Analog Video Restoration Deblurring +3

333

Paper
Code

HIME: Efficient Headshot Image Super-Resolution with Multiple Exemplars

no code implementations • 28 Mar 2022 • Xiaoyu Xiang, Jon Morton, Fitsum A Reda, Lucas Young, Federico Perazzi, Rakesh Ranjan, Amit Kumar, Andrea Colaco, Jan Allebach

Compared with previous methods, our network can effectively handle the misalignment between the input and the reference without requiring facial priors and learn the aggregated reference set representation in an end-to-end manner.

Image Super-Resolution

Paper
Add Code

Learning Spatio-Temporal Downsampling for Effective Video Upscaling

no code implementations • 15 Mar 2022 • Xiaoyu Xiang, Yapeng Tian, Vijay Rengarajan, Lucas Young, Bo Zhu, Rakesh Ranjan

Consequently, the inverse task of upscaling a low-resolution, low frame-rate video in space and time becomes a challenging ill-posed problem due to information loss and aliasing artifacts.

Quantization

Paper
Add Code

VRT: A Video Restoration Transformer

1 code implementation • 28 Jan 2022 • Jingyun Liang, JieZhang Cao, Yuchen Fan, Kai Zhang, Rakesh Ranjan, Yawei Li, Radu Timofte, Luc van Gool

Besides, parallel warping is used to further fuse information from neighboring frames by parallel feature warping.

Ranked #1 on Deblurring on BASED

Deblurring Denoising +7

1,259

Paper
Code

EyePAD++: A Distillation-based approach for joint Eye Authentication and Presentation Attack Detection using Periocular Images

no code implementations • CVPR 2022 • Prithviraj Dhar, Amit Kumar, Kirsten Kaplan, Khushi Gupta, Rakesh Ranjan, Rama Chellappa

To overcome this, we propose Eye Authentication with PAD (EyePAD), a distillation-based method that trains a single network for EA and PAD while reducing the effect of forgetting.

Paper
Add Code

Feature-Align Network with Knowledge Distillation for Efficient Denoising

no code implementations • 2 Mar 2021 • Lucas D. Young, Fitsum A. Reda, Rakesh Ranjan, Jon Morton, Jun Hu, Yazhu Ling, Xiaoyu Xiang, David Liu, Vikas Chandra

(2) A novel Feature Matching Loss that allows knowledge distillation from large denoising networks in the form of a perceptual content loss.

Decoder Efficient Neural Network +3

Paper
Add Code

EVRNet: Efficient Video Restoration on Edge Devices

no code implementations • 3 Dec 2020 • Sachin Mehta, Amit Kumar, Fitsum Reda, Varun Nasery, Vikram Mulukutla, Rakesh Ranjan, Vikas Chandra

Video transmission applications (e. g., conferencing) are gaining momentum, especially in times of global health pandemic.

Denoising SSIM +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.