Search Results for author: Jaskirat Singh

Found 12 papers, 3 papers with code

SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control

no code implementations • 8 Dec 2023 • Jaskirat Singh, Jianming Zhang, Qing Liu, Cameron Smith, Zhe Lin, Liang Zheng

To overcome these limitations, we introduce SmartMask, which allows any novice user to create detailed masks for precise object insertion.

Image Inpainting Layout Design +2


IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models

1 code implementation • 12 Nov 2023 • Zhaoyuan Yang, Zhengyang Yu, Zhiwei Xu, Jaskirat Singh, Jing Zhang, Dylan Campbell, Peter Tu, Richard Hartley

We present a diffusion-based image morphing approach with perceptually-uniform sampling (IMPUS) that produces smooth, direct and realistic interpolations given an image pair.

Image Generation Image Morphing


Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback

no code implementations • NeurIPS 2023 • Jaskirat Singh, Liang Zheng

Furthermore, we find that the assertion-level alignment scores provide useful feedback, which can then be used in a simple iterative procedure to gradually increase the expression of different assertions in the final image outputs.

Image Generation Visual Question Answering (VQA)


Probabilistic and Semantic Descriptions of Image Manifolds and Their Applications

no code implementations • 6 Jul 2023 • Peter Tu, Zhaoyuan Yang, Richard Hartley, Zhiwei Xu, Jing Zhang, Yiwei Fu, Dylan Campbell, Jaskirat Singh, Tianyu Wang

This paper begins with a description of methods for estimating image probability density functions that reflect the observation that such data is usually constrained to lie in restricted regions of the high-dimensional image space: not every pattern of pixels is an image.


High-Fidelity Guided Image Synthesis with Latent Diffusion Models

no code implementations • CVPR 2023 • Jaskirat Singh, Stephen Gould, Liang Zheng

The user scribbles control the color composition while the text prompt provides control over the overall image semantics.

Image Generation Vocal Bursts Intensity Prediction


UAV-based Visual Remote Sensing for Automated Building Inspection

no code implementations • 27 Sep 2022 • Kushagra Srivastava, Dhruv Patel, Aditya Kumar Jha, Mohhit Kumar Jha, Jaskirat Singh, Ravi Kiran Sarvadevabhatla, Pradeep Kumar Ramancharla, Harikumar Kandath, K. Madhava Krishna

Unmanned Aerial Vehicle (UAV) based remote sensing systems that incorporate computer vision have demonstrated potential for assisting building construction and disaster management, such as damage assessment during earthquakes.

Management


Paint2Pix: Interactive Painting based Progressive Image Synthesis and Editing

1 code implementation • 17 Aug 2022 • Jaskirat Singh, Liang Zheng, Cameron Smith, Jose Echevarria

In particular, we propose a novel approach, paint2pix, which learns to predict (and adapt) "what a user wants to draw" from rudimentary brushstroke inputs by learning a mapping from the manifold of incomplete human paintings to their realistic renderings.

Image Generation


Intelli-Paint: Towards Developing Human-like Painting Agents

no code implementations • 16 Dec 2021 • Jaskirat Singh, Cameron Smith, Jose Echevarria, Liang Zheng

However, current research in this direction is often reliant on a progressive grid-based division strategy wherein the agent divides the overall image into successively finer grids, and then proceeds to paint each of them in parallel.


Sparse Attention Guided Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning

no code implementations • 14 Feb 2021 • Jaskirat Singh, Liang Zheng

However, we argue that the sample variance for a multi-scene environment is best minimized by treating each scene as a distinct MDP, and then learning a joint value function V(s, M) dependent on both state s and MDP M. We further demonstrate that the true joint value function for a multi-scene environment follows a multi-modal distribution, which is not captured by traditional CNN / LSTM based critic networks.

reinforcement-learning Reinforcement Learning (RL)


Combining Semantic Guidance and Deep Reinforcement Learning For Generating Human Level Paintings

1 code implementation • CVPR 2021 • Jaskirat Singh, Liang Zheng

2) We also introduce invariance to the position and scale of the foreground object through a neural alignment model, which combines object localization and spatial transformer networks in an end-to-end manner, to zoom into a particular semantic instance.

Model-based Reinforcement Learning Object +3


Enhanced Scene Specificity with Sparse Dynamic Value Estimation

no code implementations • 25 Nov 2020 • Jaskirat Singh, Liang Zheng

Recently, Singh et al. [1] tried to address this by proposing a dynamic value estimation approach that models the true joint value function distribution as a Gaussian mixture model (GMM).
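As a rough illustration of what modeling a value distribution as a Gaussian mixture can look like, the sketch below evaluates a one-dimensional GMM density and its mean. It is not the authors' implementation: the function names `gmm_pdf` and `gmm_mean`, and the two-scene weights, means, and standard deviations, are hypothetical values chosen only to show a bimodal case.

```python
import math


def gmm_pdf(v, weights, means, stds):
    """Density of a 1-D Gaussian mixture evaluated at value v."""
    return sum(
        w * math.exp(-0.5 * ((v - m) / s) ** 2) / (s * math.sqrt(2.0 * math.pi))
        for w, m, s in zip(weights, means, stds)
    )


def gmm_mean(weights, means):
    """Mean of the mixture: the weighted sum of the component means."""
    return sum(w * m for w, m in zip(weights, means))


# Hypothetical example: two scenes whose returns cluster around 1.0 and 9.0.
weights = [0.5, 0.5]   # mixture weights, assumed to sum to 1
means = [1.0, 9.0]     # per-component mean value (illustrative)
stds = [0.5, 0.5]      # per-component standard deviation (illustrative)

mixture_mean = gmm_mean(weights, means)
density_at_mode = gmm_pdf(1.0, weights, means, stds)
density_at_mean = gmm_pdf(mixture_mean, weights, means, stds)
```

In this toy setting the mixture mean falls between the two modes, where the density is close to zero. A single scalar estimate would therefore describe neither scene well, which is one way to read the motivation for a multi-modal model.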

Specificity


Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning

no code implementations • 25 May 2020 • Jaskirat Singh, Liang Zheng

Training deep reinforcement learning agents on environments with multiple levels / scenes / conditions from the same task has become essential for many applications aiming to achieve generalization and domain transfer from simulation to the real world.

Clustering reinforcement-learning +2

