Search Results for author: Aljosa Smolic

Found 32 papers, 13 papers with code

StylePrompter: All Styles Need Is Attention

1 code implementation • 30 Jul 2023 • Chenyi Zhuang, Pan Gao, Aljosa Smolic

We then prove that StylePrompter lies in a more disentangled $\mathcal{W^+}$ and show the controllability of SMART.

Paper
Code

Blind Image Quality Assessment via Transformer Predicted Error Map and Perceptual Quality Token

1 code implementation • 16 May 2023 • Jinsong Shi, Pan Gao, Aljosa Smolic

Specifically, we firstly generate the predicted error map by pre-training one model consisting of a Transformer encoder and decoder, in which the objective difference between the distorted and the reference images is used as supervision.

Blind Image Quality Assessment No-Reference Image Quality Assessment +1

Paper
Code

View Sub-sampling and Reconstruction for Efficient Light Field Compression

no code implementations • 12 Aug 2022 • Yang Chen, Martin Alain, Aljosa Smolic

In this work, we study different sub-sampling and reconstruction strategies for light field compression.

Paper
Add Code

Jointformer: Single-Frame Lifting Transformer with Error Prediction and Refinement for 3D Human Pose Estimation

1 code implementation • 7 Aug 2022 • Sebastian Lutz, Richard Blythman, Koustav Ghosal, Matthew Moynihan, Ciaran Simms, Aljosa Smolic

Monocular 3D human pose estimation technologies have the potential to greatly increase the availability of human movement data.

Ranked #50 on 3D Human Pose Estimation on Human3.6M

Monocular 3D Human Pose Estimation Multi-Task Learning

Paper
Code

KinePose: A temporally optimized inverse kinematics technique for 6DOF human pose estimation with biomechanical constraints

no code implementations • 26 Jul 2022 • Kevin Gildea, Clara Mercadal-Baudart, Richard Blythman, Aljosa Smolic, Ciaran Simms

With frame-by-frame IK we obtain low errors in the case of bent elbows and knees, however, motion sequences with phases of extended/straight limbs results in ambiguity in twist angle.

3D Human Pose Estimation

Paper
Add Code

Image Aesthetics Assessment Using Graph Attention Network

1 code implementation • 26 Jun 2022 • Koustav Ghosal, Aljosa Smolic

Second, we propose a graph neural network architecture that takes this feature-graph and captures the semantic relationship between the different regions of the input image using visual attention.

Graph Attention

Paper
Code

TEAM-Net: Multi-modal Learning for Video Action Recognition with Partial Decoding

1 code implementation • 17 Oct 2021 • Zhengwei Wang, Qi She, Aljosa Smolic

Video compression (e. g., H. 264, MPEG-4) reduces superfluous information by representing the raw video stream using the concept of Group of Pictures (GOP).

Action Recognition Temporal Action Localization +1

Paper
Code

Spectral analysis of re-parameterized light fields

no code implementations • 12 Oct 2021 • Martin Alain, Aljosa Smolic

In this paper, we study the spectral properties of re-parameterized light field.

Paper
Add Code

Deep Color Mismatch Correction In Stereoscopic 3D Images

1 code implementation • IEEE International Conference on Image Processing (ICIP) 2021 • Simone Croci, Cagri Ozcinar, Emin Zerman, Roman Dudek, Sebastian Knorr, Aljosa Smolic

Color mismatch in stereoscopic 3D (S3D) images can create visual discomfort and affect the performance of S3D image processing algorithms, e. g., for depth estimation.

Color Mismatch Correction Depth Estimation

Paper
Code

Frequency Domain Loss Function for Deep Exposure Correction of Dark Images

no code implementations • 22 Apr 2021 • Ojasvi Yadav, Koustav Ghosal, Sebastian Lutz, Aljosa Smolic

We address the problem of exposure correction of dark, blurry and noisy images captured in low-light conditions in the wild.

Image Denoising Translation

Paper
Add Code

Foreground color prediction through inverse compositing

no code implementations • 24 Mar 2021 • Sebastian Lutz, Aljosa Smolic

In natural image matting, the goal is to estimate the opacity of the foreground object in the image.

Image Matting

Paper
Add Code

ACTION-Net: Multipath Excitation for Action Recognition

1 code implementation • CVPR 2021 • Zhengwei Wang, Qi She, Aljosa Smolic

To this end, we propose a spAtio-temporal, Channel and moTion excitatION (ACTION) module consisting of three paths: Spatio-Temporal Excitation (STE) path, Channel Excitation (CE) path, and Motion Excitation (ME) path.

Action Recognition Temporal Action Localization

196

Paper
Code

Self-supervised Light Field View Synthesis Using Cycle Consistency

no code implementations • 12 Aug 2020 • Yang Chen, Martin Alain, Aljosa Smolic

In order to enhance the angular resolution of light fields, view synthesis methods can be utilized to generate dense intermediate views from sparse light field input.

Paper
Add Code

A Study of Efficient Light Field Subsampling and Reconstruction Strategies

no code implementations • 11 Aug 2020 • Yang Chen, Martin Alain, Aljosa Smolic

Limited angular resolution is one of the main obstacles for practical applications of light fields.

Super-Resolution

Paper
Add Code

Fast and Accurate Optical Flow based Depth Map Estimation from Light Fields

no code implementations • 11 Aug 2020 • Yang Chen, Martin Alain, Aljosa Smolic

Depth map estimation is a crucial task in computer vision, and new approaches have recently emerged taking advantage of light fields, as this new imaging modality captures much more information about the angular direction of light rays compared to common approaches based on stereoscopic images or multi-view.

Depth Estimation Optical Flow Estimation

Paper
Add Code

A Study on Visual Perception of Light Field Content

no code implementations • 7 Aug 2020 • Ailbhe Gill, Emin Zerman, Cagri Ozcinar, Aljosa Smolic

We conducted perception experiments displaying them to users in various ways and collected corresponding visual attention data.

Paper
Add Code

Sub-Pixel Back-Projection Network For Lightweight Single Image Super-Resolution

1 code implementation • 3 Aug 2020 • Supratik Banerjee, Cagri Ozcinar, Aakanksha Rana, Aljosa Smolic, Michael Manzke

To tackle this problem, in this paper, we study reducing the number of parameters and computational cost of CNN-based SISR methods while maintaining the accuracy of super-resolution reconstruction performance.

Ranked #2 on Image Super-Resolution on BSDS100 - 2x upscaling

Image Super-Resolution

Paper
Code

CatNet: Class Incremental 3D ConvNets for Lifelong Egocentric Gesture Recognition

1 code implementation • 20 Apr 2020 • Zhengwei Wang, Qi She, Tejo Chalasani, Aljosa Smolic

Egocentric gestures are the most natural form of communication for humans to interact with wearable devices such as VR/AR helmets and glasses.

Gesture Recognition Video Recognition

Paper
Code

Simultaneous Segmentation and Recognition: Towards more accurate Ego Gesture Recognition

no code implementations • 18 Sep 2019 • Tejo Chalasani, Aljosa Smolic

Ego hand gestures can be used as an interface in AR and VR environments.

Activity Recognition Caption Generation +5

Paper
Add Code

DublinCity: Annotated LiDAR Point Cloud and its Applications

no code implementations • 6 Sep 2019 • S. M. Iman Zolanvari, Susana Ruano, Aakanksha Rana, Alan Cummins, Rogerio Eduardo da Silva, Morteza Rahbar, Aljosa Smolic

The paper presents the first-ever labelled dataset for a highly dense Aerial Laser Scanning (ALS) point cloud at city-scale.

3D Reconstruction Scene Understanding

Paper
Add Code

STaDA: Style Transfer as Data Augmentation

no code implementations • 3 Sep 2019 • Xu Zheng, Tejo Chalasani, Koustav Ghosal, Sebastian Lutz, Aljosa Smolic

The success of training deep Convolutional Neural Networks (CNNs) heavily depends on a significant amount of labelled data.

Classification Data Augmentation +3

Paper
Add Code

A Geometry-Sensitive Approach for Photographic Style Classification

no code implementations • 3 Sep 2019 • Koustav Ghosal, Mukta Prasad, Aljosa Smolic

We further introduce a two-column CNN architecture that performs better than the state-of-the-art (SoA) in photographic style classification.

Classification Data Augmentation +3

Paper
Add Code

Aesthetic Image Captioning From Weakly-Labelled Photographs

1 code implementation • 29 Aug 2019 • Koustav Ghosal, Aakanksha Rana, Aljosa Smolic

Aesthetic image captioning (AIC) refers to the multi-modal task of generating critical textual feedbacks for photographs.

Aesthetic Image Captioning Benchmarking

Paper
Code

ColorNet -- Estimating Colorfulness in Natural Images

1 code implementation • 22 Aug 2019 • Emin Zerman, Aakanksha Rana, Aljosa Smolic

Measuring the colorfulness of a natural or virtual scene is critical for many applications in image processing field ranging from capturing to display.

Image Classification

Paper
Code

Super-resolution of Omnidirectional Images Using Adversarial Learning

1 code implementation • 12 Aug 2019 • Cagri Ozcinar, Aakanksha Rana, Aljosa Smolic

An omnidirectional image (ODI) enables viewers to look in every direction from a fixed point through a head-mounted display providing an immersive experience compared to that of a standard image.

Generative Adversarial Network Super-Resolution

Paper
Code

Deep Tone Mapping Operator for High Dynamic Range Images

no code implementations • 12 Aug 2019 • Aakanksha Rana, Praveer Singh, Giuseppe Valenzise, Frederic Dufaux, Nikos Komodakis, Aljosa Smolic

In this paper, we address this problem by proposing a fast, parameter-free and scene-adaptable deep tone mapping operator (DeepTMO) that yields a high-resolution and high-subjective quality tone mapped output.

Generative Adversarial Network Tone Mapping +1

Paper
Add Code

A Fourier Disparity Layer representation for Light Fields

no code implementations • 21 Jan 2019 • Mikael Le Pendu, Christine Guillemot, Aljosa Smolic

In this paper, we present a new Light Field representation for efficient Light Field processing and rendering called Fourier Disparity Layers (FDL).

Denoising

Paper
Add Code

Dynamic Environment Mapping for Augmented Reality Applications on Mobile Devices

no code implementations • 21 Sep 2018 • Rafael Monroy, Matis Hudon, Aljosa Smolic

In this work, we present a novel pipeline that allows for coupled environment acquisition and virtual object rendering on a mobile device equipped with a depth sensor.

Paper
Add Code

Egocentric Gesture Recognition for Head-Mounted AR devices

no code implementations • 16 Aug 2018 • Tejo Chalasani, Jan Ondrej, Aljosa Smolic

Natural interaction with virtual objects in AR/VR environments makes for a smooth user experience.

Data Augmentation Gesture Recognition

Paper
Add Code

A Pipeline for Lenslet Light Field Quality Enhancement

no code implementations • 16 Aug 2018 • Pierre Matysiak, Mairéad Grogan, Mikaël Le Pendu, Martin Alain, Aljosa Smolic

In recent years, light fields have become a major research topic and their applications span across the entire spectrum of classical image processing.

Denoising

Paper
Add Code

AlphaGAN: Generative adversarial networks for natural image matting

1 code implementation • 26 Jul 2018 • Sebastian Lutz, Konstantinos Amplianitis, Aljosa Smolic

We present the first generative adversarial network (GAN) for natural image matting.

Generative Adversarial Network Image Matting

152

Paper
Code

SalNet360: Saliency Maps for omni-directional images with CNN

no code implementations • 19 Sep 2017 • Rafael Monroy, Sebastian Lutz, Tejo Chalasani, Aljosa Smolic

The prediction of Visual Attention data from any kind of media is of valuable use to content creators and used to efficiently drive encoding algorithms.

Saliency Prediction

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.