Search Results for author: Edgar Simo-Serra

Found 30 papers, 9 papers with code

Return-Aligned Decision Transformer

no code implementations • 6 Feb 2024 • Tsunehiko Tanaka, Kenshi Abe, Kaito Ariu, Tetsuro Morimura, Edgar Simo-Serra

Traditional approaches in offline reinforcement learning aim to learn the optimal policy that maximizes the cumulative reward, also known as return.

Visual Grounding of Whole Radiology Reports for 3D CT Images

no code implementations • 8 Dec 2023 • Akimichi Ichinose, Taro Hatsutani, Keigo Nakamura, Yoshiro Kitamura, Satoshi Iizuka, Edgar Simo-Serra, Shoji Kido, Noriyuki Tomiyama

Our framework combines two components: 1) anatomical segmentation of images and 2) report structuring.

Segmentation Visual Grounding

Image Synthesis-based Late Stage Cancer Augmentation and Semi-Supervised Segmentation for MRI Rectal Cancer Staging

no code implementations • 8 Dec 2023 • Saeko Sasuga, Akira Kudo, Yoshiro Kitamura, Satoshi Iizuka, Edgar Simo-Serra, Atsushi Hamabe, Masayuki Ishii, Ichiro Takemasa

To tackle this, we propose two approaches designed for T-stage prediction: image synthesis-based late-stage cancer augmentation and semi-supervised learning.

Data Augmentation Image Generation +1

Diffusion-based Holistic Texture Rectification and Synthesis

no code implementations • 26 Sep 2023 • Guoqing Hao, Satoshi Iizuka, Kensho Hara, Edgar Simo-Serra, Hirokatsu Kataoka, Kazuhiro Fukui

We present a novel framework for rectifying occlusions and distortions in degraded texture samples from natural images.

Texture Synthesis

Controllable Multi-domain Semantic Artwork Synthesis

no code implementations • 19 Aug 2023 • Yuantian Huang, Satoshi Iizuka, Edgar Simo-Serra, Kazuhiro Fukui

To address this problem, we propose a dataset, which we call ArtSem, that contains 40,000 images of artwork from 4 different domains with their corresponding semantic label maps.

Generative Adversarial Network

Diffusart: Enhancing Line Art Colorization with Conditional Diffusion Models

1 code implementation • Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops 2023 • Hernan Carrillo, Michaël Clément, Aurélie Bugeau, Edgar Simo-Serra

Colorization of line art drawings is an important task in illustration and animation workflows.

Line Art Colorization SSIM

Towards Flexible Multi-modal Document Models

1 code implementation • CVPR 2023 • Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi

Creative workflows for generating graphical documents involve complex inter-related tasks, such as aligning elements, choosing appropriate fonts, or employing aesthetically harmonious colors.

Multi-Task Learning Position

LayoutDM: Discrete Diffusion Model for Controllable Layout Generation

1 code implementation • CVPR 2023 • Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi

Controllable layout generation aims at synthesizing a plausible arrangement of element bounding boxes with optional constraints, such as the type or position of a specific element.

Position

Generative Colorization of Structured Mobile Web Pages

1 code implementation • 22 Dec 2022 • Kotaro Kikuchi, Naoto Inoue, Mayu Otani, Edgar Simo-Serra, Kota Yamaguchi

The web page colorization problem is then formalized as a task of estimating plausible color styles for a given web page content with a given hierarchical structure of the elements.

Colorization Efficient Exploration +1

P2Net: A Post-Processing Network for Refining Semantic Segmentation of LiDAR Point Cloud based on Consistency of Consecutive Frames

no code implementations • 1 Dec 2022 • Yutaka Momma, Weimin WANG, Edgar Simo-Serra, Satoshi Iizuka, Ryosuke Nakamura, Hiroshi Ishikawa

To remedy this problem, we propose to explicitly train a network to refine these results predicted by an existing segmentation method.

Semantic Segmentation

Constrained Graphic Layout Generation via Latent Optimization

1 code implementation • 2 Aug 2021 • Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi

We optimize using the latent space of an off-the-shelf layout generation model, allowing our approach to be complementary to and used with existing layout generation models.

General Virtual Sketching Framework for Vector Line Art

1 code implementation • Transactions on Graphics (SIGGRAPH) 2021 • Haoran Mo, Edgar Simo-Serra, Chengying Gao, Changqing Zou, Ruomei Wang

Vector line art plays an important role in graphic design; however, it is tedious to create manually.

User-Guided Line Art Flat Filling With Split Filling Mechanism

no code implementations • CVPR 2021 • Lvmin Zhang, Chengze Li, Edgar Simo-Serra, Yi Ji, Tien-Tsin Wong, Chunping Liu

We present a deep learning framework for user-guided line art flat filling that can compute the "influence areas" of the user color scribbles, i.e., the areas where the user scribbles should propagate and influence.

Automatic Segmentation, Localization, and Identification of Vertebrae in 3D CT Images Using Cascaded Convolutional Neural Networks

no code implementations • 29 Sep 2020 • Naoto Masuzawa, Yoshiro Kitamura, Keigo Nakamura, Satoshi Iizuka, Edgar Simo-Serra

The input to the second network has an auxiliary channel in addition to the 3D CT images.

Anatomy

TopNet: Topology Preserving Metric Learning for Vessel Tree Reconstruction and Labelling

no code implementations • 18 Sep 2020 • Deepak Keshwani, Yoshiro Kitamura, Satoshi Ihara, Satoshi Iizuka, Edgar Simo-Serra

To the best of our knowledge, this is the first deep learning based approach which learns multi-label tree structure connectivity from images.

Metric Learning Segmentation +1

DeepRemaster: Temporal Source-Reference Attention Networks for Comprehensive Video Enhancement

no code implementations • 18 Sep 2020 • Satoshi Iizuka, Edgar Simo-Serra

The remastering of vintage film comprises a diversity of sub-tasks, including super-resolution, noise removal, and contrast enhancement, which aim to restore the deteriorated film medium to its original state.

Colorization Super-Resolution +1

Two-stage Discriminative Re-ranking for Large-scale Landmark Retrieval

2 code implementations • 25 Mar 2020 • Shuhei Yokoo, Kohei Ozaki, Edgar Simo-Serra, Satoshi Iizuka

Due to the variance of the images, which include extreme viewpoint changes such as having to retrieve images of the exterior of a landmark from images of the interior, this is very challenging for approaches based exclusively on visual similarity.

Image Retrieval Landmark Recognition +3

Understanding the Effects of Pre-Training for Object Detectors via Eigenspectrum

no code implementations • 9 Sep 2019 • Yosuke Shinya, Edgar Simo-Serra, Taiji Suzuki

Furthermore, we propose a method for automatically determining the widths (the numbers of channels) of object detectors based on the eigenspectrum.

Image Classification Object +2

Virtual Thin Slice: 3D Conditional GAN-based Super-resolution for CT Slice Interval

no code implementations • 30 Aug 2019 • Akira Kudo, Yoshiro Kitamura, Yuanzhong Li, Satoshi Iizuka, Edgar Simo-Serra

In this paper, we present a novel architecture based on conditional Generative Adversarial Networks (cGANs) with the goal of generating high resolution images of main body parts including head, chest, abdomen and legs.

Anatomy SSIM +1

Joint Gap Detection and Inpainting of Line Drawings

no code implementations • CVPR 2017 • Kazuma Sasaki, Satoshi Iizuka, Edgar Simo-Serra, Hiroshi Ishikawa

We evaluate our method qualitatively on a diverse set of challenging line drawings and also provide quantitative results with a user study, where it significantly outperforms the state of the art.

Multi-Modal Fashion Product Retrieval

no code implementations • WS 2017 • Antonio Rubio Romano, LongLong Yu, Edgar Simo-Serra, Francesc Moreno-Noguer

Finding a product in the fashion world can be a daunting task.

Retrieval

Mastering Sketching: Adversarial Augmentation for Structured Prediction

no code implementations • 27 Mar 2017 • Edgar Simo-Serra, Satoshi Iizuka, Hiroshi Ishikawa

Our approach augments a simplification network with a discriminator network, training both networks jointly so that the discriminator network discerns whether a line drawing is real training data or the output of the simplification network, which in turn tries to fool it.

Structured Prediction

Let there be color!: joint end-to-end learning of global and local image priors for automatic image colorization with simultaneous classification

3 code implementations • ACM Transactions on Graphics 2016 • Satoshi Iizuka, Edgar Simo-Serra, Hiroshi Ishikawa

We present a novel technique to automatically colorize grayscale images that combines both global priors and local image features.

Colorization Image Colorization +1

Fashion Style in 128 Floats: Joint Ranking and Classification Using Weak Data for Feature Extraction

no code implementations • CVPR 2016 • Edgar Simo-Serra, Hiroshi Ishikawa

We propose a novel approach for learning features from weakly-supervised data by joint ranking and classification.

General Classification

Understanding Human-Centric Images: From Geometry to Fashion

no code implementations • 14 Dec 2015 • Edgar Simo-Serra

Understanding humans from photographs has always been a fundamental goal of computer vision.

3D Human Pose Estimation Semantic Segmentation

Discriminative Learning of Deep Convolutional Feature Point Descriptors

1 code implementation • ICCV 2015 • Edgar Simo-Serra, Eduard Trulls, Luis Ferraz, Iasonas Kokkinos, Pascal Fua, Francesc Moreno-Noguer

Deep learning has revolutionized image-level tasks such as classification, but patch-level tasks, such as correspondence, still rely on hand-crafted features, e.g., SIFT.

Ranked #2 on Satellite Image Classification on SAT-4

Satellite Image Classification

Structured Prediction with Output Embeddings for Semantic Image Annotation

no code implementations • NAACL 2016 • Ariadna Quattoni, Arnau Ramisa, Pranava Swaroop Madhyastha, Edgar Simo-Serra, Francesc Moreno-Noguer

We address the task of annotating images with semantic tuples.

Structured Prediction

Neuroaesthetics in Fashion: Modeling the Perception of Fashionability

no code implementations • Conference 2015 • Edgar Simo-Serra, Sanja Fidler, Francesc Moreno-Noguer, Raquel Urtasun

Importantly, our model is able to give rich feedback to the user, conveying which garments or even scenery she/he should change in order to improve fashionability.

Fracking Deep Convolutional Image Descriptors

no code implementations • 19 Dec 2014 • Edgar Simo-Serra, Eduard Trulls, Luis Ferraz, Iasonas Kokkinos, Francesc Moreno-Noguer

In this paper we propose a novel framework for learning local image descriptors in a discriminative manner.

A Joint Model for 2D and 3D Pose Estimation from a Single Image

no code implementations • CVPR 2013 • Edgar Simo-Serra, Ariadna Quattoni, Carme Torras, Francesc Moreno-Noguer

We introduce a novel approach to automatically recover 3D human pose from a single image.

Ranked #25 on 3D Human Pose Estimation on HumanEva-I

3D Human Pose Estimation 3D Pose Estimation +1
