Browse > Computer Vision > Perception

Perception

124 papers with code · Computer Vision

State-of-the-art leaderboards

No evaluation results yet. Help compare methods by submit evaluation metrics.

Greatest papers with code

A Neural Algorithm of Artistic Style

26 Aug 2015jcjohnson/neural-style

In fine art, especially painting, humans have mastered the skill to create unique visual experiences through composing a complex interplay between the content and style of an image.

PERCEPTION STYLE TRANSFER

GANSynth: Adversarial Neural Audio Synthesis

ICLR 2019 tensorflow/magenta

Efficient audio synthesis is an inherently difficult machine learning task, as human perception is sensitive to both global structure and fine-scale waveform coherence.

AUDIO GENERATION PERCEPTION

Embodied Question Answering

CVPR 2018 facebookresearch/House3D

We present a new AI task -- Embodied Question Answering (EmbodiedQA) -- where an agent is spawned at a random location in a 3D environment and asked a question ("What color is the car?").

EMBODIED QUESTION ANSWERING PERCEPTION QUESTION ANSWERING

The Unreasonable Effectiveness of Deep Features as a Perceptual Metric

CVPR 2018 richzhang/PerceptualSimilarity

We systematically evaluate deep features across different architectures and tasks and compare them with classic metrics.

PERCEPTION

Learning Typographic Style

13 Mar 2016kaonashi-tyc/Rewrite

Typography is a ubiquitous art form that affects our understanding, perception, and trust in what we read.

PERCEPTION

NIMA: Neural Image Assessment

15 Sep 2017titu1994/neural-image-assessment

Automatically learned quality assessment for images has recently become a hot topic due to its usefulness in a wide variety of applications such as evaluating image capture pipelines, storage techniques and sharing media.

IMAGE QUALITY ASSESSMENT PERCEPTION

SCUT-FBP5500: A Diverse Benchmark Dataset for Multi-Paradigm Facial Beauty Prediction

19 Jan 2018HCIILAB/SCUT-FBP5500-Database-Release

Previous works have formulated the recognition of facial beauty as a specific supervised learning problem of classification, regression or ranking, which indicates that FBP is intrinsically a computation problem with multiple paradigms.

FACIAL BEAUTY PREDICTION PERCEPTION

4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks

CVPR 2019 StanfordVL/MinkowskiEngine

To overcome challenges in the 4D space, we propose the hybrid kernel, a special case of the generalized sparse convolution, and the trilateral-stationary conditional random field that enforces spatio-temporal consistency in the 7D space-time-chroma space.

3D SEMANTIC SEGMENTATION 4D SPATIO TEMPORAL SEMANTIC SEGMENTATION PERCEPTION SEMANTIC SEGMENTATION

End-to-end Learning of Driving Models from Large-scale Video Datasets

CVPR 2017 gy20073/BDD_Driving_Model

Robust perception-action models should be learned from training data with diverse visual appearances and realistic behaviors, yet current approaches to deep visuomotor policy learning have been generally limited to in-situ models learned from a single vehicle or a simulation environment.

PERCEPTION SCENE SEGMENTATION

Unsupervised Attention-guided Image-to-Image Translation

NeurIPS 2018 AlamiMejjati/Unsupervised-Attention-guided-Image-to-Image-Translation

Current unsupervised image-to-image translation techniques struggle to focus their attention on individual objects without altering the background or the way multiple objects interact within a scene.

PERCEPTION UNSUPERVISED IMAGE-TO-IMAGE TRANSLATION