Search Results for author: Yawei Luo

Found 25 papers, 14 papers with code

Optimized View and Geometry Distillation from Multi-view Diffuser

no code implementations11 Dec 2023 Youjia Zhang, Zikai Song, Junqing Yu, Yawei Luo, Wei Yang

We leverage the rendered views from the optimized radiance field as the basis and develop a two-step specialization process of a 2D diffusion model, which is adept at conducting object-specific denoising and generating high-quality multi-view images.

Denoising

Fine-grained Appearance Transfer with Diffusion Models

1 code implementation27 Nov 2023 Yuteng Ye, Guanwen Li, Hang Zhou, Cai Jiale, Junqing Yu, Yawei Luo, Zikai Song, Qilong Xing, Youjia Zhang, Wei Yang

A pivotal aspect of our approach is the strategic use of the predicted $x_0$ space by diffusion models within the latent space of diffusion processes.

Image-to-Image Translation

Entangled View-Epipolar Information Aggregation for Generalizable Neural Radiance Fields

1 code implementation20 Nov 2023 Zhiyuan Min, Yawei Luo, Wei Yang, Yuesong Wang, Yi Yang

Different from existing methods that consider cross-view and along-epipolar information independently, EVE-NeRF conducts the view-epipolar feature aggregation in an entangled manner by injecting the scene-invariant appearance continuity and geometry consistency priors to the aggregation process.

Generalizable Novel View Synthesis

Triple Correlations-Guided Label Supplementation for Unbiased Video Scene Graph Generation

no code implementations30 Jul 2023 Wenqing Wang, Kaifeng Gao, Yawei Luo, Tao Jiang, Fei Gao, Jian Shao, Jianwen Sun, Jun Xiao

Video-based scene graph generation (VidSGG) is an approach that aims to represent video content in a dynamic graph by identifying visual entities and their relationships.

Graph Generation Missing Labels +2

Knowledge-guided Causal Intervention for Weakly-supervised Object Localization

1 code implementation3 Jan 2023 Feifei Shao, Yawei Luo, Fei Gao, Yi Yang, Jun Xiao

Previous weakly-supervised object localization (WSOL) methods aim to expand activation map discriminative areas to cover the whole objects, yet neglect two inherent challenges when relying solely on image-level labels.

Knowledge Distillation Object +1

Adaptive Patch Deformation for Textureless-Resilient Multi-View Stereo

1 code implementation CVPR 2023 Yuesong Wang, Zhaojie Zeng, Tao Guan, Wei Yang, Zhuo Chen, Wenkai Liu, Luoyuan Xu, Yawei Luo

To detect more anchor pixels to ensure better adaptive patch deformation, we propose to evaluate the matching ambiguity of a certain pixel by checking the convergence of the estimated depth as optimization proceeds.

Point Clouds

Bidirectional Self-Training with Multiple Anisotropic Prototypes for Domain Adaptive Semantic Segmentation

1 code implementation16 Apr 2022 Yulei Lu, Yawei Luo, Li Zhang, Zheyang Li, Yi Yang, Jun Xiao

A thriving trend for domain adaptive segmentation endeavors to generate the high-quality pseudo labels for target domain and retrain the segmentor on them.

Pseudo Label Semantic Segmentation +2

Active Learning for Point Cloud Semantic Segmentation via Spatial-Structural Diversity Reasoning

no code implementations25 Feb 2022 Feifei Shao, Yawei Luo, Ping Liu, Jie Chen, Yi Yang, Yulei Lu, Jun Xiao

To deploy SSDR-AL in a more practical scenario, we design a noise-aware iterative labeling strategy to confront the "noisy annotation" problem introduced by the previous "dominant labeling" strategy in superpoints.

Active Learning Semantic Segmentation

Contrastive Video-Language Segmentation

no code implementations29 Sep 2021 Chen Liang, Yawei Luo, Yu Wu, Yi Yang

We focus on the problem of segmenting a certain object referred by a natural language sentence in video content, at the core of formulating a pinpoint vision-language relation.

Contrastive Learning Relation +2

Prior-Enhanced Few-Shot Segmentation with Meta-Prototypes

1 code implementation1 Jun 2021 Jian-Wei Zhang, Lei Lv, Yawei Luo, Hao-Zhe Feng, Yi Yang, Wei Chen

The hierarchical features help the model highlight the decision boundary and focus on hard pixels, and the structural information learned from base classes is treated as the prior knowledge for novel classes.

VidFace: A Full-Transformer Solver for Video FaceHallucination with Unaligned Tiny Snapshots

1 code implementation31 May 2021 Yuan Gan, Yawei Luo, Xin Yu, Bang Zhang, Yi Yang

In this paper, we investigate the task of hallucinating an authentic high-resolution (HR) human face from multiple low-resolution (LR) video snapshots.

Face Hallucination Hallucination

Improving Weakly-supervised Object Localization via Causal Intervention

1 code implementation21 Apr 2021 Feifei Shao, Yawei Luo, Li Zhang, Lu Ye, Siliang Tang, Yi Yang, Jun Xiao

The recent emerged weakly supervised object localization (WSOL) methods can learn to localize an object in the image only using image-level labels.

Object Weakly-Supervised Object Localization

ClawCraneNet: Leveraging Object-level Relation for Text-based Video Segmentation

no code implementations19 Mar 2021 Chen Liang, Yu Wu, Yawei Luo, Yi Yang

Text-based video segmentation is a challenging task that segments out the natural language referred objects in videos.

Ranked #4 on Referring Expression Segmentation on J-HMDB (Precision@0.9 metric)

Object Referring Expression Segmentation +4

Look, Cast and Mold: Learning 3D Shape Manifold from Single-view Synthetic Data

no code implementations8 Mar 2021 Qianyu Feng, Yawei Luo, Keyang Luo, Yi Yang

To generalize the model towards a real scenario, we propose to fulfill several aspects: (1) Look: visually incorporate spatial structure from the single view to enhance the expressiveness of representation; (2) Cast: perceptually align the 2D image features to the 3D shape priors with cross-modal semantic contrastive mapping; (3) Mold: reconstruct stereo-shape of target by transforming embeddings into the desired manifold.

3D Reconstruction Single-View 3D Reconstruction

Copy and Paste GAN: Face Hallucination from Shaded Thumbnails

no code implementations CVPR 2020 Yang Zhang, Ivor Tsang, Yawei Luo, Changhui Hu, Xiaobo Lu, Xin Yu

This paper proposes a Copy and Paste Generative Adversarial Network (CPGAN) to recover authentic high-resolution (HR) face images while compensating for low and non-uniform illumination.

Face Hallucination Generative Adversarial Network +1

Significance-aware Information Bottleneck for Domain Adaptive Semantic Segmentation

no code implementations ICCV 2019 Yawei Luo, Ping Liu, Tao Guan, Junqing Yu, Yi Yang

For unsupervised domain adaptation problems, the strategy of aligning the two domains in latent feature space through adversarial learning has achieved much progress in image classification, but usually fails in semantic segmentation tasks in which the latent representations are overcomplex.

Image Classification Segmentation +2

Every Node Counts: Self-Ensembling Graph Convolutional Networks for Semi-Supervised Learning

1 code implementation26 Sep 2018 Yawei Luo, Tao Guan, Junqing Yu, Ping Liu, Yi Yang

To capitalize on the information from unlabeled nodes to boost the training for GCN, we propose a novel framework named Self-Ensembling GCN (SEGCN), which marries GCN with Mean Teacher - another powerful model in semi-supervised learning.

General Classification Node Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.