Search Results for author: Yao Yao

Found 62 papers, 25 papers with code

STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians

no code implementations22 Mar 2024 Yifei Zeng, Yanqin Jiang, Siyu Zhu, Yuanxun Lu, Youtian Lin, Hao Zhu, Weiming Hu, Xun Cao, Yao Yao

Recent progress in pre-trained diffusion models and 3D generation have spurred interest in 4D content creation.

3D Generation

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

1 code implementation21 Mar 2024 Shenhao Zhu, Junming Leo Chen, Zuozhuo Dai, Yinghui Xu, Xun Cao, Yao Yao, Hao Zhu, Siyu Zhu

In this study, we introduce a methodology for human image animation by leveraging a 3D human parametric model within a latent diffusion framework to enhance shape alignment and motion guidance in curernt human generative techniques.

Animated GIF Generation Image Animation +1

GaussianPro: 3D Gaussian Splatting with Progressive Propagation

no code implementations22 Feb 2024 Kai Cheng, Xiaoxiao Long, Kaizhi Yang, Yao Yao, Wei Yin, Yuexin Ma, Wenping Wang, Xuejin Chen

The advent of 3D Gaussian Splatting (3DGS) has recently brought about a revolution in the field of neural rendering, facilitating high-quality renderings at real-time speed.

Neural Rendering Patch Matching

From Pixel to Slide image: Polarization Modality-based Pathological Diagnosis Using Representation Learning

no code implementations3 Jan 2024 Jia Dong, Yao Yao, Yang Dong, Hui Ma

This structure includes a pathology structure recognition method to predict structures related to thyroid tumors, an encoder-decoder network to extract pixel-level annotation information by learning the feature representations of image blocks, and an attention-based learning mechanism for the final classification task.

Representation Learning

A Polarization and Radiomics Feature Fusion Network for the Classification of Hepatocellular Carcinoma and Intrahepatic Cholangiocarcinoma

no code implementations27 Dec 2023 Jia Dong, Yao Yao, Liyan Lin, Yang Dong, Jiachen Wan, Ran Peng, Chao Li, Hui Ma

Our experimental results underscore the potential of this fusion network as a powerful tool for computer-aided diagnosis of HCC and ICC, showcasing the benefits and prospects of integrating polarization imaging techniques into the current image-intensive digital pathological diagnosis.

Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle

no code implementations6 Dec 2023 Youtian Lin, Zuozhuo Dai, Siyu Zhu, Yao Yao

Moreover, the explicit deformation modeling for discretized Gaussian points ensures ultra-fast training and rendering of a 4D scene, which is comparable to the original 3DGS designed for static 3D reconstruction.

3D Reconstruction 4D reconstruction +1

Direct2.5: Diverse Text-to-3D Generation via Multi-view 2.5D Diffusion

no code implementations27 Nov 2023 Yuanxun Lu, Jingyang Zhang, Shiwei Li, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan, Xun Cao, Yao Yao

The multi-view 2. 5D diffusion directly models the structural distribution of 3D data, while still maintaining the strong generalization ability of the original 2D diffusion model, filling the gap between 2D diffusion-based and direct 3D diffusion-based methods for 3D content generation.

3D Generation Text to 3D

Relightable 3D Gaussian: Real-time Point Cloud Relighting with BRDF Decomposition and Ray Tracing

no code implementations27 Nov 2023 Jian Gao, Chun Gu, Youtian Lin, Hao Zhu, Xun Cao, Li Zhang, Yao Yao

We present a novel differentiable point-based rendering framework for material and lighting decomposition from multi-view images, enabling editing, ray-tracing, and real-time relighting of the 3D point cloud.

BRDF estimation Lighting Estimation

Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

1 code implementation20 Nov 2023 Zhuosheng Zhang, Yao Yao, Aston Zhang, Xiangru Tang, Xinbei Ma, Zhiwei He, Yiming Wang, Mark Gerstein, Rui Wang, Gongshen Liu, Hai Zhao

Large language models (LLMs) have dramatically enhanced the field of language intelligence, as demonstrably evidenced by their formidable empirical performance across a spectrum of complex reasoning tasks.

Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video

no code implementations6 Nov 2023 Yanqin Jiang, Li Zhang, Jin Gao, Weimin Hu, Yao Yao

This is achieved by leveraging the object-level 3D-aware image diffusion model as the primary supervision signal for training Dynamic Neural Radiance Fields (DyNeRF).

3D Generation Camera Calibration +3

CDR-Adapter: Learning Adapters to Dig Out More Transferring Ability for Cross-Domain Recommendation Models

no code implementations4 Nov 2023 Yanyu Chen, Yao Yao, Wai Kin Victor Chan, Li Xiao, Kai Zhang, Liang Zhang, Yun Ye

In this paper, we present a scalable and efficient paradigm to address data sparsity and cold-start issues in CDR, named CDR-Adapter, by decoupling the original recommendation model from the mapping function, without requiring re-engineering the network structure.

Recommendation Systems Transfer Learning

JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation

1 code implementation29 Oct 2023 Yao Yao, Peike Li, BoYu Chen, Alex Wang

With rapid advances in generative artificial intelligence, the text-to-music synthesis task has emerged as a promising direction for music generation from scratch.

Music Generation

JointNet: Extending Text-to-Image Diffusion for Dense Distribution Modeling

no code implementations10 Oct 2023 Jingyang Zhang, Shiwei Li, Yuanxun Lu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan, Yao Yao

We introduce JointNet, a novel neural network architecture for modeling the joint distribution of images and an additional dense modality (e. g., depth maps).

Depth Estimation Depth Prediction +1

Anti-Aliased Neural Implicit Surfaces with Encoding Level of Detail

no code implementations19 Sep 2023 Yiyu Zhuang, Qi Zhang, Ying Feng, Hao Zhu, Yao Yao, Xiaoyu Li, Yan-Pei Cao, Ying Shan, Xun Cao

Drawing inspiration from voxel-based representations with the level of detail (LoD), we introduce a multi-scale tri-plane-based scene representation that is capable of capturing the LoD of the signed distance function (SDF) and the space radiance.

Surface Reconstruction

TOPIC: A Parallel Association Paradigm for Multi-Object Tracking under Complex Motions and Diverse Scenes

2 code implementations22 Aug 2023 Xiaoyan Cao, Yiyao Zheng, Yao Yao, Huapeng Qin, Xiaoyu Cao, Shihui Guo

Existing trackers can be categorized into two association paradigms: single-feature paradigm (based on either motion or appearance feature) and serial paradigm (one feature serves as secondary while the other is primary).

Multi-Object Tracking

JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models

2 code implementations9 Aug 2023 Peike Li, BoYu Chen, Yao Yao, Yikai Wang, Allen Wang, Alex Wang

Despite the task's significance, prevailing generative models exhibit limitations in music quality, computational efficiency, and generalization.

Computational Efficiency In-Context Learning +2

AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation

no code implementations16 Jun 2023 Yifei Zeng, Yuanxun Lu, Xinya Ji, Yao Yao, Hao Zhu, Xun Cao

Unlike previous approaches that can only synthesize avatars based on simple text descriptions, our method enables the creation of personalized avatars from casually captured face or body images, while still supporting text-based model generation and editing.

Text to 3D

Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models

2 code implementations26 May 2023 Yao Yao, Zuchao Li, Hai Zhao

Therefore, we propose Graph-of-Thought (GoT) reasoning, which models human thought processes not only as a chain but also as a graph.

GSM8K Multimodal Reasoning +1

NeILF++: Inter-Reflectable Light Fields for Geometry and Material Estimation

no code implementations ICCV 2023 Jingyang Zhang, Yao Yao, Shiwei Li, Jingbo Liu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

We present a novel differentiable rendering framework for joint geometry, material, and lighting estimation from multi-view images.

Lighting Estimation

Stochastic Methods for AUC Optimization subject to AUC-based Fairness Constraints

no code implementations23 Dec 2022 Yao Yao, Qihang Lin, Tianbao Yang

In this work, we formulate the training problem of a fairness-aware machine learning model as an AUC optimization problem subject to a class of AUC-based fairness constraints.

Fairness

Towards Relation-centered Pooling and Convolution for Heterogeneous Graph Learning Networks

1 code implementation31 Oct 2022 Tiehua Zhang, Yuze Liu, Yao Yao, Youhua Xia, Xin Chen, Xiaowei Huang, Jiong Jin

Heterogeneous graph neural network has unleashed great potential on graph representation learning and shown superior performance on downstream tasks such as node classification and clustering.

Graph Learning Graph Representation Learning +2

Critical Regularizations for Neural Surface Reconstruction in the Wild

no code implementations CVPR 2022 Jingyang Zhang, Yao Yao, Shiwei Li, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

The first one is the Hessian regularization that smoothly diffuses the signed distance values to the entire distance field given noisy and incomplete input.

Surface Reconstruction

i-Razor: A Differentiable Neural Input Razor for Feature Selection and Dimension Search in DNN-Based Recommender Systems

1 code implementation1 Apr 2022 Yao Yao, Bin Liu, Haoxun He, Dakui Sheng, Ke Wang, Li Xiao, Huanhuan Cao

Noisy features and inappropriate embedding dimension assignments can deteriorate the performance of recommender systems and introduce unnecessary complexity in model training and online serving.

Click-Through Rate Prediction Feature Engineering +3

NeILF: Neural Incident Light Field for Physically-based Material Estimation

1 code implementation14 Mar 2022 Yao Yao, Jingyang Zhang, Jingbo Liu, Yihang Qu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

We present a differentiable rendering framework for material and lighting estimation from multi-view images and a reconstructed geometry.

Lighting Estimation

Large-scale Optimization of Partial AUC in a Range of False Positive Rates

no code implementations3 Mar 2022 Yao Yao, Qihang Lin, Tianbao Yang

The partial AUC, as a generalization of the AUC, summarizes only the TPRs over a specific range of the FPRs and is thus a more suitable performance measure in many real-world situations.

CLS: Cross Labeling Supervision for Semi-Supervised Learning

1 code implementation17 Feb 2022 Yao Yao, Junyi Shen, Jin Xu, Bin Zhong, Li Xiao

Based on FixMatch, where a pseudo label is generated from a weakly-augmented sample to teach the prediction on a strong augmentation of the same input sample, CLS allows the creation of both pseudo and complementary labels to support both positive and negative learning.

Pseudo Label

POI-Transformers: POI Entity Matching through POI Embeddings by Incorporating Semantic and Geographic Information

no code implementations29 Sep 2021 Jinbao Zhang, Changwang Zhang, Xiaojuan Liu, Xia Li, Weilin Liao, Penghua Liu, Yao Yao, Jihong Zhang

A general and robust POI embedding framework, the POI-Transformers, is initially proposed in this study to address these problems of POI entity matching.

Robust Model-based Reinforcement Learning for Autonomous Greenhouse Control

no code implementations26 Aug 2021 Wanpeng Zhang, Xiaoyan Cao, Yao Yao, Zhicheng An, Xi Xiao, Dijun Luo

In this paper, we present a model-based robust RL framework for autonomous greenhouse control to meet the sample efficiency and safety challenges.

Decision Making Model-based Reinforcement Learning +2

Learning Signed Distance Field for Multi-view Surface Reconstruction

1 code implementation ICCV 2021 Jingyang Zhang, Yao Yao, Long Quan

In this work, we introduce a novel neural surface reconstruction framework that leverages the knowledge of stereo matching and feature consistency to optimize the implicit surface representation.

Stereo Matching Surface Reconstruction

MBDP: A Model-based Approach to Achieve both Robustness and Sample Efficiency via Double Dropout Planning

no code implementations3 Aug 2021 Wanpeng Zhang, Xi Xiao, Yao Yao, Mingzhe Chen, Dijun Luo

MBDP consists of two kinds of dropout mechanisms, where the rollout-dropout aims to improve the robustness with a small cost of sample efficiency, while the model-dropout is designed to compensate for the lost efficiency at a slight expense of robustness.

Model-based Reinforcement Learning

IGrow: A Smart Agriculture Solution to Autonomous Greenhouse Control

1 code implementation6 Jul 2021 Xiaoyan Cao, Yao Yao, Lanqing Li, Wanpeng Zhang, Zhicheng An, Zhong Zhang, Li Xiao, Shihui Guo, Xiaoyu Cao, Meihong Wu, Dijun Luo

However, the optimal control of autonomous greenhouses is challenging, requiring decision-making based on high-dimensional sensory data, and the scaling of production is limited by the scarcity of labor capable of handling this task.

Cloud Computing Decision Making

Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation

1 code implementation5 Jul 2021 Yao Yao, Li Xiao, Zhicheng An, Wanpeng Zhang, Dijun Luo

Model-based deep reinforcement learning has achieved success in various domains that require high sample efficiencies, such as Go and robotics.

Continuous Control reinforcement-learning +1

Excess-noise suppression for a squeezed state propagating through random amplifying media via wave-front shaping

no code implementations4 Feb 2021 Dong Li, Song Sun, Yao Yao

After propagating through a random amplifying medium, a squeezed state commonly shows excess noise above the shot-noise level.

Quantum Physics

Remarks on stationary and uniformly-rotating vortex sheets: Rigidity results

no code implementations8 Dec 2020 Javier Gómez-Serrano, Jaemin Park, Jia Shi, Yao Yao

In this paper, we show that the only solution of the vortex sheet equation, either stationary or uniformly rotating with negative angular velocity $\Omega$, such that it has positive vorticity and is concentrated in a finite disjoint union of smooth curves with finite length is the trivial one: constant vorticity amplitude supported on a union of nested, concentric circles.

Analysis of PDEs

Understanding the drivers of sustainable land expansion using a patch-level simulation model: A case study in Wuhan, China

no code implementations22 Oct 2020 Xun Liang, Qingfeng Guan, Keith C. Clarke, Shishi Liu, Bingyu Wang, Yao Yao

The change complexity lies in the detailed scale of high granularity data, and in the geometric units used to simulate the change.

Computers and Society

Practical Option Valuations of Futures Contracts with Negative Underlying Prices

no code implementations25 Sep 2020 Anatoliy Swishchuk, Ana Roldan-Contreras, Elham Soufiani, Guillermo Martinez, Mohsen Seifi, Nishant Agrawal, Yao Yao

Here we propose two alternatives to Black 76 to value European option future contracts in which the underlying market prices can be negative or mean reverting.

Visibility-aware Multi-view Stereo Network

1 code implementation18 Aug 2020 Jingyang Zhang, Yao Yao, Shiwei Li, Zixin Luo, Tian Fang

As such, the adverse influence of occluded pixels is suppressed in the cost fusion.

3D Reconstruction Depth Estimation +1

KFNet: Learning Temporal Camera Relocalization using Kalman Filtering

1 code implementation CVPR 2020 Lei Zhou, Zixin Luo, Tianwei Shen, Jiahui Zhang, Mingmin Zhen, Yao Yao, Tian Fang, Long Quan

Temporal camera relocalization estimates the pose with respect to each video frame in sequence, as opposed to one-shot relocalization which focuses on a still image.

Camera Relocalization

Network Cooperation with Progressive Disambiguation for Partial Label Learning

no code implementations22 Feb 2020 Yao Yao, Chen Gong, Jiehui Deng, Jian Yang

Partial Label Learning (PLL) aims to train a classifier when each training instance is associated with a set of candidate labels, among which only one is correct but is not accessible during the training phase.

Partial Label Learning

BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Networks

2 code implementations CVPR 2020 Yao Yao, Zixin Luo, Shiwei Li, Jingyang Zhang, Yufan Ren, Lei Zhou, Tian Fang, Long Quan

Compared with other computer vision tasks, it is rather difficult to collect a large-scale MVS dataset as it requires expensive active scanners and labor-intensive process to obtain ground truth 3D structures.

3D Reconstruction

Self-Supervised Learning of Depth and Motion Under Photometric Inconsistency

1 code implementation19 Sep 2019 Tianwei Shen, Lei Zhou, Zixin Luo, Yao Yao, Shiwei Li, Jiahui Zhang, Tian Fang, Long Quan

The self-supervised learning of depth and pose from monocular sequences provides an attractive solution by using the photometric consistency of nearby frames as it depends much less on the ground-truth data.

Pose Estimation Self-Supervised Learning

ContextDesc: Local Descriptor Augmentation with Cross-Modality Context

1 code implementation CVPR 2019 Zixin Luo, Tianwei Shen, Lei Zhou, Jiahui Zhang, Yao Yao, Shiwei Li, Tian Fang, Long Quan

Most existing studies on learning local features focus on the patch-based descriptions of individual keypoints, whereas neglecting the spatial relations established from their keypoint locations.

Geometric Matching

Recurrent MVSNet for High-resolution Multi-view Stereo Depth Inference

1 code implementation CVPR 2019 Yao Yao, Zixin Luo, Shiwei Li, Tianwei Shen, Tian Fang, Long Quan

However, one major limitation of current learned MVS approaches is the scalability: the memory-consuming cost volume regularization makes the learned MVS hard to be applied to high-resolution scenes.

Vocal Bursts Intensity Prediction

GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints

1 code implementation ECCV 2018 Zixin Luo, Tianwei Shen, Lei Zhou, Siyu Zhu, Runze Zhang, Yao Yao, Tian Fang, Long Quan

Learned local descriptors based on Convolutional Neural Networks (CNNs) have achieved significant improvements on patch-based benchmarks, whereas not having demonstrated strong generalization ability on recent benchmarks of image-based 3D reconstruction.

3D Reconstruction

MVSNet: Depth Inference for Unstructured Multi-view Stereo

4 code implementations ECCV 2018 Yao Yao, Zixin Luo, Shiwei Li, Tian Fang, Long Quan

We present an end-to-end deep learning architecture for depth map inference from multi-view images.

Ranked #19 on Point Clouds on Tanks and Temples (Mean F1 (Intermediate) metric)

3D Reconstruction Point Clouds

Pulsar Candidate Identification with Artificial Intelligence Techniques

no code implementations27 Nov 2017 Ping Guo, Fuqing Duan, Pei Wang, Yao Yao, Qian Yin, Xin Xin

To address these problems, we proposed a framework which combines deep convolution generative adversarial network (DCGAN) with support vector machine (SVM) to deal with imbalance class problem and to improve pulsar identification accuracy.

Astronomy Generative Adversarial Network

Sensing Urban Land-Use Patterns By Integrating Google Tensorflow And Scene-Classification Models

no code implementations4 Aug 2017 Yao Yao, Haolin Liang, Xia Li, Jinbao Zhang, Jialv He

To take advantage of the deep-learning method in detecting urban land-use patterns, we applied a transfer-learning-based remote-sensing image approach to extract and classify features.

General Classification Scene Classification +1

Extracting urban impervious surface from GF-1 imagery using one-class classifiers

no code implementations13 May 2017 Yao Yao, Jialv He, Jinbao Zhang, Yatao Zhang

In this study, we investigate several one-class classifiers, such as Presence and Background Learning (PBL), Positive Unlabeled Learning (PUL), OCSVM, BSVM and MAXENT, to extract urban impervious surface area using high spatial resolution imagery of GF-1, China's new generation of high spatial remote sensing satellite, and evaluate the classification accuracy based on artificial interpretation results.

General Classification Management

Cannot find the paper you are looking for? You can Submit a new open access paper.