Search Results for author: Yao Yao

Found 62 papers, 25 papers with code

Surfacing Privacy Settings Using Semantic Matching

1 code implementation • EMNLP (PrivateNLP) 2020 • Rishabh Khandelwal, Asmit Nayak, Yao Yao, Kassem Fawaz

Online services utilize privacy settings to provide users with control over their data.

Paper
Code

Changing against tone merging trends in community? The case of C. Y. Leung

no code implementations • PACLIC 2018 • Ziqi Chen, Yao Yao, Alan C. L. Yu

Paper
Add Code

STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians

no code implementations • 22 Mar 2024 • Yifei Zeng, Yanqin Jiang, Siyu Zhu, Yuanxun Lu, Youtian Lin, Hao Zhu, Weiming Hu, Xun Cao, Yao Yao

Recent progress in pre-trained diffusion models and 3D generation have spurred interest in 4D content creation.

3D Generation

Paper
Add Code

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

1 code implementation • 21 Mar 2024 • Shenhao Zhu, Junming Leo Chen, Zuozhuo Dai, Yinghui Xu, Xun Cao, Yao Yao, Hao Zhu, Siyu Zhu

In this study, we introduce a methodology for human image animation by leveraging a 3D human parametric model within a latent diffusion framework to enhance shape alignment and motion guidance in curernt human generative techniques.

Animated GIF Generation Image Animation +1

2,665

Paper
Code

GaussianPro: 3D Gaussian Splatting with Progressive Propagation

no code implementations • 22 Feb 2024 • Kai Cheng, Xiaoxiao Long, Kaizhi Yang, Yao Yao, Wei Yin, Yuexin Ma, Wenping Wang, Xuejin Chen

The advent of 3D Gaussian Splatting (3DGS) has recently brought about a revolution in the field of neural rendering, facilitating high-quality renderings at real-time speed.

Neural Rendering Patch Matching

Paper
Add Code

From Pixel to Slide image: Polarization Modality-based Pathological Diagnosis Using Representation Learning

no code implementations • 3 Jan 2024 • Jia Dong, Yao Yao, Yang Dong, Hui Ma

This structure includes a pathology structure recognition method to predict structures related to thyroid tumors, an encoder-decoder network to extract pixel-level annotation information by learning the feature representations of image blocks, and an attention-based learning mechanism for the final classification task.

Representation Learning

Paper
Add Code

A Polarization and Radiomics Feature Fusion Network for the Classification of Hepatocellular Carcinoma and Intrahepatic Cholangiocarcinoma

no code implementations • 27 Dec 2023 • Jia Dong, Yao Yao, Liyan Lin, Yang Dong, Jiachen Wan, Ran Peng, Chao Li, Hui Ma

Our experimental results underscore the potential of this fusion network as a powerful tool for computer-aided diagnosis of HCC and ICC, showcasing the benefits and prospects of integrating polarization imaging techniques into the current image-intensive digital pathological diagnosis.

Paper
Add Code

Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle

no code implementations • 6 Dec 2023 • Youtian Lin, Zuozhuo Dai, Siyu Zhu, Yao Yao

Moreover, the explicit deformation modeling for discretized Gaussian points ensures ultra-fast training and rendering of a 4D scene, which is comparable to the original 3DGS designed for static 3D reconstruction.

3D Reconstruction 4D reconstruction +1

Paper
Add Code

Direct2.5: Diverse Text-to-3D Generation via Multi-view 2.5D Diffusion

no code implementations • 27 Nov 2023 • Yuanxun Lu, Jingyang Zhang, Shiwei Li, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan, Xun Cao, Yao Yao

The multi-view 2. 5D diffusion directly models the structural distribution of 3D data, while still maintaining the strong generalization ability of the original 2D diffusion model, filling the gap between 2D diffusion-based and direct 3D diffusion-based methods for 3D content generation.

3D Generation Text to 3D

Paper
Add Code

Relightable 3D Gaussian: Real-time Point Cloud Relighting with BRDF Decomposition and Ray Tracing

no code implementations • 27 Nov 2023 • Jian Gao, Chun Gu, Youtian Lin, Hao Zhu, Xun Cao, Li Zhang, Yao Yao

We present a novel differentiable point-based rendering framework for material and lighting decomposition from multi-view images, enabling editing, ray-tracing, and real-time relighting of the 3D point cloud.

BRDF estimation Lighting Estimation

Paper
Add Code

AnimateAnything: Fine-Grained Open Domain Image Animation with Motion Guidance

1 code implementation • 21 Nov 2023 • Zuozhuo Dai, Zhenghao Zhang, Yao Yao, Bingxue Qiu, Siyu Zhu, Long Qin, Weizhi Wang

Image animation is a key task in computer vision which aims to generate dynamic visual content from static image.

Image Animation Image to Video Generation

526

Paper
Code

Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

1 code implementation • 20 Nov 2023 • Zhuosheng Zhang, Yao Yao, Aston Zhang, Xiangru Tang, Xinbei Ma, Zhiwei He, Yiming Wang, Mark Gerstein, Rui Wang, Gongshen Liu, Hai Zhao

Large language models (LLMs) have dramatically enhanced the field of language intelligence, as demonstrably evidenced by their formidable empirical performance across a spectrum of complex reasoning tasks.

300

Paper
Code

Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video

no code implementations • 6 Nov 2023 • Yanqin Jiang, Li Zhang, Jin Gao, Weimin Hu, Yao Yao

This is achieved by leveraging the object-level 3D-aware image diffusion model as the primary supervision signal for training Dynamic Neural Radiance Fields (DyNeRF).

3D Generation Camera Calibration +3

Paper
Add Code

CDR-Adapter: Learning Adapters to Dig Out More Transferring Ability for Cross-Domain Recommendation Models

no code implementations • 4 Nov 2023 • Yanyu Chen, Yao Yao, Wai Kin Victor Chan, Li Xiao, Kai Zhang, Liang Zhang, Yun Ye

In this paper, we present a scalable and efficient paradigm to address data sparsity and cold-start issues in CDR, named CDR-Adapter, by decoupling the original recommendation model from the mapping function, without requiring re-engineering the network structure.

Recommendation Systems Transfer Learning

Paper
Add Code

JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation

1 code implementation • 29 Oct 2023 • Yao Yao, Peike Li, BoYu Chen, Alex Wang

With rapid advances in generative artificial intelligence, the text-to-music synthesis task has emerged as a promising direction for music generation from scratch.

Music Generation

Paper
Code

JointNet: Extending Text-to-Image Diffusion for Dense Distribution Modeling

no code implementations • 10 Oct 2023 • Jingyang Zhang, Shiwei Li, Yuanxun Lu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan, Yao Yao

We introduce JointNet, a novel neural network architecture for modeling the joint distribution of images and an additional dense modality (e. g., depth maps).

Depth Estimation Depth Prediction +1

Paper
Add Code

Anti-Aliased Neural Implicit Surfaces with Encoding Level of Detail

no code implementations • 19 Sep 2023 • Yiyu Zhuang, Qi Zhang, Ying Feng, Hao Zhu, Yao Yao, Xiaoyu Li, Yan-Pei Cao, Ying Shan, Xun Cao

Drawing inspiration from voxel-based representations with the level of detail (LoD), we introduce a multi-scale tri-plane-based scene representation that is capable of capturing the LoD of the signed distance function (SDF) and the space radiance.

Surface Reconstruction

Paper
Add Code

TOPIC: A Parallel Association Paradigm for Multi-Object Tracking under Complex Motions and Diverse Scenes

2 code implementations • 22 Aug 2023 • Xiaoyan Cao, Yiyao Zheng, Yao Yao, Huapeng Qin, Xiaoyu Cao, Shihui Guo

Existing trackers can be categorized into two association paradigms: single-feature paradigm (based on either motion or appearance feature) and serial paradigm (one feature serves as secondary while the other is primary).

Multi-Object Tracking

328

Paper
Code

JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models

2 code implementations • 9 Aug 2023 • Peike Li, BoYu Chen, Yao Yao, Yikai Wang, Allen Wang, Alex Wang

Despite the task's significance, prevailing generative models exhibit limitations in music quality, computational efficiency, and generalization.

Ranked #1 on Text-to-Music Generation on MusicCaps

Computational Efficiency In-Context Learning +2

Paper
Code

AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation

no code implementations • 16 Jun 2023 • Yifei Zeng, Yuanxun Lu, Xinya Ji, Yao Yao, Hao Zhu, Xun Cao

Unlike previous approaches that can only synthesize avatars based on simple text descriptions, our method enables the creation of personalized avatars from casually captured face or body images, while still supporting text-based model generation and editing.

Text to 3D

Paper
Add Code

Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models

2 code implementations • 26 May 2023 • Yao Yao, Zuchao Li, Hai Zhao

Therefore, we propose Graph-of-Thought (GoT) reasoning, which models human thought processes not only as a chain but also as a graph.

GSM8K Multimodal Reasoning +1

617

Paper
Code

NeILF++: Inter-Reflectable Light Fields for Geometry and Material Estimation

no code implementations • ICCV 2023 • Jingyang Zhang, Yao Yao, Shiwei Li, Jingbo Liu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

We present a novel differentiable rendering framework for joint geometry, material, and lighting estimation from multi-view images.

Lighting Estimation

Paper
Add Code

Stochastic Methods for AUC Optimization subject to AUC-based Fairness Constraints

no code implementations • 23 Dec 2022 • Yao Yao, Qihang Lin, Tianbao Yang

In this work, we formulate the training problem of a fairness-aware machine learning model as an AUC optimization problem subject to a class of AUC-based fairness constraints.

Fairness

Paper
Add Code

Towards Relation-centered Pooling and Convolution for Heterogeneous Graph Learning Networks

1 code implementation • 31 Oct 2022 • Tiehua Zhang, Yuze Liu, Yao Yao, Youhua Xia, Xin Chen, Xiaowei Huang, Jiong Jin

Heterogeneous graph neural network has unleashed great potential on graph representation learning and shown superior performance on downstream tasks such as node classification and clustering.

Graph Learning Graph Representation Learning +2

778

Paper
Code

Critical Regularizations for Neural Surface Reconstruction in the Wild

no code implementations • CVPR 2022 • Jingyang Zhang, Yao Yao, Shiwei Li, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

The first one is the Hessian regularization that smoothly diffuses the signed distance values to the entire distance field given noisy and incomplete input.

Surface Reconstruction

Paper
Add Code

i-Razor: A Differentiable Neural Input Razor for Feature Selection and Dimension Search in DNN-Based Recommender Systems

1 code implementation • 1 Apr 2022 • Yao Yao, Bin Liu, Haoxun He, Dakui Sheng, Ke Wang, Li Xiao, Huanhuan Cao

Noisy features and inappropriate embedding dimension assignments can deteriorate the performance of recommender systems and introduce unnecessary complexity in model training and online serving.

Click-Through Rate Prediction Feature Engineering +3

Paper
Code

NeILF: Neural Incident Light Field for Physically-based Material Estimation

1 code implementation • 14 Mar 2022 • Yao Yao, Jingyang Zhang, Jingbo Liu, Yihang Qu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

We present a differentiable rendering framework for material and lighting estimation from multi-view images and a reconstructed geometry.

Lighting Estimation

108

Paper
Code

Large-scale Optimization of Partial AUC in a Range of False Positive Rates

no code implementations • 3 Mar 2022 • Yao Yao, Qihang Lin, Tianbao Yang

The partial AUC, as a generalization of the AUC, summarizes only the TPRs over a specific range of the FPRs and is thus a more suitable performance measure in many real-world situations.

Paper
Add Code

CLS: Cross Labeling Supervision for Semi-Supervised Learning

1 code implementation • 17 Feb 2022 • Yao Yao, Junyi Shen, Jin Xu, Bin Zhong, Li Xiao

Based on FixMatch, where a pseudo label is generated from a weakly-augmented sample to teach the prediction on a strong augmentation of the same input sample, CLS allows the creation of both pseudo and complementary labels to support both positive and negative learning.

Pseudo Label

Paper
Code

POI-Transformers: POI Entity Matching through POI Embeddings by Incorporating Semantic and Geographic Information

no code implementations • 29 Sep 2021 • Jinbao Zhang, Changwang Zhang, Xiaojuan Liu, Xia Li, Weilin Liao, Penghua Liu, Yao Yao, Jihong Zhang

A general and robust POI embedding framework, the POI-Transformers, is initially proposed in this study to address these problems of POI entity matching.

Paper
Add Code

Robust Model-based Reinforcement Learning for Autonomous Greenhouse Control

no code implementations • 26 Aug 2021 • Wanpeng Zhang, Xiaoyan Cao, Yao Yao, Zhicheng An, Xi Xiao, Dijun Luo

In this paper, we present a model-based robust RL framework for autonomous greenhouse control to meet the sample efficiency and safety challenges.

Decision Making Model-based Reinforcement Learning +2

Paper
Add Code

Learning Signed Distance Field for Multi-view Surface Reconstruction

1 code implementation • ICCV 2021 • Jingyang Zhang, Yao Yao, Long Quan

In this work, we introduce a novel neural surface reconstruction framework that leverages the knowledge of stereo matching and feature consistency to optimize the implicit surface representation.

Stereo Matching Surface Reconstruction

135

Paper
Code

MBDP: A Model-based Approach to Achieve both Robustness and Sample Efficiency via Double Dropout Planning

no code implementations • 3 Aug 2021 • Wanpeng Zhang, Xi Xiao, Yao Yao, Mingzhe Chen, Dijun Luo

MBDP consists of two kinds of dropout mechanisms, where the rollout-dropout aims to improve the robustness with a small cost of sample efficiency, while the model-dropout is designed to compensate for the lost efficiency at a slight expense of robustness.

Model-based Reinforcement Learning

Paper
Add Code

IGrow: A Smart Agriculture Solution to Autonomous Greenhouse Control

1 code implementation • 6 Jul 2021 • Xiaoyan Cao, Yao Yao, Lanqing Li, Wanpeng Zhang, Zhicheng An, Zhong Zhang, Li Xiao, Shihui Guo, Xiaoyu Cao, Meihong Wu, Dijun Luo

However, the optimal control of autonomous greenhouses is challenging, requiring decision-making based on high-dimensional sensory data, and the scaling of production is limited by the scarcity of labor capable of handling this task.

Cloud Computing Decision Making

Paper
Code

Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation

1 code implementation • 5 Jul 2021 • Yao Yao, Li Xiao, Zhicheng An, Wanpeng Zhang, Dijun Luo

Model-based deep reinforcement learning has achieved success in various domains that require high sample efficiencies, such as Go and robotics.

Continuous Control reinforcement-learning +1

445

Paper
Code

Excess-noise suppression for a squeezed state propagating through random amplifying media via wave-front shaping

no code implementations • 4 Feb 2021 • Dong Li, Song Sun, Yao Yao

After propagating through a random amplifying medium, a squeezed state commonly shows excess noise above the shot-noise level.

Quantum Physics

Paper
Add Code

Remarks on stationary and uniformly-rotating vortex sheets: Rigidity results

no code implementations • 8 Dec 2020 • Javier Gómez-Serrano, Jaemin Park, Jia Shi, Yao Yao

In this paper, we show that the only solution of the vortex sheet equation, either stationary or uniformly rotating with negative angular velocity $\Omega$, such that it has positive vorticity and is concentrated in a finite disjoint union of smooth curves with finite length is the trivial one: constant vorticity amplitude supported on a union of nested, concentric circles.

Analysis of PDEs

Paper
Add Code

Understanding the drivers of sustainable land expansion using a patch-level simulation model: A case study in Wuhan, China

no code implementations • 22 Oct 2020 • Xun Liang, Qingfeng Guan, Keith C. Clarke, Shishi Liu, Bingyu Wang, Yao Yao

The change complexity lies in the detailed scale of high granularity data, and in the geometric units used to simulate the change.

Computers and Society

Paper
Add Code

Practical Option Valuations of Futures Contracts with Negative Underlying Prices

no code implementations • 25 Sep 2020 • Anatoliy Swishchuk, Ana Roldan-Contreras, Elham Soufiani, Guillermo Martinez, Mohsen Seifi, Nishant Agrawal, Yao Yao

Here we propose two alternatives to Black 76 to value European option future contracts in which the underlying market prices can be negative or mean reverting.

Paper
Add Code

Visibility-aware Multi-view Stereo Network

1 code implementation • 18 Aug 2020 • Jingyang Zhang, Yao Yao, Shiwei Li, Zixin Luo, Tian Fang

As such, the adverse influence of occluded pixels is suppressed in the cost fusion.

Ranked #1 on Point Clouds on DTU

3D Reconstruction Depth Estimation +1

232

Paper
Code

Learning Stereo Matchability in Disparity Regression Networks

1 code implementation • 11 Aug 2020 • Jingyang Zhang, Yao Yao, Zixin Luo, Shiwei Li, Tianwei Shen, Tian Fang, Long Quan

Finally, a matchability-aware disparity refinement is introduced to improve the depth inference in weakly matchable regions.

Ranked #2 on Stereo Disparity Estimation on KITTI 2015

regression Stereo Disparity Estimation +1

Paper
Code

KFNet: Learning Temporal Camera Relocalization using Kalman Filtering

1 code implementation • CVPR 2020 • Lei Zhou, Zixin Luo, Tianwei Shen, Jiahui Zhang, Mingmin Zhen, Yao Yao, Tian Fang, Long Quan

Temporal camera relocalization estimates the pose with respect to each video frame in sequence, as opposed to one-shot relocalization which focuses on a still image.

Camera Relocalization

211

Paper
Code

ASLFeat: Learning Local Features of Accurate Shape and Localization

4 code implementations • CVPR 2020 • Zixin Luo, Lei Zhou, Xuyang Bai, Hongkai Chen, Jiahui Zhang, Yao Yao, Shiwei Li, Tian Fang, Long Quan

This work focuses on mitigating two limitations in the joint learning of local feature detectors and descriptors.

3D Reconstruction Keypoint detection and image matching

302

Paper
Code

Network Cooperation with Progressive Disambiguation for Partial Label Learning

no code implementations • 22 Feb 2020 • Yao Yao, Chen Gong, Jiehui Deng, Jian Yang

Partial Label Learning (PLL) aims to train a classifier when each training instance is associated with a set of candidate labels, among which only one is correct but is not accessible during the training phase.

Partial Label Learning

Paper
Add Code

BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Networks

2 code implementations • CVPR 2020 • Yao Yao, Zixin Luo, Shiwei Li, Jingyang Zhang, Yufan Ren, Lei Zhou, Tian Fang, Long Quan

Compared with other computer vision tasks, it is rather difficult to collect a large-scale MVS dataset as it requires expensive active scanners and labor-intensive process to obtain ground truth 3D structures.

3D Reconstruction

515

Paper
Code

Self-Supervised Learning of Depth and Motion Under Photometric Inconsistency

1 code implementation • 19 Sep 2019 • Tianwei Shen, Lei Zhou, Zixin Luo, Yao Yao, Shiwei Li, Jiahui Zhang, Tian Fang, Long Quan

The self-supervised learning of depth and pose from monocular sequences provides an attractive solution by using the photometric consistency of nearby frames as it depends much less on the ground-truth data.

Pose Estimation Self-Supervised Learning

197

Paper
Code

ContextDesc: Local Descriptor Augmentation with Cross-Modality Context

1 code implementation • CVPR 2019 • Zixin Luo, Tianwei Shen, Lei Zhou, Jiahui Zhang, Yao Yao, Shiwei Li, Tian Fang, Long Quan

Most existing studies on learning local features focus on the patch-based descriptions of individual keypoints, whereas neglecting the spatial relations established from their keypoint locations.

Geometric Matching

226

Paper
Code

Recurrent MVSNet for High-resolution Multi-view Stereo Depth Inference

1 code implementation • CVPR 2019 • Yao Yao, Zixin Luo, Shiwei Li, Tianwei Shen, Tian Fang, Long Quan

However, one major limitation of current learned MVS approaches is the scalability: the memory-consuming cost volume regularization makes the learned MVS hard to be applied to high-resolution scenes.

Vocal Bursts Intensity Prediction

1,333

Paper
Code

GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints

1 code implementation • ECCV 2018 • Zixin Luo, Tianwei Shen, Lei Zhou, Siyu Zhu, Runze Zhang, Yao Yao, Tian Fang, Long Quan

Learned local descriptors based on Convolutional Neural Networks (CNNs) have achieved significant improvements on patch-based benchmarks, whereas not having demonstrated strong generalization ability on recent benchmarks of image-based 3D reconstruction.

3D Reconstruction

190

Paper
Code

Reconstructing Thin Structures of Manifold Surfaces by Integrating Spatial Curves

no code implementations • CVPR 2018 • Shiwei Li, Yao Yao, Tian Fang, Long Quan

We present a novel surface reconstruction method using both curves and point clouds.

Surface Reconstruction

Paper
Add Code

MVSNet: Depth Inference for Unstructured Multi-view Stereo

4 code implementations • ECCV 2018 • Yao Yao, Zixin Luo, Shiwei Li, Tian Fang, Long Quan

We present an end-to-end deep learning architecture for depth map inference from multi-view images.

Ranked #19 on Point Clouds on Tanks and Temples (Mean F1 (Intermediate) metric)

3D Reconstruction Point Clouds

1,333

Paper
Code

Pulsar Candidate Identification with Artificial Intelligence Techniques

no code implementations • 27 Nov 2017 • Ping Guo, Fuqing Duan, Pei Wang, Yao Yao, Qian Yin, Xin Xin

To address these problems, we proposed a framework which combines deep convolution generative adversarial network (DCGAN) with support vector machine (SVM) to deal with imbalance class problem and to improve pulsar identification accuracy.

Astronomy Generative Adversarial Network

Paper
Add Code

Multi-dimensional Meanings of Subjective Adverbs - Case Study of Mandarin Chinese Adverb Pianpian

no code implementations • PACLIC 2017 • Mi Zhou, Yao Yao, Chu-Ren Huang

Paper
Add Code

Optimal Resource Allocation in Distributed Broadband Wireless Communication Systems

no code implementations • 24 Oct 2017 • Yao Yao, Mustafa Mehmet Ali, Shahin Vakilinia

In this paper, we consider a combined optimization of BWC systems.

Paper
Add Code

Sensing Urban Land-Use Patterns By Integrating Google Tensorflow And Scene-Classification Models

no code implementations • 4 Aug 2017 • Yao Yao, Haolin Liang, Xia Li, Jinbao Zhang, Jialv He

To take advantage of the deep-learning method in detecting urban land-use patterns, we applied a transfer-learning-based remote-sensing image approach to extract and classify features.

General Classification Scene Classification +1

Paper
Add Code

Extracting urban impervious surface from GF-1 imagery using one-class classifiers

no code implementations • 13 May 2017 • Yao Yao, Jialv He, Jinbao Zhang, Yatao Zhang

In this study, we investigate several one-class classifiers, such as Presence and Background Learning (PBL), Positive Unlabeled Learning (PUL), OCSVM, BSVM and MAXENT, to extract urban impervious surface area using high spatial resolution imagery of GF-1, China's new generation of high spatial remote sensing satellite, and evaluate the classification accuracy based on artificial interpretation results.

General Classification Management