Search Results for author: Sen Zhang

Found 27 papers, 11 papers with code

Fine-grained Factual Consistency Assessment for Abstractive Summarization Models

no code implementations EMNLP 2021 Sen Zhang, Jianwei Niu, Chuyuan Wei

Fact consistency assessment requires the reasoning capability to find subtle clues to identify whether a model-generated summary is consistent with the original document.

Abstractive Text Summarization Sentence

Towards Theoretical Understandings of Self-Consuming Generative Models

no code implementations19 Feb 2024 Shi Fu, Sen Zhang, Yingjie Wang, Xinmei Tian, DaCheng Tao

This paper tackles the emerging challenge of training generative models within a self-consuming loop, wherein successive generations of models are recursively trained on mixtures of real and synthetic data from previous generations.

Mitigating Reward Hacking via Information-Theoretic Reward Modeling

no code implementations14 Feb 2024 Yuchun Miao, Sen Zhang, Liang Ding, Rong Bao, Lefei Zhang, DaCheng Tao

Inspired by this finding, we propose the Integrated Cluster Deviation Score (ICDS), which quantifies deviations in the latent space, as an indicator of reward overoptimization to facilitate the development of online mitigation strategies.

Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases

no code implementations13 Feb 2024 Ziyi Zhang, Sen Zhang, Yibing Zhan, Yong Luo, Yonggang Wen, DaCheng Tao

Then, we surprisingly discover that dormant neurons in our critic model act as a regularization against overoptimization, while active neurons reflect primacy bias in this setting.

Denoising Inductive Bias

FreDF: Learning to Forecast in Frequency Domain

no code implementations4 Feb 2024 Hao Wang, Licheng Pan, Zhichao Chen, Degui Yang, Sen Zhang, Yifei Yang, Xinggao Liu, Haoxuan Li, DaCheng Tao

Time series modeling is uniquely challenged by the presence of autocorrelation in both historical and label sequences.

Time Series

Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages

1 code implementation11 Oct 2023 Guozheng Ma, Lu Li, Sen Zhang, Zixuan Liu, Zhen Wang, Yixin Chen, Li Shen, Xueqian Wang, DaCheng Tao

Plasticity, the ability of a neural network to evolve with new data, is crucial for high-performance and sample-efficient visual reinforcement learning (VRL).

Data Augmentation reinforcement-learning

The Importance of Multimodal Emotion Conditioning and Affect Consistency for Embodied Conversational Agents

no code implementations26 Sep 2023 Che-Jui Chang, Samuel S. Sohn, Sen Zhang, Rajath Jayashankar, Muhammad Usman, Mubbasir Kapadia

We have conducted a user study with 199 participants to assess how the average person judges the affects perceived from multimodal behaviors that are consistent and inconsistent with respect to a driving affect.

Instruction Tuning for Large Language Models: A Survey

1 code implementation21 Aug 2023 Shengyu Zhang, Linfeng Dong, Xiaoya Li, Sen Zhang, Xiaofei Sun, Shuhe Wang, Jiwei Li, Runyi Hu, Tianwei Zhang, Fei Wu, Guoyin Wang

This paper surveys research works in the quickly advancing field of instruction tuning (IT), a crucial technique to enhance the capabilities and controllability of large language models (LLMs).

Robust Audio Anti-Spoofing with Fusion-Reconstruction Learning on Multi-Order Spectrograms

1 code implementation18 Aug 2023 Penghui Wen, Kun Hu, Wenxi Yue, Sen Zhang, Wanlei Zhou, Zhiyong Wang

Robust audio anti-spoofing has been increasingly challenging due to the recent advancements on deepfake techniques.

Face Swapping

Fine-grained building roof instance segmentation based on domain adapted pretraining and composite dual-backbone

no code implementations10 Aug 2023 Guozhang Liu, Baochai Peng, Ting Liu, Pan Zhang, Mengke Yuan, Chaoran Lu, Ningning Cao, Sen Zhang, Simin Huang, Tao Wang

The diversity of building architecture styles of global cities situated on various landforms, the degraded optical imagery affected by clouds and shadows, and the significant inter-class imbalance of roof types pose challenges for designing a robust and accurate building roof instance segmentor.

Data Augmentation Instance Segmentation +1

HGDNet: A Height-Hierarchy Guided Dual-Decoder Network for Single View Building Extraction and Height Estimation

no code implementations10 Aug 2023 Chaoran Lu, Ningning Cao, Pan Zhang, Ting Liu, Baochai Peng, Guozhang Liu, Mengke Yuan, Sen Zhang, Simin Huang, Tao Wang

Unifying the correlative single-view satellite image building extraction and height estimation tasks indicates a promising way to share representations and acquire generalist model for large-scale urban 3D reconstruction.

3D Reconstruction

Image Captions are Natural Prompts for Text-to-Image Models

1 code implementation17 Jul 2023 Shiye Lei, Hao Chen, Sen Zhang, Bo Zhao, DaCheng Tao

With the rapid development of Artificial Intelligence Generated Content (AIGC), it has become common practice in many learning tasks to train or fine-tune large models on synthetic data due to the data-scarcity and privacy leakage problems.

Image Captioning Image Generation

ReGeneration Learning of Diffusion Models with Rich Prompts for Zero-Shot Image Translation

no code implementations8 May 2023 Yupei Lin, Sen Zhang, Xiaojun Yang, Xiao Wang, Yukai Shi

To ensure consistent preservation of the shape during image editing, we propose cross-attention guidance based on regeneration learning.

Event-based Simultaneous Localization and Mapping: A Comprehensive Survey

1 code implementation19 Apr 2023 Kunping Huang, Sen Zhang, Jing Zhang, DaCheng Tao

This paper presents a timely and comprehensive review of event-based vSLAM algorithms that exploit the benefits of asynchronous and irregular event streams for localization and mapping tasks.

Motion Compensation Simultaneous Localization and Mapping

Multi-task Adversarial Learning for Semi-supervised Trajectory-User Linking

1 code implementation ECML-PKDD 2023 Sen Zhang, Senzhang Wang, Xiang Wang, Shigeng Zhang, Hao Miao & Junxing Zhu

We first project users and trajectories into the common latent feature space through learning a projection function (generator) to minimize the distance between the user distribution and the trajectory distribution.

Multi-Task Learning

Criteria Comparative Learning for Real-scene Image Super-Resolution

2 code implementations26 Jul 2022 Yukai Shi, Hao Li, Sen Zhang, Zhijing Yang, Xiao Wang

Inspired by the observation that the contrastive relationship could also exist between the criteria, in this work, we propose a novel training paradigm for RealSR, named Criteria Comparative Learning (Cria-CL), by developing contrastive losses defined on criteria instead of image patches.

Contrastive Learning Image Super-Resolution +1

JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes

1 code implementation16 Jul 2022 Haimei Zhao, Jing Zhang, Sen Zhang, DaCheng Tao

A naive way is to accomplish them independently in a sequential or parallel manner, but there are many drawbacks, i. e., 1) the depth and VO results suffer from the inherent scale ambiguity issue; 2) the BEV layout is directly predicted from the front-view image without using any depth-related information, although the depth map contains useful geometry clues for inferring scene layouts.

Autonomous Driving Depth Estimation +3

Exploring Negatives in Contrastive Learning for Unpaired Image-to-Image Translation

no code implementations23 Apr 2022 Yupei Lin, Sen Zhang, Tianshui Chen, Yongyi Lu, Guangping Li, Yukai Shi

Recently, contrastive learning (CL) has been used to further investigate the image correspondence in unpaired image translation by using patch-based positive/negative learning.

Contrastive Learning Image-to-Image Translation +1

Information-Theoretic Odometry Learning

no code implementations11 Mar 2022 Sen Zhang, Jing Zhang, DaCheng Tao

In this paper, we propose a unified information theoretic framework for learning-motivated methods aimed at odometry estimation, a crucial component of many robotics and vision tasks such as navigation and virtual reality where relative camera poses are required in real time.

Towards Scale Consistent Monocular Visual Odometry by Learning from the Virtual World

no code implementations11 Mar 2022 Sen Zhang, Jing Zhang, DaCheng Tao

In this work, we propose VRVO, a novel framework for retrieving the absolute scale from virtual data that can be easily obtained from modern simulation environments, whereas in the real domain no stereo or ground-truth data are required in either the training or inference phases.

Monocular Visual Odometry

Multimodal Representations Learning Based on Mutual Information Maximization and Minimization and Identity Embedding for Multimodal Sentiment Analysis

no code implementations10 Jan 2022 Jiahao Zheng, Sen Zhang, XiaoPing Wang, Zhigang Zeng

Multimodal sentiment analysis (MSA) is a fundamental complex research problem due to the heterogeneity gap between different modalities and the ambiguity of human emotional expression.

Multimodal Sentiment Analysis

Continuation Newton methods with deflation techniques for global optimization problems

1 code implementation29 Jul 2021 Xin-long Luo, Hang Xiao, Sen Zhang

The global minimum point of an optimization problem is of interest in engineering fields and it is difficult to be found, especially for a nonconvex large-scale optimization problem.

Evolutionary Algorithms

Community Preserved Social Graph Publishing with Node Differential Privacy

no code implementations5 Jan 2021 Sen Zhang, Weiwei Ni, Nan Fu

Community structure, which is an important global pattern of nodes, is a crucial data utility as it serves as fundamental operations for many graph analysis tasks.

Cryptography and Security

Cannot find the paper you are looking for? You can Submit a new open access paper.