Search Results for author: Sen Zhang

Found 27 papers, 11 papers with code

Fine-grained Factual Consistency Assessment for Abstractive Summarization Models

no code implementations • EMNLP 2021 • Sen Zhang, Jianwei Niu, Chuyuan Wei

Fact consistency assessment requires the reasoning capability to find subtle clues to identify whether a model-generated summary is consistent with the original document.

Abstractive Text Summarization Sentence

Paper
Add Code

Towards Theoretical Understandings of Self-Consuming Generative Models

no code implementations • 19 Feb 2024 • Shi Fu, Sen Zhang, Yingjie Wang, Xinmei Tian, DaCheng Tao

This paper tackles the emerging challenge of training generative models within a self-consuming loop, wherein successive generations of models are recursively trained on mixtures of real and synthetic data from previous generations.

Paper
Add Code

Mitigating Reward Hacking via Information-Theoretic Reward Modeling

no code implementations • 14 Feb 2024 • Yuchun Miao, Sen Zhang, Liang Ding, Rong Bao, Lefei Zhang, DaCheng Tao

Inspired by this finding, we propose the Integrated Cluster Deviation Score (ICDS), which quantifies deviations in the latent space, as an indicator of reward overoptimization to facilitate the development of online mitigation strategies.

Paper
Add Code

Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases

no code implementations • 13 Feb 2024 • Ziyi Zhang, Sen Zhang, Yibing Zhan, Yong Luo, Yonggang Wen, DaCheng Tao

Then, we surprisingly discover that dormant neurons in our critic model act as a regularization against overoptimization, while active neurons reflect primacy bias in this setting.

Denoising Inductive Bias

Paper
Add Code

FreDF: Learning to Forecast in Frequency Domain

no code implementations • 4 Feb 2024 • Hao Wang, Licheng Pan, Zhichao Chen, Degui Yang, Sen Zhang, Yifei Yang, Xinggao Liu, Haoxuan Li, DaCheng Tao

Time series modeling is uniquely challenged by the presence of autocorrelation in both historical and label sequences.

Time Series

Paper
Add Code

S2M: Converting Single-Turn to Multi-Turn Datasets for Conversational Question Answering

no code implementations • 27 Dec 2023 • Baokui Li, Sen Zhang, Wangshu Zhang, Yicheng Chen, Changlin Yang, Sen Hu, Teng Xu, Siye liu, Jiwei Li

To solve this problem, we propose a novel method to convert single-turn datasets to multi-turn datasets.

Conversational Question Answering Data Augmentation

Paper
Add Code

Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages

1 code implementation • 11 Oct 2023 • Guozheng Ma, Lu Li, Sen Zhang, Zixuan Liu, Zhen Wang, Yixin Chen, Li Shen, Xueqian Wang, DaCheng Tao

Plasticity, the ability of a neural network to evolve with new data, is crucial for high-performance and sample-efficient visual reinforcement learning (VRL).

Data Augmentation reinforcement-learning

Paper
Code

The Importance of Multimodal Emotion Conditioning and Affect Consistency for Embodied Conversational Agents

no code implementations • 26 Sep 2023 • Che-Jui Chang, Samuel S. Sohn, Sen Zhang, Rajath Jayashankar, Muhammad Usman, Mubbasir Kapadia

We have conducted a user study with 199 participants to assess how the average person judges the affects perceived from multimodal behaviors that are consistent and inconsistent with respect to a driving affect.

Paper
Add Code

Are Large Language Models Really Robust to Word-Level Perturbations?

1 code implementation • 20 Sep 2023 • Haoyu Wang, Guozheng Ma, Cong Yu, Ning Gui, Linrui Zhang, Zhiqi Huang, Suwei Ma, Yongzhe Chang, Sen Zhang, Li Shen, Xueqian Wang, Peilin Zhao, DaCheng Tao

Notably, we are surprised to discover that robustness tends to decrease as fine-tuning (SFT and RLHF) is conducted.

Question Answering

Paper
Code

Instruction Tuning for Large Language Models: A Survey

1 code implementation • 21 Aug 2023 • Shengyu Zhang, Linfeng Dong, Xiaoya Li, Sen Zhang, Xiaofei Sun, Shuhe Wang, Jiwei Li, Runyi Hu, Tianwei Zhang, Fei Wu, Guoyin Wang

This paper surveys research works in the quickly advancing field of instruction tuning (IT), a crucial technique to enhance the capabilities and controllability of large language models (LLMs).

Paper
Code

Robust Audio Anti-Spoofing with Fusion-Reconstruction Learning on Multi-Order Spectrograms

1 code implementation • 18 Aug 2023 • Penghui Wen, Kun Hu, Wenxi Yue, Sen Zhang, Wanlei Zhou, Zhiyong Wang

Robust audio anti-spoofing has been increasingly challenging due to the recent advancements on deepfake techniques.

Face Swapping

Paper
Code

Fine-grained building roof instance segmentation based on domain adapted pretraining and composite dual-backbone

no code implementations • 10 Aug 2023 • Guozhang Liu, Baochai Peng, Ting Liu, Pan Zhang, Mengke Yuan, Chaoran Lu, Ningning Cao, Sen Zhang, Simin Huang, Tao Wang

The diversity of building architecture styles of global cities situated on various landforms, the degraded optical imagery affected by clouds and shadows, and the significant inter-class imbalance of roof types pose challenges for designing a robust and accurate building roof instance segmentor.

Data Augmentation Instance Segmentation +1

Paper
Add Code

HGDNet: A Height-Hierarchy Guided Dual-Decoder Network for Single View Building Extraction and Height Estimation

no code implementations • 10 Aug 2023 • Chaoran Lu, Ningning Cao, Pan Zhang, Ting Liu, Baochai Peng, Guozhang Liu, Mengke Yuan, Sen Zhang, Simin Huang, Tao Wang

Unifying the correlative single-view satellite image building extraction and height estimation tasks indicates a promising way to share representations and acquire generalist model for large-scale urban 3D reconstruction.

3D Reconstruction

Paper
Add Code

Image Captions are Natural Prompts for Text-to-Image Models

1 code implementation • 17 Jul 2023 • Shiye Lei, Hao Chen, Sen Zhang, Bo Zhao, DaCheng Tao

With the rapid development of Artificial Intelligence Generated Content (AIGC), it has become common practice in many learning tasks to train or fine-tune large models on synthetic data due to the data-scarcity and privacy leakage problems.

Image Captioning Image Generation

Paper
Code

ReGeneration Learning of Diffusion Models with Rich Prompts for Zero-Shot Image Translation

no code implementations • 8 May 2023 • Yupei Lin, Sen Zhang, Xiaojun Yang, Xiao Wang, Yukai Shi

To ensure consistent preservation of the shape during image editing, we propose cross-attention guidance based on regeneration learning.

Paper
Add Code

Event-based Simultaneous Localization and Mapping: A Comprehensive Survey

1 code implementation • 19 Apr 2023 • Kunping Huang, Sen Zhang, Jing Zhang, DaCheng Tao

This paper presents a timely and comprehensive review of event-based vSLAM algorithms that exploit the benefits of asynchronous and irregular event streams for localization and mapping tasks.

Motion Compensation Simultaneous Localization and Mapping

Paper
Code

Multi-task Adversarial Learning for Semi-supervised Trajectory-User Linking

1 code implementation • ECML-PKDD 2023 • Sen Zhang, Senzhang Wang, Xiang Wang, Shigeng Zhang, Hao Miao & Junxing Zhu

We first project users and trajectories into the common latent feature space through learning a projection function (generator) to minimize the distance between the user distribution and the trajectory distribution.

Multi-Task Learning

Paper
Code

Criteria Comparative Learning for Real-scene Image Super-Resolution

2 code implementations • 26 Jul 2022 • Yukai Shi, Hao Li, Sen Zhang, Zhijing Yang, Xiao Wang

Inspired by the observation that the contrastive relationship could also exist between the criteria, in this work, we propose a novel training paradigm for RealSR, named Criteria Comparative Learning (Cria-CL), by developing contrastive losses defined on criteria instead of image patches.

Contrastive Learning Image Super-Resolution +1

Paper
Code

JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes

1 code implementation • 16 Jul 2022 • Haimei Zhao, Jing Zhang, Sen Zhang, DaCheng Tao

A naive way is to accomplish them independently in a sequential or parallel manner, but there are many drawbacks, i. e., 1) the depth and VO results suffer from the inherent scale ambiguity issue; 2) the BEV layout is directly predicted from the front-view image without using any depth-related information, although the depth map contains useful geometry clues for inferring scene layouts.

Autonomous Driving Depth Estimation +3

Paper
Code

Towards Scale-Aware, Robust, and Generalizable Unsupervised Monocular Depth Estimation by Integrating IMU Motion Dynamics

1 code implementation • 11 Jul 2022 • Sen Zhang, Jing Zhang, DaCheng Tao

Unsupervised monocular depth and ego-motion estimation has drawn extensive research attention in recent years.

Monocular Depth Estimation Motion Estimation +1

191

Paper
Code

Exploring Negatives in Contrastive Learning for Unpaired Image-to-Image Translation

no code implementations • 23 Apr 2022 • Yupei Lin, Sen Zhang, Tianshui Chen, Yongyi Lu, Guangping Li, Yukai Shi

Recently, contrastive learning (CL) has been used to further investigate the image correspondence in unpaired image translation by using patch-based positive/negative learning.

Contrastive Learning Image-to-Image Translation +1

Paper
Add Code

Information-Theoretic Odometry Learning

no code implementations • 11 Mar 2022 • Sen Zhang, Jing Zhang, DaCheng Tao

In this paper, we propose a unified information theoretic framework for learning-motivated methods aimed at odometry estimation, a crucial component of many robotics and vision tasks such as navigation and virtual reality where relative camera poses are required in real time.

Paper
Add Code

Towards Scale Consistent Monocular Visual Odometry by Learning from the Virtual World

no code implementations • 11 Mar 2022 • Sen Zhang, Jing Zhang, DaCheng Tao

In this work, we propose VRVO, a novel framework for retrieving the absolute scale from virtual data that can be easily obtained from modern simulation environments, whereas in the real domain no stereo or ground-truth data are required in either the training or inference phases.

Monocular Visual Odometry

Paper
Add Code

Multimodal Representations Learning Based on Mutual Information Maximization and Minimization and Identity Embedding for Multimodal Sentiment Analysis

no code implementations • 10 Jan 2022 • Jiahao Zheng, Sen Zhang, XiaoPing Wang, Zhigang Zeng

Multimodal sentiment analysis (MSA) is a fundamental complex research problem due to the heterogeneity gap between different modalities and the ambiguity of human emotional expression.

Multimodal Sentiment Analysis

Paper
Add Code

Continuation Newton methods with deflation techniques for global optimization problems

1 code implementation • 29 Jul 2021 • Xin-long Luo, Hang Xiao, Sen Zhang

The global minimum point of an optimization problem is of interest in engineering fields and it is difficult to be found, especially for a nonconvex large-scale optimization problem.

Evolutionary Algorithms

Paper
Code

GTAE: Graph-Transformer based Auto-Encoders for Linguistic-Constrained Text Style Transfer

no code implementations • 1 Feb 2021 • Yukai Shi, Sen Zhang, Chenxing Zhou, Xiaodan Liang, Xiaojun Yang, Liang Lin

Non-parallel text style transfer has attracted increasing research interests in recent years.

Sentence Style Transfer +1

Paper
Add Code

Community Preserved Social Graph Publishing with Node Differential Privacy

no code implementations • 5 Jan 2021 • Sen Zhang, Weiwei Ni, Nan Fu

Community structure, which is an important global pattern of nodes, is a crucial data utility as it serves as fundamental operations for many graph analysis tasks.

Cryptography and Security

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.