Search Results for author: Peng Sun

Found 71 papers, 24 papers with code

EmojiCloud: a Tool for Emoji Cloud Visualization

no code implementations NAACL (Emoji) 2022 Yunhe Feng, Cheng Guo, Bingbing Wen, Peng Sun, Yufei Yue, Dingwen Tao

This paper proposes EmojiCloud, an open-source Python-based emoji cloud visualization tool, to generate a quick and straightforward understanding of emojis from the perspective of frequency and importance.

LoongServe: Efficiently Serving Long-context Large Language Models with Elastic Sequence Parallelism

no code implementations 15 Apr 2024 Bingyang Wu, Shengyu Liu, Yinmin Zhong, Peng Sun, Xuanzhe Liu, Xin Jin

The context window of large language models (LLMs) is rapidly increasing, leading to a huge variance in resource usage between different requests as well as between different phases of the same request.

SoK: Gradient Leakage in Federated Learning

no code implementations 8 Apr 2024 Jiacheng Du, Jiahui Hu, Zhibo Wang, Peng Sun, Neil Zhenqiang Gong, Kui Ren

While GIAs have demonstrated effectiveness under ideal settings and auxiliary assumptions, their actual efficacy against practical FL systems remains under-explored.

Federated Learning Misconceptions

InternLM2 Technical Report

1 code implementation 26 Mar 2024 Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang, Penglong Jiao, Zhenjiang Jin, Zhikai Lei, Jiaxing Li, Jingwen Li, Linyang Li, Shuaibin Li, Wei Li, Yining Li, Hongwei Liu, Jiangning Liu, Jiawei Hong, Kaiwen Liu, Kuikun Liu, Xiaoran Liu, Chengqi Lv, Haijun Lv, Kai Lv, Li Ma, Runyuan Ma, Zerun Ma, Wenchang Ning, Linke Ouyang, Jiantao Qiu, Yuan Qu, FuKai Shang, Yunfan Shao, Demin Song, Zifan Song, Zhihao Sui, Peng Sun, Yu Sun, Huanze Tang, Bin Wang, Guoteng Wang, Jiaqi Wang, Jiayu Wang, Rui Wang, Yudong Wang, Ziyi Wang, Xingjian Wei, Qizhen Weng, Fan Wu, Yingtong Xiong, Chao Xu, Ruiliang Xu, Hang Yan, Yirong Yan, Xiaogui Yang, Haochen Ye, Huaiyuan Ying, Jia Yu, Jing Yu, Yuhang Zang, Chuyu Zhang, Li Zhang, Pan Zhang, Peng Zhang, Ruijie Zhang, Shuo Zhang, Songyang Zhang, Wenjian Zhang, Wenwei Zhang, Xingcheng Zhang, Xinyue Zhang, Hui Zhao, Qian Zhao, Xiaomeng Zhao, Fengzhe Zhou, Zaida Zhou, Jingming Zhuo, Yicheng Zou, Xipeng Qiu, Yu Qiao, Dahua Lin

The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI).

4k Long-Context Understanding

Large Language Models as Agents in Two-Player Games

no code implementations 12 Feb 2024 Yang Liu, Peng Sun, Hang Li

By formally defining the training processes of large language models (LLMs), which usually encompass pre-training, supervised fine-tuning, and reinforcement learning with human feedback, within a single and unified machine learning paradigm, we can glean pivotal insights for advancing LLM technologies.

Position reinforcement-learning

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

1 code implementation 8 Feb 2024 Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, Wei He, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang

In this paper, we propose R$^3$: Learning Reasoning through Reverse Curriculum Reinforcement Learning (RL), a novel method that employs only outcome supervision to achieve the benefits of process supervision for large language models.

GSM8K reinforcement-learning +1

ReFT: Reasoning with Reinforced Fine-Tuning

1 code implementation 17 Jan 2024 Trung Quoc Luong, Xinbo Zhang, Zhanming Jie, Peng Sun, Xiaoran Jin, Hang Li

ReFT first warms up the model with SFT, and then employs on-line reinforcement learning, specifically the PPO algorithm in this paper, to further fine-tune the model, where an abundance of reasoning paths are automatically sampled given the question and the rewards are naturally derived from the ground-truth answers.

GSM8K Math +1

On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm

2 code implementations 6 Dec 2023 Peng Sun, Bei Shi, Daiwei Yu, Tao Lin

Contemporary machine learning requires training large neural networks on massive datasets and thus faces the challenges of high computational demands.

BiSinger: Bilingual Singing Voice Synthesis

1 code implementation 25 Sep 2023 Huali Zhou, Yueqian Lin, Yao Shi, Peng Sun, Ming Li

We fuse monolingual singing datasets with open-source singing voice conversion techniques to generate bilingual singing voices while also exploring the potential use of bilingual speech data.

Singing Voice Synthesis Voice Conversion

SST: A Simplified Swin Transformer-based Model for Taxi Destination Prediction based on Existing Trajectory

no code implementations 15 Aug 2023 Zepu Wang, Yifei Sun, Zhiyu Lei, Xincheng Zhu, Peng Sun

Accurately predicting the destination of taxi trajectories can have various benefits for intelligent location-based services.

ST-MLP: A Cascaded Spatio-Temporal Linear Framework with Channel-Independence Strategy for Traffic Forecasting

no code implementations 14 Aug 2023 Zepu Wang, Yuqi Nie, Peng Sun, Nam H. Nguyen, John Mulvey, H. Vincent Poor

The criticality of prompt and precise traffic forecasting in optimizing traffic flow management in Intelligent Transportation Systems (ITS) has drawn substantial scholarly focus.

Computational Efficiency Management +2

Spatio-Temporal Domain Awareness for Multi-Agent Collaborative Perception

no code implementations ICCV 2023 Kun Yang, Dingkang Yang, Jingyu Zhang, Mingcheng Li, Yang Liu, Jing Liu, Hanqi Wang, Peng Sun, Liang Song

In this paper, we propose SCOPE, a novel collaborative perception framework that aggregates the spatio-temporal awareness characteristics across on-road agents in an end-to-end manner.

3D Object Detection Autonomous Vehicles +1

A Dual Stealthy Backdoor: From Both Spatial and Frequency Perspectives

no code implementations 3 Jul 2023 Yudong Gao, Honglong Chen, Peng Sun, Junjian Li, Anqing Zhang, Zhibo Wang

Then, to attain strong stealthiness, we incorporate Fourier Transform and Discrete Cosine Transform to mix the poisoned image and clean image in the frequency domain.

Backdoor Attack

Metric-aligned Sample Selection and Critical Feature Sampling for Oriented Object Detection

no code implementations 29 Jun 2023 Peng Sun, Yongbin Zheng, Wenqi Wu, Wanying Xu, Shengjian Bai

First, to align the metric inconsistency between sample selection and regression loss calculation caused by fixed IoU strategy, we introduce affine transformation to evaluate the quality of samples and propose a distance-based label assignment strategy.

object-detection Object Detection +2

On the Confidence Intervals in Bioequivalence Studies

no code implementations 11 Jun 2023 Kexuan Li, Susie Sinks, Peng Sun, Lingli Yang

A bioequivalence study is a type of clinical trial designed to compare the biological equivalence of two different formulations of a drug.

Privacy-preserving Adversarial Facial Features

no code implementations CVPR 2023 Zhibo Wang, He Wang, Shuaifan Jin, Wenwen Zhang, Jiahui Hu, Yan Wang, Peng Sun, Wei Yuan, Kaixin Liu, Kui Ren

In this paper, we propose an adversarial features-based face privacy protection (AdvFace) approach to generate privacy-preserving adversarial features, which can disrupt the mapping from adversarial features to facial images to defend against reconstruction attacks.

Face Recognition Privacy Preserving

Optimization Design for Federated Learning in Heterogeneous 6G Networks

no code implementations 15 Mar 2023 Bing Luo, Xiaomin Ouyang, Peng Sun, Pengchao Han, Ningning Ding, Jianwei Huang

With the rapid advancement of 5G networks, billions of smart Internet of Things (IoT) devices along with an enormous amount of data are generated at the network edge.

Federated Learning Management +2

Mastering Strategy Card Game (Hearthstone) with Improved Techniques

no code implementations 9 Mar 2023 Changnan Xiao, Yongxin Zhang, Xuefeng Huang, Qinhan Huang, Jie Chen, Peng Sun

Strategy card games are a well-known genre that demands intelligent gameplay and can be an ideal test bench for AI.

Decision Making

Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play

no code implementations 7 Mar 2023 Wei Xi, Yongxin Zhang, Changnan Xiao, Xuefeng Huang, Shihong Deng, Haowei Liang, Jie Chen, Peng Sun

Deep Reinforcement Learning combined with Fictitious Play shows impressive results on many benchmark games, most of which are, however, single-stage.

Decision Making

Boosting Distributed Full-graph GNN Training with Asynchronous One-bit Communication

no code implementations 2 Mar 2023 Meng Zhang, Qinghao Hu, Peng Sun, Yonggang Wen, Tianwei Zhang

Training Graph Neural Networks (GNNs) on large graphs is challenging due to the conflict between the high memory demand and limited GPU memory.

Quantization

A novel efficient Multi-view traffic-related object detection framework

no code implementations 23 Feb 2023 Kun Yang, Jing Liu, Dingkang Yang, Hanqi Wang, Peng Sun, Yanni Zhang, Yan Liu, Liang Song

With the rapid development of intelligent transportation system applications, a tremendous amount of multi-view video data has emerged to enhance vehicle perception.

Model Selection object-detection +1

Local-to-Global Information Communication for Real-Time Semantic Segmentation Network Search

no code implementations 16 Feb 2023 Guangliang Cheng, Peng Sun, Ting-Bing Xu, Shuchang Lyu, Peiwen Lin

For local information exchange, a graph convolutional network (GCN) guided module is seamlessly integrated to deliver communication between cells.

Neural Architecture Search Real-Time Semantic Segmentation

Generalized Video Anomaly Event Detection: Systematic Taxonomy and Comparison of Deep Models

1 code implementation 10 Feb 2023 Yang Liu, Dingkang Yang, Yan Wang, Jing Liu, Jun Liu, Azzedine Boukerche, Peng Sun, Liang Song

Video Anomaly Detection (VAD) serves as a pivotal technology in intelligent surveillance systems, enabling the temporal or spatial identification of anomalous events within videos.

Anomaly Detection Event Detection +1

Towards Transferable Targeted Adversarial Examples

1 code implementation CVPR 2023 Zhibo Wang, Hongshan Yang, Yunhe Feng, Peng Sun, Hengchang Guo, Zhifei Zhang, Kui Ren

In this paper, we propose the Transferable Targeted Adversarial Attack (TTAA), which can capture the distribution information of the target class from both label-wise and feature-wise perspectives, to generate highly transferable targeted adversarial examples.

Adversarial Attack

CSI-PPPNet: A One-Sided One-for-All Deep Learning Framework for Massive MIMO CSI Feedback

no code implementations 29 Nov 2022 Wei Chen, Weixiao Wan, Shiyue Wang, Peng Sun, Geoffrey Ye Li, Bo Ai

The CSI is compressed via linear projections at the UE, and is recovered at the BS using deep learning (DL) with plug-and-play priors (PPP).

Denoising

Controllability of a Class of Swarm Signaling Networks

no code implementations 26 Sep 2022 Peng Sun, Robert E. Kooij, Roland Bouffanais

In this paper, we propose closed-form analytical expressions to determine the minimum number of driver nodes needed to control a specific class of networks.

Multimodal Graph Learning for Deepfake Detection

no code implementations 12 Sep 2022 Zhiyuan Yan, Peng Sun, Yubo Lang, Shuo Du, Shanzhuo Zhang, Wei Wang, Lei Liu

We evaluate the effectiveness of our method through extensive experiments on widely-used benchmarks and demonstrate that our method outperforms the state-of-the-art detectors in terms of generalization ability and robustness against unknown disturbances.

DeepFake Detection Face Swapping +2

Generative Adversarial Exploration for Reinforcement Learning

no code implementations 27 Jan 2022 Weijun Hong, Menghui Zhu, Minghuan Liu, Weinan Zhang, Ming Zhou, Yong Yu, Peng Sun

Exploration is crucial for training the optimal reinforcement learning (RL) policy, where the key is to discriminate whether a visited state is novel.

Generative Adversarial Network Montezuma's Revenge +2

A Simulation Platform for Multi-tenant Machine Learning Services on Thousands of GPUs

no code implementations 10 Jan 2022 Ruofan Liang, Bingsheng He, Shengen Yan, Peng Sun

Multi-tenant machine learning services have become emerging data-intensive workloads in data centers with heavy usage of GPU resources.

BIG-bench Machine Learning Scheduling

Event-Based Fusion for Motion Deblurring with Cross-modal Attention

1 code implementation 30 Nov 2021 Lei Sun, Christos Sakaridis, Jingyun Liang, Qi Jiang, Kailun Yang, Peng Sun, Yaozu Ye, Kaiwei Wang, Luc van Gool

Traditional frame-based cameras inevitably suffer from motion blur due to long exposure times.

Ranked #3 on Deblurring on GoPro (using extra training data)

Deblurring Image Deblurring +1

Superior Performance with Diversified Strategic Control in FPS Games Using General Reinforcement Learning

no code implementations 29 Sep 2021 Shuxing Li, Jiawei Xu, Chun Yuan, Peng Sun, Zhuobin Zheng, Zhengyou Zhang, Lei Han

We provide comprehensive analysis and experiments to elaborate on the effect of each component on agent performance, and demonstrate that the proposed and adopted techniques are important to achieve superior performance in general end-to-end FPS games.

FPS Games General Reinforcement Learning +2

Characterization and Prediction of Deep Learning Workloads in Large-Scale GPU Datacenters

1 code implementation 3 Sep 2021 Qinghao Hu, Peng Sun, Shengen Yan, Yonggang Wen, Tianwei Zhang

Modern GPU datacenters are critical for delivering Deep Learning (DL) models and services in both the research community and industry.

Management Scheduling

Towards Distraction-Robust Active Visual Tracking

no code implementations 18 Jun 2021 Fangwei Zhong, Peng Sun, Wenhan Luo, Tingyun Yan, Yizhou Wang

In active visual tracking, it is notoriously difficult when distracting objects appear, as distractors often mislead the tracker by occluding the target or bringing a confusing appearance.

Visual Tracking

FedCom: A Byzantine-Robust Local Model Aggregation Rule Using Data Commitment for Federated Learning

no code implementations 16 Apr 2021 Bo Zhao, Peng Sun, Liming Fang, Tao Wang, Keyu Jiang

The results demonstrate its effectiveness and superior performance compared to the state-of-the-art Byzantine-robust schemes in defending against typical data poisoning and model poisoning attacks under practical Non-IID data distributions.

Data Poisoning Federated Learning +2

Deep RGB-D Saliency Detection with Depth-Sensitive Attention and Automatic Multi-Modal Fusion

1 code implementation CVPR 2021 Peng Sun, Wenhu Zhang, Huanyu Wang, Songyuan Li, Xi Li

In principle, the feature modeling scheme is carried out in a depth-sensitive attention module, which leads to the RGB feature enhancement as well as the background distraction reduction by capturing the depth geometry prior.

object-detection RGB-D Salient Object Detection +2

Self-Renormalization of Quasi-Light-Front Correlators on the Lattice

no code implementations 4 Mar 2021 Yi-Kai Huo, Yushan Su, Long-Cheng Gui, Xiangdong Ji, Yuan-Yuan Li, Yizhuang Liu, Andreas Schäfer, Maximilian Schlemmer, Peng Sun, Wei Wang, Yi-Bo Yang, Jian-Hui Zhang, Kuan Zhang

In applying large-momentum effective theory, renormalization of the Euclidean correlators in lattice regularization is a challenge due to linear divergences in the self-energy of Wilson lines.

High Energy Physics - Lattice High Energy Physics - Phenomenology

TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game

1 code implementation 27 Nov 2020 Lei Han, Jiechao Xiong, Peng Sun, Xinghai Sun, Meng Fang, Qingwei Guo, Qiaobo Chen, Tengfei Shi, Hongsheng Yu, Xipeng Wu, Zhengyou Zhang

We show that with orders of magnitude less computation, a faithful reimplementation of AlphaStar's methods cannot succeed and the proposed techniques are necessary to ensure TStarBot-X's competitive performance.

Imitation Learning Starcraft +1

TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning

1 code implementation 25 Nov 2020 Peng Sun, Jiechao Xiong, Lei Han, Xinghai Sun, Shuxing Li, Jiawei Xu, Meng Fang, Zhengyou Zhang

This poses non-trivial difficulties for researchers or engineers and prevents the application of MARL to a broader range of real-world problems.

Dota 2 Multi-agent Reinforcement Learning +4

An Effective Carrier Phase Recovery Method in Tight Time-Packing Faster than Nyquist Optical Communication System

no code implementations 24 Aug 2020 Peng Sun, Xiaoguang Zhang, Dongwei Pan, Lixia Xi, Wenbo Zhang, Xianfeng Tang

We propose a new scheme that combines polybinary transformation and corrected-BPS to compensate for noise in PDM-FTN-QPSK when its acceleration factor is 0.5, which yields a 3.3 dB OSNR gain when the phase noise is 800 kHz.

Identification of splicing edges in tampered image based on Dichromatic Reflection Model

no code implementations 9 Apr 2020 Zhe Shen, Peng Sun, Yubo Lang, Lei Liu, Silong Peng

Therefore we present a novel optic-physical method to discriminate splicing edges from natural edges in a tampered image.

Real-Time Semantic Segmentation via Auto Depth, Downsampling Joint Decision and Feature Aggregation

no code implementations 31 Mar 2020 Peng Sun, Jiaxiang Wu, Songyuan Li, Peiwen Lin, Junzhou Huang, Xi Li

To satisfy the stringent requirements on computational resources in the field of real-time semantic segmentation, most approaches focus on the hand-crafted design of light-weight segmentation networks.

Neural Architecture Search Real-Time Semantic Segmentation +1

Graph-guided Architecture Search for Real-time Semantic Segmentation

1 code implementation CVPR 2020 Peiwen Lin, Peng Sun, Guangliang Cheng, Sirui Xie, Xi Li, Jianping Shi

Unlike previous works that use a simplified search space and stack a repeatable cell to form a network, we introduce a novel search mechanism with a new search space where a lightweight model can be effectively explored through the cell-level diversity and latency-oriented constraint.

Real-Time Semantic Segmentation

Composing Knowledge Graph Embeddings via Word Embeddings

no code implementations 9 Sep 2019 Lianbo Ma, Peng Sun, Zhiwei Lin, Hui Wang

As $(\mathbf{h},\mathbf{r},\mathbf{t})$ is learned from the existing facts within a knowledge graph, these representations cannot be used to detect unknown facts (if the entities or relations never occur in the knowledge graph).

Knowledge Graph Completion Knowledge Graph Embedding +3

OVSNet: Towards One-Pass Real-Time Video Object Segmentation

no code implementations 24 May 2019 Peng Sun, Peiwen Lin, Guangliang Cheng, Jianping Shi, Jiawan Zhang, Xi Li

Video object segmentation aims at accurately segmenting the target object regions across consecutive frames.

Object object-detection +6

AD-VAT: An Asymmetric Dueling mechanism for learning Visual Active Tracking

no code implementations ICLR 2019 Fangwei Zhong, Peng Sun, Wenhan Luo, Tingyun Yan, Yizhou Wang

In AD-VAT, both the tracker and the target are approximated by end-to-end neural networks, and are trained via RL in a dueling/competitive manner: i.e., the tracker intends to lock onto the target, while the target tries to escape from the tracker.

Optimizing Network Performance for Distributed DNN Training on GPU Clusters: ImageNet/AlexNet Training in 1.5 Minutes

1 code implementation 19 Feb 2019 Peng Sun, Wansen Feng, Ruobing Han, Shengen Yan, Yonggang Wen

To address this problem, we propose a communication backend named GradientFlow for distributed DNN training, and employ a set of network optimization techniques.

Distributed, Parallel, and Cluster Computing

Salience Biased Loss for Object Detection in Aerial Images

no code implementations 18 Oct 2018 Peng Sun, Guang Chen, Guerdan Luke, Yi Shang

Experimental results show that our proposed loss function with the RetinaNet architecture outperforms other state-of-the-art object detection models by at least 4.31 mAP, and RetinaNet by 2.26 mAP at the same inference speed as RetinaNet.

Object object-detection +1

TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game

3 code implementations 19 Sep 2018 Peng Sun, Xinghai Sun, Lei Han, Jiechao Xiong, Qing Wang, Bo Li, Yang Zheng, Ji Liu, Yongsheng Liu, Han Liu, Tong Zhang

Both TStarBot1 and TStarBot2 are able to defeat the built-in AI agents from level 1 to level 10 in a full game (1v1 Zerg-vs-Zerg game on the AbyssalReef map), noting that level 8, level 9, and level 10 are cheating agents with unfair advantages such as full vision on the whole map and resource harvest boosting.

Decision Making Starcraft +1

End-to-end Active Object Tracking and Its Real-world Deployment via Reinforcement Learning

no code implementations 10 Aug 2018 Wenhan Luo, Peng Sun, Fangwei Zhong, Wei Liu, Tong Zhang, Yizhou Wang

We further propose an environment augmentation technique and a customized reward function, which are crucial for successful training.

Object Object Tracking +1

An Algorithmic Framework of Variable Metric Over-Relaxed Hybrid Proximal Extra-Gradient Method

no code implementations ICML 2018 Li Shen, Peng Sun, Yitong Wang, Wei Liu, Tong Zhang

Specifically, we find that a large class of primal and primal-dual operator splitting algorithms are all special cases of VMOR-HPE.

Tagging like Humans: Diverse and Distinct Image Annotation

no code implementations CVPR 2018 Baoyuan Wu, Weidong Chen, Peng Sun, Wei Liu, Bernard Ghanem, Siwei Lyu

In D2IA, we generate a relevant and distinct tag subset, in which the tags are relevant to the image contents and semantically distinct from each other, using sequential sampling from a determinantal point process (DPP) model.

Generative Adversarial Network TAG

End-to-end Active Object Tracking via Reinforcement Learning

no code implementations ICML 2018 Wenhan Luo, Peng Sun, Fangwei Zhong, Wei Liu, Tong Zhang, Yizhou Wang

We study active object tracking, where a tracker takes as input the visual observation (i.e., frame sequence) and produces the camera control signal (e.g., move forward, turn left, etc.).

Object Object Tracking +2

Fast Segmentation of Left Ventricle in CT Images by Explicit Shape Regression using Random Pixel Difference Features

no code implementations 27 Jul 2015 Peng Sun, Haoyin Zhou, Devon Lundine, James K. Min, Guanglei Xiong

On a dataset consisting of 139 CT volumes, a 5-fold cross validation shows the segmentation error is $1.21 \pm 0.11$ for LV endocardium and $1.23 \pm 0.11$ millimeters for epicardium.

Computed Tomography (CT) LV Segmentation +1
