Search Results for author: Peng Wei

Found 28 papers, 2 papers with code

Reinforcement Learning with Latent State Inference for Autonomous On-ramp Merging under Observation Delay

no code implementations • 18 Mar 2024 • Amin Tabrizian, Zhitong Huang, Peng Wei

L3IS shows a 99. 90% success rate in a challenging on-ramp merging case generated from the real US Highway 101 data.

Paper
Add Code

Edge Information Hub: Orchestrating Satellites, UAVs, MEC, Sensing and Communications for 6G Closed-Loop Controls

no code implementations • 11 Mar 2024 • Chengleyang Lei, Wei Feng, Peng Wei, Yunfei Chen, Ning Ge, Shiwen Mao

Specifically, the linear quadratic regulator (LQR) control cost is used to measure the closed-loop utility, and a sum LQR cost minimization problem is formulated to jointly optimize the splitting of sensor data and allocation of communication and computing resources.

Edge-computing

Paper
Add Code

Progressive Feature Fusion Network for Enhancing Image Quality Assessment

no code implementations • 13 Jan 2024 • Kaiqun Wu, Xiaoling Jiang, Rui Yu, Yonggang Luo, Tian Jiang, Xi Wu, Peng Wei

To capture the subtle differences, a fine-grained network is adopted to acquire multi-scale features.

Image Compression Image Quality Assessment

Paper
Add Code

Two-stage space construction for real-time modeling of distributed parameter systems under sparse sensing

no code implementations • 28 Oct 2023 • Peng Wei

The high-dimensional space construction method is employed to derive continuous spatial basis functions (SBFs).

Paper
Add Code

Fusion-Driven Tree Reconstruction and Fruit Localization: Advancing Precision in Agriculture

no code implementations • 23 Oct 2023 • Kaiming Fu, Peng Wei, Juan Villacres, Zhaodan Kong, Stavros G. Vougioukas, Brian N. Bailey

Fruit distribution is pivotal in shaping the future of both agriculture and agricultural robotics, paving the way for a streamlined supply chain.

Paper
Add Code

Multiscale Fusion for Abnormality Detection and Localization of Distributed Parameter Systems

no code implementations • 10 Oct 2023 • Peng Wei, Han-Xiong Li

Numerous industrial thermal processes and fluid processes can be described by distributed parameter systems (DPSs), wherein many process parameters and variables vary in space and time.

Anomaly Detection Fault Detection

Paper
Add Code

Integrated Conflict Management for UAM with Strategic Demand Capacity Balancing and Learning-based Tactical Deconfliction

no code implementations • 17 May 2023 • Shulu Chen, Antony Evans, Marc Brittain, Peng Wei

By using DCB to precondition traffic to proper density levels, we show that reinforcement learning can achieve much better performance for tactical safety separation.

Management reinforcement-learning

Paper
Add Code

MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization

1 code implementation • 21 Feb 2023 • Yongsheng Mei, Hanhan Zhou, Tian Lan, Guru Venkataramani, Peng Wei

To this end, we propose MAC-PO, which formulates optimal prioritized experience replay for multi-agent problems as a regret minimization over the sampling weights of transitions.

Decision Making Multi-agent Reinforcement Learning +3

Paper
Code

A Survey on Reinforcement Learning in Aviation Applications

no code implementations • 3 Nov 2022 • Pouria Razzaghi, Amin Tabrizian, Wei Guo, Shulu Chen, Abenezer Taye, Ellis Thompson, Alexis Bregeon, Ali Baheri, Peng Wei

Then we survey the landscape of existing RL-based applications in aviation.

Decision Making reinforcement-learning +1

Paper
Add Code

A Verification Framework for Certifying Learning-Based Safety-Critical Aviation Systems

no code implementations • 9 May 2022 • Ali Baheri, Hao Ren, Benjamin Johnson, Pouria Razzaghi, Peng Wei

We present a safety verification framework for design-time and run-time assurance of learning-based components in aviation systems.

Decision Making

Paper
Add Code

Reinforcement Learning-Empowered Mobile Edge Computing for 6G Edge Intelligence

no code implementations • 27 Jan 2022 • Peng Wei, Kun Guo, Ye Li, Jue Wang, Wei Feng, Shi Jin, Ning Ge, Ying-Chang Liang

Mobile edge computing (MEC) is considered a novel paradigm for computation-intensive and delay-sensitive tasks in fifth generation (5G) networks and beyond.

Edge-computing reinforcement-learning +1

Paper
Add Code

Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning

no code implementations • 13 Nov 2021 • Jueming Hu, Xuxi Yang, Weichang Wang, Peng Wei, Lei Ying, Yongming Liu

Obstacle avoidance for small unmanned aircraft is vital for the safety of future urban air mobility (UAM) and Unmanned Aircraft System (UAS) Traffic Management (UTM).

Continuous Control Management +2

Paper
Add Code

Power System Transient Modeling and Simulation using Integrated Circuit

no code implementations • 6 Jun 2021 • Xiang Zhang, Renchang Dai, Peng Wei, Yijing Liu, Guangyi Liu, Zhiwei Wang

Transient stability analysis (TSA) plays an important role in power system analysis to investigate the stability of power system.

Numerical Integration

Paper
Add Code

Safety Enhancement for Deep Reinforcement Learning in Autonomous Separation Assurance

no code implementations • 5 May 2021 • Wei Guo, Marc Brittain, Peng Wei

We demonstrate the effectiveness of the two sub-modules in an open-source air traffic simulator with challenging environment settings.

Data Augmentation reinforcement-learning +1

Paper
Add Code

Scalable FastMDP for Pre-departure Airspace Reservation and Strategic De-conflict

no code implementations • 8 Aug 2020 • Joshua R. Bertram, Peng Wei, Joseph Zambreno

Our results show that on commodity GPU hardware we can perform flight plan scheduling against 2000-3000 known flight plans and with server-class hardware the performance can be higher.

Scheduling

Paper
Add Code

Continuous Control for Searching and Planning with a Learned Model

no code implementations • 12 Jun 2020 • Xuxi Yang, Werner Duvaud, Peng Wei

Decision-making agents with planning capabilities have achieved huge success in the challenging domain like Chess, Shogi, and Go.

Continuous Control Decision Making +2

Paper
Add Code

A Deep Multi-Agent Reinforcement Learning Approach to Autonomous Separation Assurance

no code implementations • 17 Mar 2020 • Marc Brittain, Xuxi Yang, Peng Wei

A novel deep multi-agent reinforcement learning framework is proposed to identify and resolve conflicts among a variable number of aircraft in a high-density, stochastic, and dynamic sector.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

An Efficient Algorithm for Multiple-Pursuer-Multiple-Evader Pursuit/Evasion Game

no code implementations • 9 Sep 2019 • Joshua R. Bertram, Peng Wei

We present a method for pursuit/evasion that is highly efficient and and scales to large teams of aircraft.

Paper
Add Code

Prioritized Sequence Experience Replay

no code implementations • 25 May 2019 • Marc Brittain, Josh Bertram, Xuxi Yang, Peng Wei

Experience replay is widely used in deep reinforcement learning algorithms and allows agents to remember and learn from experiences from the past.

Q-Learning reinforcement-learning +1

Paper
Add Code

Autonomous Air Traffic Controller: A Deep Multi-Agent Reinforcement Learning Approach

no code implementations • 2 May 2019 • Marc Brittain, Peng Wei

Air traffic control is a real-time safety-critical decision making process in highly dynamic and stochastic environments.

Decision Making Multi-agent Reinforcement Learning +2

Paper
Add Code

Autonomous Airline Revenue Management: A Deep Reinforcement Learning Approach to Seat Inventory Control and Overbooking

no code implementations • 18 Feb 2019 • Syed Arbab Mohd Shihab, Caleb Logemann, Deepak-George Thomas, Peng Wei

This paper focuses on the latter problem - the seat inventory control problem - casting it as a Markov Decision Process to be able to find the optimal policy.

Management Q-Learning

Paper
Add Code

A PCB Dataset for Defects Detection and Classification

5 code implementations • 24 Jan 2019 • Weibo Huang, Peng Wei

To coupe with the difficulties in the process of inspection and classification of defects in Printed Circuit Board (PCB), other researchers have proposed many methods.

Classification General Classification

387

Paper
Code

Basis Signal Optimization for N-Continuous OFDM

no code implementations • 28 Dec 2018 • Peng Wei, Yue Xiao, Wei Xiang

A novel basis signal optimization method is proposed for reducing the interference in the N-continuous orthogonal frequency division multiplexing (NC-OFDM) system.

Paper
Add Code

AutoAccel: Automated Accelerator Generation and Optimization with Composable, Parallel and Pipeline Architecture

no code implementations • 30 Jul 2018 • Jason Cong, Peng Wei, Cody Hao Yu, Peng Zhang

Such a well-defined template is able to support efficient accelerator designs for a broad class of computation kernels, and more importantly, drastically reduce the design space.

Distributed, Parallel, and Cluster Computing Hardware Architecture

Paper
Add Code

Explainable Deterministic MDPs

no code implementations • 9 Jun 2018 • Josh Bertram, Peng Wei

We present a method for a certain class of Markov Decision Processes (MDPs) that can relate the optimal policy back to one or more reward sources in the environment.

Paper
Add Code

Hierarchical Reinforcement Learning with Deep Nested Agents

no code implementations • 18 May 2018 • Marc Brittain, Peng Wei

Deep hierarchical reinforcement learning has gained a lot of attention in recent years due to its ability to produce state-of-the-art results in challenging environments where non-hierarchical frameworks fail to learn useful policies.

Hierarchical Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Memoryless Exact Solutions for Deterministic MDPs with Sparse Rewards

no code implementations • 17 May 2018 • Joshua R. Bertram, Peng Wei

The algorithm to solve the MDP does not depend on the size of the state space for either time or memory complexity, and the ability to follow the optimal policy is linear in time and space with the path length of following the optimal policy from the initial state.

Paper
Add Code

Fast Online Exact Solutions for Deterministic MDPs with Sparse Rewards

no code implementations • 8 May 2018 • Joshua R. Bertram, Xuxi Yang, Peng Wei

Markov Decision Processes (MDPs) are a mathematical framework for modeling sequential decision making under uncertainty.

Decision Making Decision Making Under Uncertainty

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.