Search Results for author: Shuhao Zhang

Found 6 papers, 1 papers with code

Online Continual Knowledge Learning for Language Models

no code implementations16 Nov 2023 Yuhao Wu, Tongjun Shi, Karthick Sharma, Chun Wei Seah, Shuhao Zhang

In this paper, we introduce a novel problem in the realm of continual learning: Online Continual Knowledge Learning (OCKL).

Continual Learning Fact Checking +2

Harnessing Scalable Transactional Stream Processing for Managing Large Language Models [Vision]

no code implementations17 Jul 2023 Shuhao Zhang, Xianzhi Zeng, Yuhao Wu, Zhonghao Yang

Large Language Models (LLMs) have demonstrated extraordinary performance across a broad array of applications, from traditional language processing tasks to interpreting structured sequences like time-series data.

Decision Making Management +1

SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning

no code implementations16 Mar 2023 Shuhan Qi, Shuhao Zhang, Qiang Wang, Jiajia Zhang, Jing Xiao, Xuan Wang

In this paper, we propose a scalable value-decomposition exploration (SVDE) method, which includes a scalable training mechanism, intrinsic reward design, and explorative experience replay.

Multi-agent Reinforcement Learning reinforcement-learning +3

MolMiner: You only look once for chemical structure recognition

no code implementations23 May 2022 Youjun Xu, Jinchuan Xiao, Chia-Han Chou, Jianhang Zhang, Jintao Zhu, Qiwan Hu, Hemin Li, Ningsheng Han, Bingyu Liu, Shuaipeng Zhang, Jinyu Han, Zhen Zhang, Shuhao Zhang, Weilin Zhang, Luhua Lai, Jianfeng Pei

Due to a backlog of decades and an increasing amount of these printed literature, there is a high demand for the translation of printed depictions into machine-readable formats, which is known as Optical Chemical Structure Recognition (OCSR).

object-detection Object Detection +1

Efficient Distributed Framework for Collaborative Multi-Agent Reinforcement Learning

no code implementations11 May 2022 Shuhan Qi, Shuhao Zhang, Xiaohan Hou, Jiajia Zhang, Xuan Wang, Jing Xiao

However, due to the slow sample collection and poor sample exploration, there are still some problems in multi-agent reinforcement learning, such as unstable model iteration and low training efficiency.

reinforcement-learning Reinforcement Learning (RL) +1

A Framework for Fast Polarity Labelling of Massive Data Streams

1 code implementation23 Mar 2022 Huilin Wu, Mian Lu, Zhao Zheng, Shuhao Zhang

Many of the existing sentiment analysis techniques are based on supervised learning, and they demand the availability of valuable training datasets to train their models.

Sentiment Analysis

Cannot find the paper you are looking for? You can Submit a new open access paper.