RecallM: An Adaptable Memory Mechanism with Temporal Understanding for Large Language Models

6 Jul 2023 · Brandon Kynoch, Hugo Latapie, Dwane van der Sluis

Large Language Models (LLMs) have made extraordinary progress in the field of Artificial Intelligence and have demonstrated remarkable capabilities across a large variety of tasks and domains. However, as we venture closer to creating Artificial General Intelligence (AGI) systems, we recognize the need to supplement LLMs with long-term memory to overcome the context window limitation and, more importantly, to create a foundation for sustained reasoning, cumulative learning, and long-term user interaction. In this paper we propose RecallM, a novel architecture for providing LLMs with an adaptable and updatable long-term memory mechanism. Unlike previous methods, the RecallM architecture is particularly effective at belief updating and at maintaining a temporal understanding of the knowledge provided to it. We demonstrate the effectiveness of this architecture through various experiments. Furthermore, through our own temporal understanding and belief-updating experiments, we show that RecallM is four times more effective than a vector database at updating knowledge previously stored in long-term memory. We also demonstrate that RecallM achieves competitive performance on general question-answering and in-context learning tasks.
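The abstract contrasts RecallM's belief updating with retrieval from a vector database, where newly ingested statements are simply appended alongside stale ones. As a rough intuition only, the minimal Python sketch below shows the "latest statement about a concept supersedes older ones" behavior that belief updating implies. All names here (TemporalMemory, update, recall) are hypothetical illustrations, not the paper's actual API or mechanism, which the full text describes in detail.

```python
import time
from collections import defaultdict
from typing import Optional


class TemporalMemory:
    """Toy memory that keeps a time-ordered history of statements per
    concept, so recall() returns the most recent belief rather than
    every similar statement ever stored (as a plain vector store would)."""

    def __init__(self) -> None:
        # concept -> list of (timestamp, statement) pairs
        self._store = defaultdict(list)

    def update(self, concept: str, statement: str,
               t: Optional[float] = None) -> None:
        # Append rather than overwrite: older beliefs stay in the history,
        # but recall() prefers the newest one.
        self._store[concept].append(
            (t if t is not None else time.time(), statement))

    def recall(self, concept: str) -> Optional[str]:
        history = self._store.get(concept)
        if not history:
            return None
        # The latest-timestamped statement wins, implementing a crude
        # form of belief updating.
        return max(history, key=lambda pair: pair[0])[1]


memory = TemporalMemory()
memory.update("capital_of_x", "The capital of X is Alpha.", t=1.0)
memory.update("capital_of_x", "The capital of X is Beta.", t=2.0)  # update
print(memory.recall("capital_of_x"))  # -> "The capital of X is Beta."
```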


Datasets

DuoRC

Results from the Paper


Task                 Dataset   Model                        Metric     Value   Global Rank
Question Answering   DuoRC     Vector Database (ChromaDB)   Accuracy   55.71   #1
Question Answering   DuoRC     Hybrid-RecallM               Accuracy   52.68   #2
Question Answering   DuoRC     RecallM                      Accuracy   48.13   #3
