Search Results for author: Xinbo Wu

Found 2 papers, 0 papers with code

Transformer-based Causal Language Models Perform Clustering

no code implementations • 19 Feb 2024 • Xinbo Wu, Lav R. Varshney

Even though large language models (LLMs) have demonstrated remarkable capability in solving various natural language tasks, the capability of an LLM to follow human instructions is still a concern.

Clustering Instruction Following +1

Paper
Add Code

A Meta-Learning Perspective on Transformers for Causal Language Modeling

no code implementations • 9 Oct 2023 • Xinbo Wu, Lav R. Varshney

Focused on the training process, here we establish a meta-learning view of the Transformer architecture when trained for the causal language modeling task, by explicating an inner optimization process within the Transformer.

Causal Language Modeling Language Modelling +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.