Search Results for author: Xinbo Wu

Found 2 papers, 0 papers with code

Transformer-based Causal Language Models Perform Clustering

no code implementations · 19 Feb 2024 · Xinbo Wu, Lav R. Varshney

Even though large language models (LLMs) have demonstrated remarkable capability in solving various natural language tasks, the capability of an LLM to follow human instructions is still a concern.

Clustering · Instruction Following · +1

A Meta-Learning Perspective on Transformers for Causal Language Modeling

no code implementations · 9 Oct 2023 · Xinbo Wu, Lav R. Varshney

Focused on the training process, here we establish a meta-learning view of the Transformer architecture when trained for the causal language modeling task, by explicating an inner optimization process within the Transformer.

Causal Language Modeling · Language Modelling · +1
