no code implementations • 1 Feb 2024 • Zhiquan Tan, Chenghai Li, Weiran Huang
This paper investigates the information encoded in the embeddings of large language models (LLMs).
1 code implementation • 30 Jan 2024 • Lai Wei, Zhiquan Tan, Chenghai Li, Jindong Wang, Weiran Huang
Large language models (LLMs) have revolutionized the field of natural language processing, extending their strong capabilities into multi-modal domains.
no code implementations • 11 Nov 2023 • Zhiquan Tan, Weiran Huang
Recently, an interesting phenomenon called grokking has gained much attention, where generalization occurs long after the models have initially overfitted the training data.
no code implementations • 26 Oct 2023 • Zhiquan Tan, Kaipeng Zheng, Weiran Huang
In this paper, we present a new approach called OTMatch, which leverages semantic relationships among classes by employing an optimal transport loss function.
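The paper's exact OT loss is not given here, but the core machinery behind such losses is entropy-regularized optimal transport, typically solved with Sinkhorn iterations. A minimal sketch (illustrative only; the marginals, cost design, and regularization are assumptions, not OTMatch's formulation):

```python
import numpy as np

def sinkhorn(cost, eps=0.1, n_iters=100):
    """Approximate OT plan between uniform marginals via Sinkhorn iterations."""
    n, m = cost.shape
    K = np.exp(-cost / eps)      # Gibbs kernel from the cost matrix
    r = np.ones(n) / n           # uniform source marginal
    c = np.ones(m) / m           # uniform target marginal
    u, v = np.ones(n), np.ones(m)
    for _ in range(n_iters):     # alternating marginal projections
        u = r / (K @ v)
        v = c / (K.T @ u)
    return u[:, None] * K * v[None, :]

# Cheap cross-class assignment stays on the diagonal of the plan.
cost = np.array([[0.0, 1.0],
                 [1.0, 0.0]])
plan = sinkhorn(cost)
```

The resulting `plan` is a soft matching whose rows and columns respect the chosen marginals; a classification loss can then weight class relationships by this plan rather than treating classes as unrelated.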
2 code implementations • 29 Sep 2023 • Zhiquan Tan, Jingqin Yang, Weiran Huang, Yang Yuan, Yifan Zhang
In this paper, we provide a comprehensive toolbox for understanding and enhancing self-supervised learning (SSL) methods through the lens of matrix information theory.
3 code implementations • 27 May 2023 • Yifan Zhang, Zhiquan Tan, Jingqin Yang, Weiran Huang, Yang Yuan
Inspired by this framework, we introduce Matrix-SSL, a novel approach that leverages matrix information theory to interpret the maximum entropy encoding loss as matrix uniformity loss.
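One basic quantity in matrix information theory is the von Neumann entropy of a trace-normalized feature covariance; a uniformity-style objective pushes this entropy up so that features spread across dimensions instead of collapsing. A hedged sketch of that quantity alone (how Matrix-SSL actually composes its loss is not specified here):

```python
import numpy as np

def matrix_entropy(features):
    """Von Neumann entropy -tr(S log S) of the unit-trace covariance S."""
    cov = features.T @ features / features.shape[0]
    S = cov / np.trace(cov)                # normalize to unit trace
    eigvals = np.linalg.eigvalsh(S)
    eigvals = eigvals[eigvals > 1e-12]     # drop numerical zeros
    return float(-np.sum(eigvals * np.log(eigvals)))
```

Perfectly collapsed features (rank-one covariance) give entropy 0, while isotropic features in d dimensions give the maximum log d, which is why maximizing this entropy acts as a uniformity loss.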
Ranked #1 on Contrastive Learning on ImageNet-1K
1 code implementation • 17 May 2023 • Yifan Zhang, Jingqin Yang, Zhiquan Tan, Yang Yuan
Semi-supervised learning has achieved notable success by leveraging very little labeled data and exploiting the wealth of information in unlabeled data.
1 code implementation • 26 Apr 2023 • Zhiquan Tan, ZiHao Wang, Yifan Zhang
Label hierarchy is an important source of external knowledge that can enhance classification performance.
1 code implementation • 27 Mar 2023 • Zhiquan Tan, Yifan Zhang, Jingqin Yang, Yang Yuan
Contrastive learning is a powerful self-supervised learning method, but our theoretical understanding of how and why it works remains limited.
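For concreteness, the standard InfoNCE-style contrastive loss that such analyses typically target can be sketched as follows (the generic formulation, not necessarily the exact loss studied in the paper):

```python
import numpy as np

def info_nce(z1, z2, temperature=0.5):
    """InfoNCE loss between two batches of paired embeddings (positives on the diagonal)."""
    z1 = z1 / np.linalg.norm(z1, axis=1, keepdims=True)   # L2-normalize
    z2 = z2 / np.linalg.norm(z2, axis=1, keepdims=True)
    logits = z1 @ z2.T / temperature                      # cosine similarities
    # Treat each row as a softmax classification of its positive pair.
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_prob)))
```

Aligned pairs (each row of `z1` matching the same row of `z2`) yield a lower loss than shuffled pairs, which is the alignment-vs-uniformity trade-off theoretical treatments of contrastive learning try to explain.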