Search Results for author: Zhiquan Tan

Found 9 papers, 6 papers with code

The Information of Large Language Model Geometry

no code implementations1 Feb 2024 Zhiquan Tan, Chenghai Li, Weiran Huang

This paper investigates the information encoded in the embeddings of large language models (LLMs).

Language Modelling Large Language Model +1

Large Language Model Evaluation via Matrix Entropy

1 code implementation30 Jan 2024 Lai Wei, Zhiquan Tan, Chenghai Li, Jindong Wang, Weiran Huang

Large language models (LLMs) have revolutionized the field of natural language processing, extending their strong capabilities into multi-modal domains.

Data Compression Language Modelling +1

Understanding Grokking Through A Robustness Viewpoint

no code implementations11 Nov 2023 Zhiquan Tan, Weiran Huang

Recently, an interesting phenomenon called grokking has gained much attention, where generalization occurs long after the models have initially overfitted the training data.

OTMatch: Improving Semi-Supervised Learning with Optimal Transport

no code implementations26 Oct 2023 Zhiquan Tan, Kaipeng Zheng, Weiran Huang

In this paper, we present a new approach called OTMatch, which leverages semantic relationships among classes by employing an optimal transport loss function.

Information Flow in Self-Supervised Learning

2 code implementations29 Sep 2023 Zhiquan Tan, Jingqin Yang, Weiran Huang, Yang Yuan, Yifan Zhang

In this paper, we provide a comprehensive toolbox for understanding and enhancing self-supervised learning (SSL) methods through the lens of matrix information theory.

Self-Supervised Learning

Matrix Information Theory for Self-Supervised Learning

3 code implementations27 May 2023 Yifan Zhang, Zhiquan Tan, Jingqin Yang, Weiran Huang, Yang Yuan

Inspired by this framework, we introduce Matrix-SSL, a novel approach that leverages matrix information theory to interpret the maximum entropy encoding loss as matrix uniformity loss.

Contrastive Learning GSM8K +4

RelationMatch: Matching In-batch Relationships for Semi-supervised Learning

1 code implementation17 May 2023 Yifan Zhang, Jingqin Yang, Zhiquan Tan, Yang Yuan

Semi-supervised learning has achieved notable success by leveraging very few labeled data and exploiting the wealth of information derived from unlabeled data.

Semi-Supervised Image Classification

SEAL: Simultaneous Label Hierarchy Exploration And Learning

1 code implementation26 Apr 2023 Zhiquan Tan, ZiHao Wang, Yifan Zhang

Label hierarchy is an important source of external knowledge that can enhance classification performance.

Contrastive Learning Is Spectral Clustering On Similarity Graph

1 code implementation27 Mar 2023 Zhiquan Tan, Yifan Zhang, Jingqin Yang, Yang Yuan

Contrastive learning is a powerful self-supervised learning method, but we have a limited theoretical understanding of how it works and why it works.

Clustering Contrastive Learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.