Search Results for author: Xiangwen Kong

Found 4 papers, 3 papers with code

DreamLLM: Synergistic Multimodal Comprehension and Creation

1 code implementation • 20 Sep 2023 • Runpei Dong, Chunrui Han, Yuang Peng, Zekun Qi, Zheng Ge, Jinrong Yang, Liang Zhao, Jianjian Sun, HongYu Zhou, Haoran Wei, Xiangwen Kong, Xiangyu Zhang, Kaisheng Ma, Li Yi

This paper presents DreamLLM, a learning framework that is the first to achieve versatile Multimodal Large Language Models (MLLMs) empowered by the frequently overlooked synergy between multimodal comprehension and creation.

Ranked #1 on Visual Question Answering on MMBench (GPT-3.5 score metric)

multimodal generation, Visual Question Answering, +2

Reversible Column Networks

1 code implementation • 22 Dec 2022 • Yuxuan Cai, Yizhuang Zhou, Qi Han, Jianjian Sun, Xiangwen Kong, Jun Li, Xiangyu Zhang

This architectural scheme gives RevCol very different behavior from conventional networks: during forward propagation, features in RevCol are gradually disentangled as they pass through each column, and their total information is maintained rather than compressed or discarded as in other networks.

Ranked #8 on Semantic Segmentation on ADE20K (using extra training data)

Image Classification, object-detection, +3

Revisiting the Critical Factors of Augmentation-Invariant Representation Learning

1 code implementation • 30 Jul 2022 • Junqiang Huang, Xiangwen Kong, Xiangyu Zhang

We focus on better understanding the critical factors of augmentation-invariant representation learning.

Representation Learning
