Search Results for author: Bingchen Zhao

Found 28 papers, 17 papers with code

HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing

no code implementations • 15 Apr 2024 • Mude Hui, Siwei Yang, Bingchen Zhao, Yichun Shi, Heng Wang, Peng Wang, Yuyin Zhou, Cihang Xie

This study introduces HQ-Edit, a high-quality instruction-based image editing dataset with around 200, 000 edits.

Attribute

Paper
Add Code

Beyond Known Clusters: Probe New Prototypes for Efficient Generalized Class Discovery

1 code implementation • 13 Apr 2024 • Ye Wang, Yaxiong Wang, Yujiao Wu, Bingchen Zhao, Xueming Qian

To counteract this inefficiency, we opt to cluster only the unlabelled instances and subsequently expand the cluster prototypes with our introduced potential prototypes to fast explore novel classes.

Clustering Contrastive Learning

Paper
Code

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

4 code implementations • 8 Apr 2024 • Bo Peng, Daniel Goldstein, Quentin Anthony, Alon Albalak, Eric Alcaide, Stella Biderman, Eugene Cheah, Xingjian Du, Teddy Ferdinan, Haowen Hou, Przemysław Kazienko, Kranthi Kiran GV, Jan Kocoń, Bartłomiej Koptyra, Satyapriya Krishna, Ronald McClelland Jr., Niklas Muennighoff, Fares Obeid, Atsushi Saito, Guangyu Song, Haoqin Tu, Stanisław Woźniak, Ruichong Zhang, Bingchen Zhao, Qihang Zhao, Peng Zhou, Jian Zhu, Rui-Jie Zhu

We present Eagle (RWKV-5) and Finch (RWKV-6), sequence models improving upon the RWKV (RWKV-4) architecture.

11,728

Paper
Code

Beyond the Known: Novel Class Discovery for Open-world Graph Learning

no code implementations • 29 Mar 2024 • Yucheng Jin, Yun Xiong, Juncheng Fang, Xixi Wu, Dongxiao He, Xing Jia, Bingchen Zhao, Philip Yu

Inter-class correlations are subsequently eliminated by the prototypical attention network, leading to distinctive representations for different classes.

Graph Learning Node Classification +1

Paper
Add Code

AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability

1 code implementation • 14 Feb 2024 • Siwei Yang, Bingchen Zhao, Cihang Xie

This paper introduces AQA-Bench, a novel benchmark to assess the sequential reasoning capabilities of large language models (LLMs) in algorithmic contexts, such as depth-first search (DFS).

Paper
Code

Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning

no code implementations • 18 Dec 2023 • Bingchen Zhao, Haoqin Tu, Chen Wei, Jieru Mei, Cihang Xie

This paper introduces an efficient strategy to transform Large Language Models (LLMs) into Multi-Modal Large Language Models (MLLMs).

Domain Adaptation

Paper
Add Code

Compress & Align: Curating Image-Text Data with Human Knowledge

no code implementations • 11 Dec 2023 • Lei Zhang, Fangxun Shu, Sucheng Ren, Bingchen Zhao, Hao Jiang, Cihang Xie

The massive growth of image-text data through web crawling inherently presents the challenge of variability in data quality.

Image Captioning Text Retrieval

Paper
Add Code

How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs

1 code implementation • 27 Nov 2023 • Haoqin Tu, Chenhang Cui, Zijun Wang, Yiyang Zhou, Bingchen Zhao, Junlin Han, Wangchunshu Zhou, Huaxiu Yao, Cihang Xie

Different from prior studies, we shift our focus from evaluating standard performance to introducing a comprehensive safety evaluation suite, covering both out-of-distribution (OOD) generalization and adversarial robustness.

Adversarial Robustness Visual Question Answering (VQA) +1

Paper
Code

What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language Models

1 code implementation • 10 Oct 2023 • Letian Zhang, Xiaotong Zhai, Zhongkai Zhao, Yongshuo Zong, Xin Wen, Bingchen Zhao

In light of the advancements in current multi-modal large language models, we explore their effectiveness in counterfactual reasoning.

Benchmarking Code Generation +4

Paper
Code

Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations

1 code implementation • 2 Oct 2023 • Yongshuo Zong, Tingyang Yu, Bingchen Zhao, Ruchika Chavhan, Timothy Hospedales

Large language and vision-language models are rapidly being deployed in practice thanks to their impressive capabilities in instruction following, in-context learning, and so on.

In-Context Learning Instruction Following +3

Paper
Code

Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics

1 code implementation • 13 Sep 2023 • Haoqin Tu, Bingchen Zhao, Chen Wei, Cihang Xie

Multi-modal large language models (MLLMs) are trained based on large language models (LLM), with an enhanced capability to comprehend multi-modal inputs and generate textual responses.

Ethics

Paper
Code

Learning Semi-supervised Gaussian Mixture Models for Generalized Category Discovery

1 code implementation • ICCV 2023 • Bingchen Zhao, Xin Wen, Kai Han

In this paper, we address the problem of generalized category discovery (GCD), \ie, given a set of images where part of them are labelled and the rest are not, the task is to automatically cluster the images in the unlabelled data, leveraging the information from the labelled data, while the unlabelled data contain images from the labelled classes and also new ones.

Contrastive Learning Image Classification +2

Paper
Code

Incremental Generalized Category Discovery

no code implementations • ICCV 2023 • Bingchen Zhao, Oisin Mac Aodha

We explore the problem of Incremental Generalized Category Discovery (IGCD).

Fine-Grained Visual Categorization Incremental Learning

Paper
Add Code

OOD-CV-v2: An extended Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images

no code implementations • 17 Apr 2023 • Bingchen Zhao, Jiahao Wang, Wufei Ma, Artur Jesslen, Siwei Yang, Shaozuo Yu, Oliver Zendel, Christian Theobalt, Alan Yuille, Adam Kortylewski

Enhancing the robustness of vision algorithms in real-world scenarios is challenging.

3D Pose Estimation Benchmarking +4

Paper
Add Code

Vision Learners Meet Web Image-Text Pairs

no code implementations • 17 Jan 2023 • Bingchen Zhao, Quan Cui, Hao Wu, Osamu Yoshie, Cheng Yang, Oisin Mac Aodha

In this work, given the excellent scalability of web data, we consider self-supervised pre-training on noisy web sourced image-text paired data.

Benchmarking Self-Supervised Learning +1

Paper
Add Code

Parametric Classification for Generalized Category Discovery: A Baseline Study

2 code implementations • ICCV 2023 • Xin Wen, Bingchen Zhao, Xiaojuan Qi

Generalized Category Discovery (GCD) aims to discover novel categories in unlabelled datasets using knowledge learned from labelled samples.

Ranked #1 on Open-World Semi-Supervised Learning on ImageNet-100

Classification Novel Class Discovery +2

Paper
Code

One Venue, Two Conferences: The Separation of Chinese and American Citation Networks

no code implementations • 17 Nov 2022 • Bingchen Zhao, Yuling Gu, Jessica Zosa Forde, Naomi Saphra

At NeurIPS, American and Chinese institutions cite papers from each other's regions substantially less than they cite endogamously.

Paper
Add Code

XCon: Learning with Experts for Fine-grained Category Discovery

1 code implementation • 3 Aug 2022 • Yixin Fei, Zhongkai Zhao, Siwei Yang, Bingchen Zhao

We address the problem of generalized category discovery (GCD) in this paper, i. e. clustering the unlabeled images leveraging the information from a set of seen classes, where the unlabeled images could contain both seen classes and unseen classes.

Clustering Contrastive Learning +1

Paper
Code

Self-Supervised Visual Representation Learning with Semantic Grouping

1 code implementation • 30 May 2022 • Xin Wen, Bingchen Zhao, Anlin Zheng, Xiangyu Zhang, Xiaojuan Qi

The semantic grouping is performed by assigning pixels to a set of learnable prototypes, which can adapt to each sample by attentive pooling over the feature and form new slots.

Ranked #15 on Unsupervised Semantic Segmentation on COCO-Stuff-27 (Accuracy metric)

Contrastive Learning Instance Segmentation +6

Paper
Code

Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective

1 code implementation • 8 Mar 2022 • Quan Cui, Bingchen Zhao, Zhao-Min Chen, Borui Zhao, RenJie Song, Jiajun Liang, Boyan Zhou, Osamu Yoshie

This work simultaneously considers the discriminability and transferability properties of deep representations in the typical supervised learning task, i. e., image classification.

Image Classification Transfer Learning

Paper
Code

OOD-CV: A Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images

no code implementations • 29 Nov 2021 • Bingchen Zhao, Shaozuo Yu, Wufei Ma, Mingxin Yu, Shenxiao Mei, Angtian Wang, Ju He, Alan Yuille, Adam Kortylewski

One reason is that existing robustness benchmarks are limited, as they either rely on synthetic data or ignore the effects of individual nuisance factors.

3D Pose Estimation Benchmarking +5

Paper
Add Code

Improving Contrastive Learning by Visualizing Feature Transformation

1 code implementation • ICCV 2021 • Rui Zhu, Bingchen Zhao, Jingen Liu, Zhenglong Sun, Chang Wen Chen

To our knowledge, this is the first attempt of its kind.

Contrastive Learning Data Augmentation +2

Paper
Code

Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation

no code implementations • NeurIPS 2021 • Bingchen Zhao, Kai Han

In this paper, we tackle the problem of novel visual category discovery, i. e., grouping unlabelled images from new classes into different semantic partitions by leveraging a labelled dataset that contains images from other different but relevant categories.

Fine-Grained Visual Recognition Knowledge Distillation

Paper
Add Code

Rail-5k: a Real-World Dataset for Rail Surface Defects Detection

no code implementations • 28 Jun 2021 • Zihao Zhang, Shaozuo Yu, Siwei Yang, Yu Zhou, Bingchen Zhao

This paper presents the Rail-5k dataset for benchmarking the performance of visual algorithms in a real-world application scenario, namely the rail surface defects detection task.

4k Benchmarking

Paper
Add Code

Reducing the feature divergence of RGB and near-infrared images using Switchable Normalization

1 code implementation • 6 Jun 2021 • Siwei Yang, Shaozuo Yu, Bingchen Zhao, Yin Wang

Visual pattern recognition over agricultural areas is an important application of aerial image processing.

Paper
Code

Temporal Context Aggregation for Video Retrieval with Contrastive Learning

1 code implementation • 4 Aug 2020 • Jie Shao, Xin Wen, Bingchen Zhao, xiangyang xue

The current research focus on Content-Based Video Retrieval requires higher-level video representation describing the long-range semantic dependencies of relevant incidents, events, etc.

Ranked #6 on Video Retrieval on FIVR-200K

Contrastive Learning Representation Learning +2

Paper
Code

Distilling Visual Priors from Self-Supervised Learning

1 code implementation • 1 Aug 2020 • Bingchen Zhao, Xin Wen

Convolutional Neural Networks (CNNs) are prone to overfit small training datasets.

Classification Contrastive Learning +4

Paper
Code

The 1st Agriculture-Vision Challenge: Methods and Results

1 code implementation • 21 Apr 2020 • Mang Tik Chiu, Xingqian Xu, Kai Wang, Jennifer Hobbs, Naira Hovakimyan, Thomas S. Huang, Honghui Shi, Yunchao Wei, Zilong Huang, Alexander Schwing, Robert Brunner, Ivan Dozier, Wyatt Dozier, Karen Ghandilyan, David Wilson, Hyunseong Park, Junhee Kim, Sungho Kim, Qinghui Liu, Michael C. Kampffmeyer, Robert Jenssen, Arnt B. Salberg, Alexandre Barbosa, Rodrigo Trevisan, Bingchen Zhao, Shaozuo Yu, Siwei Yang, Yin Wang, Hao Sheng, Xiao Chen, Jingyi Su, Ram Rajagopal, Andrew Ng, Van Thong Huynh, Soo-Hyung Kim, In-Seop Na, Ujjwal Baid, Shubham Innani, Prasad Dutande, Bhakti Baheti, Sanjay Talbar, Jianyu Tang

The first Agriculture-Vision Challenge aims to encourage research in developing novel and effective algorithms for agricultural pattern recognition from aerial images, especially for the semantic segmentation task associated with our challenge dataset.

Segmentation Semantic Segmentation

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.