Search Results for author: Meng Cao

Found 54 papers, 22 papers with code

Cross-Modal Conditioned Reconstruction for Language-guided Medical Image Segmentation

no code implementations • 3 Apr 2024 • Xiaoshuang Huang, Hongxiang Li, Meng Cao, Long Chen, Chenyu You, Dong An

Recent developments underscore the potential of textual information in enhancing learning models for a deeper understanding of medical visual semantics.

Image Segmentation Medical Image Segmentation +2

Paper

Mechanisms of non-factual hallucinations in language models

1 code implementation • 27 Mar 2024 • Lei Yu, Meng Cao, Jackie Chi Kit Cheung, Yue Dong

Our study investigates the mechanistic causes of hallucination, specifically non-factual ones where the LM incorrectly predicts object attributes in response to subject-relation queries.

Attribute Hallucination +2

Paper
Code

depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning Researchers

1 code implementation • 14 Mar 2024 • Kaichao You, Runsheng Bai, Meng Cao, Jianmin Wang, Ion Stoica, Mingsheng Long

PyTorch 2.x introduces a compiler designed to accelerate deep learning programs.

274

Paper
Code

Predicting Learning Performance with Large Language Models: A Study in Adult Literacy

no code implementations • 4 Mar 2024 • Liang Zhang, Jionghao Lin, Conrad Borchers, John Sabatini, John Hollander, Meng Cao, Xiangen Hu

This research is motivated by the potential of LLMs to predict learning performance based on their inherent reasoning and computational capabilities.

Knowledge Tracing Reading Comprehension

Paper

Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation

no code implementations • 19 Feb 2024 • Aiwei Liu, Haoping Bai, Zhiyun Lu, Xiang Kong, Simon Wang, Jiulong Shan, Meng Cao, Lijie Wen

In this paper, we propose a method to evaluate the response preference by using the output probabilities of response pairs under contrastive prompt pairs, which could achieve better performance on LLaMA2-7B and LLaMA2-13B compared to RLAIF.

Language Modelling Large Language Model

Paper

Recommendation Fairness in Social Networks Over Time

no code implementations • 5 Feb 2024 • Meng Cao, Hussain Hussain, Sandipan Sikdar, Denis Helic, Markus Strohmaier, Roman Kern

We further study how interventions on network properties influence fairness by examining counterfactual scenarios with alternative evolution outcomes and differing network properties.

counterfactual Fairness +1

Paper

3DG: A Framework for Using Generative AI for Handling Sparse Learner Performance Data From Intelligent Tutoring Systems

no code implementations • 29 Jan 2024 • Liang Zhang, Jionghao Lin, Conrad Borchers, Meng Cao, Xiangen Hu

Learning performance data (e.g., quiz scores and attempts) is significant for understanding learner engagement and knowledge mastery level.

Generative Adversarial Network Imputation

Paper

Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation

no code implementations • 14 Jan 2024 • Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers, Yinxiao Liu, Lei Meng

We investigate this approach under two different settings: one where the policy model is smaller and is paired with a more powerful critic model, and another where a single language model fulfills both roles.

Language Modelling reinforcement-learning +2

Paper

Responsible AI Considerations in Text Summarization Research: A Review of Current Practices

no code implementations • 18 Nov 2023 • Yu Lu Liu, Meng Cao, Su Lin Blodgett, Jackie Chi Kit Cheung, Alexandra Olteanu, Adam Trischler

We focus on how, which, and when responsible AI issues are covered, which relevant stakeholders are considered, and mismatches between stated and realized research goals.

Text Summarization

Paper

Exploring Recommendation Capabilities of GPT-4V(ision): A Preliminary Case Study

no code implementations • 7 Nov 2023 • Peilin Zhou, Meng Cao, You-Liang Huang, Qichen Ye, Peiyan Zhang, Junling Liu, Yueqi Xie, Yining Hua, Jaeboum Kim

Large Multimodal Models (LMMs) have demonstrated impressive performance across various vision and language tasks, yet their potential applications in recommendation tasks with visual assistance remain unexplored.

General Knowledge Reading Comprehension

Paper

Successor Features for Efficient Multisubject Controlled Text Generation

no code implementations • 3 Nov 2023 • Meng Cao, Mehdi Fatemi, Jackie Chi Kit Cheung, Samira Shabanian

While large language models (LLMs) have achieved impressive performance in generating fluent and realistic text, controlling the generated text so that it exhibits properties such as safety, factuality, and non-toxicity remains challenging.

Computational Efficiency Language Modelling +1

Paper

Video Referring Expression Comprehension via Transformer with Content-conditioned Query

no code implementations • 25 Oct 2023 • Ji Jiang, Meng Cao, Tengtao Song, Long Chen, Yi Wang, Yuexian Zou

Video Referring Expression Comprehension (REC) aims to localize a target object in videos based on the queried natural language.

Referring Expression Referring Expression Comprehension +1

Paper

Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model

1 code implementation • 13 Oct 2023 • Qichen Ye, Junling Liu, Dading Chong, Peilin Zhou, Yining Hua, Fenglin Liu, Meng Cao, ZiMing Wang, Xuxin Cheng, Zhu Lei, Zhenhua Guo

In the CPT and SFT phases, Qilin-Med achieved 38.4% and 40.0% accuracy on the CMExam test set, respectively.

Knowledge Graphs Language Modelling +2

Paper
Code

VeCLIP: Improving CLIP Training via Visual-enriched Captions

1 code implementation • 11 Oct 2023 • Zhengfeng Lai, Haotian Zhang, BoWen Zhang, Wentao Wu, Haoping Bai, Aleksei Timofeev, Xianzhi Du, Zhe Gan, Jiulong Shan, Chen-Nee Chuah, Yinfei Yang, Meng Cao

For example, VeCLIP achieves up to +25.2% gain in COCO and Flickr30k retrieval tasks under the 12M setting.

Retrieval Text Retrieval +1

173

Paper
Code

G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory

1 code implementation • ICCV 2023 • Hongxiang Li, Meng Cao, Xuxin Cheng, Yaowei Li, Zhihong Zhu, Yuexian Zou

Due to two annoying issues in video grounding: (1) the co-existence of some visual entities in both ground truth and other moments, i.e., semantic overlapping; (2) only a few moments in the video are annotated, i.e., the sparse annotation dilemma, vanilla contrastive learning is unable to model the correlations between temporally distant moments and learns inconsistent video representations.

Contrastive Learning Video Grounding

Paper
Code

Improving Retrieval-Augmented Large Language Models via Data Importance Learning

1 code implementation • 6 Jul 2023 • Xiaozhong Lyu, Stefan Grafberger, Samantha Biegel, Shaopeng Wei, Meng Cao, Sebastian Schelter, Ce Zhang

There are exponentially many terms in the multilinear extension, and one key contribution of this paper is a polynomial time algorithm that computes exactly, given a retrieval-augmented model with an additive utility function and a validation set, the data importance of data points in the retrieval corpus using the multilinear extension of the model's utility function.

Imputation Question Answering +1

Paper
Code

Improving Reference-based Distinctive Image Captioning with Contrastive Rewards

no code implementations • 25 Jun 2023 • Yangjun Mao, Jun Xiao, Dong Zhang, Meng Cao, Jian Shao, Yueting Zhuang, Long Chen

A recent DIC method proposes to generate distinctive captions by comparing the target image with a set of semantic-similar reference images, i.e., reference-based DIC (Ref-DIC).

Benchmarking Contrastive Learning +1

Paper

VISION Datasets: A Benchmark for Vision-based InduStrial InspectiON

no code implementations • 13 Jun 2023 • Haoping Bai, Shancong Mou, Tatiana Likhomanenko, Ramazan Gokberk Cinbis, Oncel Tuzel, Ping Huang, Jiulong Shan, Jianjun Shi, Meng Cao

We introduce the VISION Datasets, a diverse collection of 14 industrial inspection datasets, uniquely poised to meet these challenges.

Defect Detection Instance Segmentation +1

Paper

Efficient ConvBN Blocks for Transfer Learning and Beyond

1 code implementation • 19 May 2023 • Kaichao You, Guo Qin, Anchang Bao, Meng Cao, Ping Huang, Jiulong Shan, Mingsheng Long

Subsequently, we propose a novel Tune mode to bridge the gap between Eval mode and Deploy mode.

Computational Efficiency object-detection +2

Paper
Code

Systematic Rectification of Language Models via Dead-end Analysis

1 code implementation • 27 Feb 2023 • Meng Cao, Mehdi Fatemi, Jackie Chi Kit Cheung, Samira Shabanian

Other methods rely on rule-based or prompt-based token elimination, which are limited as they dismiss future tokens and the overall meaning of the complete discourse.

Reinforcement Learning (RL)

Paper
Code

RGI: robust GAN-inversion for mask-free image inpainting and unsupervised pixel-wise anomaly detection

no code implementations • 24 Feb 2023 • Shancong Mou, Xiaoyi Gu, Meng Cao, Haoping Bai, Ping Huang, Jiulong Shan, Jianjun Shi

In this paper, we propose a Robust GAN-inversion (RGI) method with a provable robustness guarantee to achieve image restoration under unknown gross corruptions, where a small fraction of pixels are completely corrupted.

Anomaly Detection Image Inpainting +1

Paper

Learning with Rejection for Abstractive Text Summarization

1 code implementation • 16 Feb 2023 • Meng Cao, Yue Dong, Jingyi He, Jackie Chi Kit Cheung

State-of-the-art abstractive summarization systems frequently hallucinate content that is not supported by the source document, mainly due to noise in the training dataset.

Abstractive Text Summarization

Paper
Code

Exploiting Auxiliary Caption for Video Grounding

no code implementations • 15 Jan 2023 • Hongxiang Li, Meng Cao, Xuxin Cheng, Zhihong Zhu, Yaowei Li, Yuexian Zou

Video grounding aims to locate a moment of interest matching the given query sentence from an untrimmed video.

Contrastive Learning Dense Video Captioning +2

Paper

Iterative Proposal Refinement for Weakly-Supervised Video Grounding

no code implementations • CVPR 2023 • Meng Cao, Fangyun Wei, Can Xu, Xiubo Geng, Long Chen, Can Zhang, Yuexian Zou, Tao Shen, Daxin Jiang

Weakly-Supervised Video Grounding (WSVG) aims to localize events of interest in untrimmed videos with only video-level annotations.

Sentence Video Grounding

Paper

Video Referring Expression Comprehension via Transformer with Content-aware Query

1 code implementation • 6 Oct 2022 • Ji Jiang, Meng Cao, Tengtao Song, Yuexian Zou

To this end, we introduce two new datasets (i.e., VID-Entity and VidSTG-Entity) by augmenting the VIDSentence and VidSTG datasets with the explicitly referred words in the whole sentence, respectively.

Referring Expression Referring Expression Comprehension +1

Paper
Code

Adversarial Inter-Group Link Injection Degrades the Fairness of Graph Neural Networks

1 code implementation • 13 Sep 2022 • Hussain Hussain, Meng Cao, Sandipan Sikdar, Denis Helic, Elisabeth Lex, Markus Strohmaier, Roman Kern

We hope our findings raise awareness about this issue in our community and lay a foundation for the future development of GNN models that are more robust to such attacks.

Fairness Node Classification

Paper
Code

Latent Heterogeneous Graph Network for Incomplete Multi-View Learning

no code implementations • 29 Aug 2022 • Pengfei Zhu, Xinjie Yao, Yu Wang, Meng Cao, Binyuan Hui, Shuai Zhao, QinGhua Hu

Multi-view learning has progressed rapidly in recent years.

Graph Learning MULTI-VIEW LEARNING +1

Paper

Correspondence Matters for Video Referring Expression Comprehension

1 code implementation • 21 Jul 2022 • Meng Cao, Ji Jiang, Long Chen, Yuexian Zou

Extensive experiments demonstrate that our DCNet achieves state-of-the-art performance on both video and image REC benchmarks.

Contrastive Learning Referring Expression +3

Paper
Code

LocVTP: Video-Text Pre-training for Temporal Localization

1 code implementation • 21 Jul 2022 • Meng Cao, Tianyu Yang, Junwu Weng, Can Zhang, Jue Wang, Yuexian Zou

To further enhance the temporal reasoning ability of the learned feature, we propose a context projection head and a temporal aware contrastive loss to perceive the contextual relationships.

Retrieval Temporal Localization +1

Paper
Code

A Survey on Neural Abstractive Summarization Methods and Factual Consistency of Summarization

no code implementations • 20 Apr 2022 • Meng Cao

Automatic summarization is the process of shortening a set of textual data computationally, to create a subset (a summary) that represents the most important pieces of information in the original text.

Abstractive Text Summarization

Paper

Jacobian Norm for Unsupervised Source-Free Domain Adaptation

no code implementations • 7 Apr 2022 • Weikai Li, Meng Cao, Songcan Chen

Unsupervised Source (data) Free domain adaptation (USFDA) aims to transfer knowledge from a well-trained source model to a related but unlabeled target domain.

Source-Free Domain Adaptation

Paper

PAEDID: Patch Autoencoder Based Deep Image Decomposition For Pixel-level Defective Region Segmentation

no code implementations • 28 Mar 2022 • Shancong Mou, Meng Cao, Haoping Bai, Ping Huang, Jianjun Shi, Jiulong Shan

To combine the best of both worlds, we present an unsupervised patch autoencoder based deep image decomposition (PAEDID) method for defective region segmentation.

Anomaly Detection

Paper

Unsupervised Pre-training for Temporal Action Localization Tasks

1 code implementation • CVPR 2022 • Can Zhang, Tianyu Yang, Junwu Weng, Meng Cao, Jue Wang, Yuexian Zou

These pre-trained models can be sub-optimal for temporal localization tasks due to the inherent discrepancy between video-level classification and clip-level localization.

Contrastive Learning Representation Learning +4

Paper
Code

Synthetic Defect Generation for Display Front-of-Screen Quality Inspection: A Survey

no code implementations • 3 Mar 2022 • Shancong Mou, Meng Cao, Zhendong Hong, Ping Huang, Jiulong Shan, Jianjun Shi

Display front-of-screen (FOS) quality inspection is essential for the mass production of displays in the manufacturing process.

Synthetic Data Generation

Paper

Information Gain Propagation: a new way to Graph Active Learning with Soft Labels

1 code implementation • ICLR 2022 • Wentao Zhang, Yexin Wang, Zhenbang You, Meng Cao, Ping Huang, Jiulong Shan, Zhi Yang, Bin Cui

Graph Neural Networks (GNNs) have achieved great success in various tasks, but their performance highly relies on a large number of labeled nodes, which typically requires considerable human effort.

Active Learning

Paper
Code

Self-supervised Semi-supervised Learning for Data Labeling and Quality Evaluation

no code implementations • 22 Nov 2021 • Haoping Bai, Meng Cao, Ping Huang, Jiulong Shan

On active learning task, our method achieves 97.0% Top-1 Accuracy on CIFAR10 with 0.1% annotated data, and 83.9% Top-1 Accuracy on CIFAR100 with 10% annotated data.

Active Learning Representation Learning

Paper

RIM: Reliable Influence-based Active Learning on Graphs

1 code implementation • NeurIPS 2021 • Wentao Zhang, Yexin Wang, Zhenbang You, Meng Cao, Ping Huang, Jiulong Shan, Zhi Yang, Bin Cui

Message passing is the core of most graph models such as Graph Convolutional Network (GCN) and Label Propagation (LP), which usually require a large number of clean labeled data to smooth out the neighborhood over the graph.

Active Learning

Paper
Code

On Pursuit of Designing Multi-modal Transformer for Video Grounding

no code implementations • EMNLP 2021 • Meng Cao, Long Chen, Mike Zheng Shou, Can Zhang, Yuexian Zou

Almost all existing video grounding methods fall into two frameworks: 1) Top-down model: It predefines a set of segment candidates and then conducts segment classification and regression.

Sentence Video Grounding

Paper

Hallucinated but Factual! Inspecting the Factuality of Hallucinations in Abstractive Summarization

1 code implementation • ACL 2022 • Meng Cao, Yue Dong, Jackie Chi Kit Cheung

State-of-the-art abstractive summarization systems often generate hallucinations; i.e., content that is not directly inferable from the source text.

Abstractive Text Summarization Reinforcement Learning (RL) +1

Paper
Code

Deep Motion Prior for Weakly-Supervised Temporal Action Localization

no code implementations • 12 Aug 2021 • Meng Cao, Can Zhang, Long Chen, Mike Zheng Shou, Yuexian Zou

In this paper, we observe that the motion cues behind the optical flow features provide complementary information.

Optical Flow Estimation Weakly-supervised Temporal Action Localization +1

Paper

UniFaceGAN: A Unified Framework for Temporally Consistent Facial Video Editing

no code implementations • 12 Aug 2021 • Meng Cao, HaoZhi Huang, Hao Wang, Xuan Wang, Li Shen, Sheng Wang, Linchao Bao, Zhifeng Li, Jiebo Luo

Compared with the state-of-the-art facial image editing methods, our framework generates video portraits that are more photo-realistic and temporally smooth.

3D Reconstruction Face Reenactment +3

Paper

All You Need is a Second Look: Towards Arbitrary-Shaped Text Detection

no code implementations • 24 Jun 2021 • Meng Cao, Can Zhang, Dongming Yang, Yuexian Zou

Compared to the traditional single-stage segmentation network, our NASK conducts the detection in a coarse-to-fine manner with the first stage segmentation spotting the rectangle text proposals and the second one retrieving compact representations.

Instance Segmentation Segmentation +2

Paper

BatchQuant: Quantized-for-all Architecture Search with Robust Quantizer

no code implementations • NeurIPS 2021 • Haoping Bai, Meng Cao, Ping Huang, Jiulong Shan

While single-shot quantized neural architecture search enjoys flexibility in both model architecture and quantization policy, the combined search space comes with many challenges, including instability when training the weight-sharing supernet and difficulty in navigating the exponentially growing search space.

Hardware Aware Neural Architecture Search Model Optimization +2

Paper

Video Frame Interpolation via Structure-Motion based Iterative Fusion

no code implementations • 11 May 2021 • Xi Li, Meng Cao, Yingying Tang, Scott Johnston, Zhendong Hong, Huimin Ma, Jiulong Shan

Inspired by the observation that audiences have different visual preferences on foreground and background objects, we for the first time propose to use saliency masks in the evaluation processes of the task of video frame interpolation.

Optical Flow Estimation Video Frame Interpolation

Paper

RR-Net: Injecting Interactive Semantics in Human-Object Interaction Detection

no code implementations • 30 Apr 2021 • Dongming Yang, Yuexian Zou, Can Zhang, Meng Cao, Jie Chen

Upon the frame, an Interaction Intensifier Module and a Correlation Parsing Module are carefully designed, where: a) interactive semantics from humans can be exploited and passed to objects to intensify interactions, b) interactive correlations among humans, objects and interactions are integrated to promote predictions.

Human-Object Interaction Detection Relation

Paper

CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning

1 code implementation • CVPR 2021 • Can Zhang, Meng Cao, Dongming Yang, Jie Chen, Yuexian Zou

In this paper, we argue that learning by comparing helps identify these hard snippets and we propose to utilize snippet Contrastive learning to Localize Actions, CoLA for short.

Ranked #4 on Weakly Supervised Action Localization on ActivityNet-1.2

CoLA Contrastive Learning +3

Paper
Code

Quantum error-correcting codes from matrix-product codes related to quasi-orthogonal and quasi-unitary matrices

no code implementations • 31 Dec 2020 • Meng Cao

The construction of matrix-product codes with certain self-orthogonality over finite fields is an effective way to obtain good $q$-ary quantum codes of large length.

Information Theory Quantum Physics

Paper

Factual Error Correction for Abstractive Summarization Models

1 code implementation • EMNLP 2020 • Meng Cao, Yue Dong, Jiapeng Wu, Jackie Chi Kit Cheung

Experimental results show that our model is able to correct factual errors in summaries generated by other neural summarization models and outperforms previous models on factual consistency evaluation on the CNN/DailyMail dataset.

Abstractive Text Summarization

Paper
Code

TeMP: Temporal Message Passing for Temporal Knowledge Graph Completion

1 code implementation • EMNLP 2020 • Jiapeng Wu, Meng Cao, Jackie Chi Kit Cheung, William L. Hamilton

Our analysis also reveals important sources of variability both within and across TKG datasets, and we introduce several simple but strong baselines that outperform the prior state of the art in certain settings.

Imputation Temporal Knowledge Graph Completion

Paper
Code

An artificial intelligence system for predicting the deterioration of COVID-19 patients in the emergency department

1 code implementation • 4 Aug 2020 • Farah E. Shamout, Yiqiu Shen, Nan Wu, Aakash Kaku, Jungkyu Park, Taro Makino, Stanisław Jastrzębski, Duo Wang, Ben Zhang, Siddhant Dogra, Meng Cao, Narges Razavian, David Kudlowitz, Lea Azour, William Moore, Yvonne W. Lui, Yindalon Aphinyanaphongs, Carlos Fernandez-Granda, Krzysztof J. Geras

In order to verify performance in a real clinical setting, we silently deployed a preliminary version of the deep neural network at New York University Langone Health during the first wave of the pandemic, which produced accurate predictions in real-time.

COVID-19 Diagnosis Decision Making +1

Paper
Code

Task-agnostic Temporally Consistent Facial Video Editing

no code implementations • 3 Jul 2020 • Meng Cao, Hao-Zhi Huang, Hao Wang, Xuan Wang, Li Shen, Sheng Wang, Linchao Bao, Zhifeng Li, Jiebo Luo

Compared with the state-of-the-art facial image editing methods, our framework generates video portraits that are more photo-realistic and temporally smooth.

3D Reconstruction Video Editing

Paper

All you need is a second look: Towards Tighter Arbitrary shape text detection

no code implementations • 26 Apr 2020 • Meng Cao, Yuexian Zou

Specifically, NASK consists of a Text Instance Segmentation network, namely TIS (1st stage), a Text RoI Pooling module, and a Fiducial pOint eXpression module termed FOX (2nd stage).

Instance Segmentation Scene Text Detection +3

Paper

Unsupervised Domain Adaptation Through Transferring both the Source-Knowledge and Target-Relatedness Simultaneously

no code implementations • 18 Mar 2020 • Qing Tian, Yanan Zhu, Chuang Ma, Meng Cao

Unsupervised domain adaptation (UDA) is an emerging research topic in the field of machine learning and pattern recognition, which aims to help learning on the unlabeled target domain by transferring knowledge from the source domain.

BIG-bench Machine Learning Unsupervised Domain Adaptation

Paper

Referring Expression Generation Using Entity Profiles

1 code implementation • IJCNLP 2019 • Meng Cao, Jackie Chi Kit Cheung

Referring Expression Generation (REG) is the task of generating contextually appropriate references to entities.

Referring Expression Referring expression generation

Paper
Code
