Search Results for author: Mingxiao Li

Found 14 papers, 6 papers with code

Animate Your Motion: Turning Still Images into Dynamic Videos

no code implementations • 15 Mar 2024 • Mingxiao Li, Bo Wan, Marie-Francine Moens, Tinne Tuytelaars

For the first time, we integrate both semantic and motion cues within a diffusion model for video generation, as demonstrated in Fig 1.

Specificity Text-to-Video Generation +1

Paper
Add Code

NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties

no code implementations • 2 Feb 2024 • Jingyuan Sun, Mingxiao Li, Zijiao Chen, Marie-Francine Moens

In the pursuit to understand the intricacies of human brain's visual processing, reconstructing dynamic visual experiences from brain activities emerges as a challenging yet fascinating endeavor.

Contrastive Learning SSIM +1

Paper
Add Code

Generating Explanations in Medical Question-Answering by Expectation Maximization Inference over Evidence

no code implementations • 2 Oct 2023 • Wei Sun, Mingxiao Li, Damien Sileo, Jesse Davis, Marie-Francine Moens

Medical Question Answering~(medical QA) systems play an essential role in assisting healthcare workers in finding answers to their questions.

Explanation Generation Question Answering

Paper
Add Code

Decoding Realistic Images from Brain Activity with Contrastive Self-supervision and Latent Diffusion

no code implementations • 30 Sep 2023 • Jingyuan Sun, Mingxiao Li, Marie-Francine Moens

Reconstructing visual stimuli from human brain activities provides a promising opportunity to advance our understanding of the brain's visual system and its connection with computer vision models.

Contrastive Learning

Paper
Add Code

Elucidating the Exposure Bias in Diffusion Models

4 code implementations • 29 Aug 2023 • Mang Ning, Mingxiao Li, Jianlin Su, Albert Ali Salah, Itir Onal Ertugrul

In this paper, we systematically investigate the exposure bias problem in diffusion models by first analytically modelling the sampling distribution, based on which we then attribute the prediction error at each sampling step as the root cause of the exposure bias issue.

Ranked #9 on Image Generation on CIFAR-10

Attribute Image Generation

Paper
Code

Contrast, Attend and Diffuse to Decode High-Resolution Images from Brain Activities

1 code implementation • NeurIPS 2023 • Jingyuan Sun, Mingxiao Li, Zijiao Chen, Yunhao Zhang, Shaonan Wang, Marie-Francine Moens

The second phase tunes the feature learner to attend to neural activation patterns most informative for visual reconstruction with guidance from an image auto-encoder.

Ranked #1 on Brain Visual Reconstruction from fMRI on GOD

Brain Decoding Brain Visual Reconstruction from fMRI +1

Paper
Code

Alleviating Exposure Bias in Diffusion Models through Sampling with Shifted Time Steps

1 code implementation • 24 May 2023 • Mingxiao Li, Tingyu Qu, Ruicong Yao, Wei Sun, Marie-Francine Moens

In this work, we conduct a systematic study of exposure bias in DPM and, intriguingly, we find that the exposure bias could be alleviated with a novel sampling method that we propose, without retraining the model.

Denoising

Paper
Code

Crossword: A Semantic Approach to Data Compression via Masking

no code implementations • 3 Apr 2023 • Mingxiao Li, Rui Jin, Liyao Xiang, Kaiming Shen, Shuguang Cui

The traditional methods for data compression are typically based on the symbol-level statistics, with the information source modeled as a long sequence of i. i. d.

Data Compression

Paper
Add Code

Layout-aware Dreamer for Embodied Referring Expression Grounding

1 code implementation • 30 Nov 2022 • Mingxiao Li, Zehao Wang, Tinne Tuytelaars, Marie-Francine Moens

In this work, we study the problem of Embodied Referring Expression Grounding, where an agent needs to navigate in a previously unseen environment and localize a remote object described by a concise high-level natural language instruction.

Common Sense Reasoning Navigate +1

Paper
Code

Find a Way Forward: a Language-Guided Semantic Map Navigator

no code implementations • 7 Mar 2022 • Zehao Wang, Mingxiao Li, Minye Wu, Marie-Francine Moens, Tinne Tuytelaars

In this paper, we introduce the map-language navigation task where an agent executes natural language instructions and moves to the target position based only on a given 3D semantic map.

Imitation Learning

Paper
Add Code

Modeling Coreference Relations in Visual Dialog

no code implementations • EACL 2021 • Mingxiao Li, Marie-Francine Moens

Visual dialog is a vision-language task where an agent needs to answer a series of questions grounded in an image based on the understanding of the dialog history and the image.

Question Answering Visual Dialog +1

Paper
Add Code

Dynamic Key-value Memory Enhanced Multi-step Graph Reasoning for Knowledge-based Visual Question Answering

1 code implementation • 6 Mar 2022 • Mingxiao Li, Marie-Francine Moens

Knowledge-based visual question answering (VQA) is a vision-language task that requires an agent to correctly answer image-related questions using knowledge that is not presented in the given image.

Graph Attention Question Answering +1

Paper
Code

Towards Understanding Iterative Magnitude Pruning: Why Lottery Tickets Win

no code implementations • 13 Jun 2021 • Jaron Maene, Mingxiao Li, Marie-Francine Moens

The lottery ticket hypothesis states that sparse subnetworks exist in randomly initialized dense networks that can be trained to the same accuracy as the dense network they reside in.

Linear Mode Connectivity

Paper
Add Code

Multiscale Dynamic Human Mobility Flow Dataset in the U.S. during the COVID-19 Epidemic

6 code implementations • 27 Aug 2020 • Yuhao Kang, Song Gao, Yunlei Liang, Mingxiao Li, Jinmeng Rao, Jake Kruse

Understanding dynamic human mobility changes and spatial interaction patterns at different geographic scales is crucial for monitoring and measuring the impacts of non-pharmaceutical interventions (such as stay-at-home orders) during the pandemic.

Social and Information Networks Physics and Society

125

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.