Search Results for author: Haoyuan Shi

Found 2 papers, 0 papers with code

Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment

no code implementations • 21 Feb 2024 • Yunxin Li, Xinyu Chen, Baotian Hu, Haoyuan Shi, Min Zhang

Evaluating and Rethinking the current landscape of Large Multimodal Models (LMMs), we observe that widely-used visual-language projection approaches (e.g., Q-former or MLP) focus on the alignment of image-text descriptions yet ignore the visual knowledge-dimension alignment, i.e., connecting visuals to their relevant knowledge.

Language Modelling, Question Answering, +1

Toward Moiré-Free and Detail-Preserving Demosaicking

no code implementations • 15 May 2023 • Xuanchen Li, Yan Niu, Bo Zhao, Haoyuan Shi, Zitong An

In both applications, our model substantially alleviates artifacts such as Moiré and over-smoothness at similar or lower computational cost to currently top-performing models, as validated by diverse evaluations.

Demosaicking, Denoising, +1
