no code implementations • 1 May 2024 • Zhihao Guo, Peng Wang
Additionally, we integrated various monocular depth estimation methods into the removal NeRF model, i. e., SpinNeRF, to assess their capacity to improve object removal performance.
no code implementations • 29 Feb 2024 • Xin Li, Yunfei Wu, Xinghua Jiang, Zhihao Guo, Mingming Gong, Haoyu Cao, Yinsong Liu, Deqiang Jiang, Xing Sun
It can represent that the contrastive learning between the visual holistic representations and the multimodal fine-grained features of document objects can assist the vision encoder in acquiring more effective visual cues, thereby enhancing the comprehension of text-rich documents in LVLMs.
1 code implementation • 13 Mar 2023 • Xiaopeng Yan, Yindi Yang, Zhihao Guo, Liangliang Peng, Lei Xie
This paper describes our NPU-Elevoc personalized speech enhancement system (NAPSE) for the 5th Deep Noise Suppression Challenge at ICASSP 2023.