Search Results for author: Chiuman Ho

Found 7 papers, 2 papers with code

Catch Missing Details: Image Reconstruction with Frequency Augmented Variational Autoencoder

1 code implementation CVPR 2023 Xinmiao Lin, Yikang Li, Jenhao Hsiao, Chiuman Ho, Yu Kong

The popular VQ-VAE models reconstruct images through learning a discrete codebook but suffer from a significant issue in the rapid quality degradation of image reconstruction as the compression rate rises.

Image Generation Image Reconstruction

ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient Self-Supervised Monocular Depth Estimation

no code implementations12 Dec 2022 Daitao Xing, Jinglin Shen, Chiuman Ho, Anthony Tzes

The exploration of mutual-benefit cross-domains has shown great potential toward accurate self-supervised depth estimation.

Monocular Depth Estimation

Open Vocabulary Multi-Label Classification with Dual-Modal Decoder on Aligned Visual-Textual Features

no code implementations19 Aug 2022 Shichao Xu, Yikang Li, Jenhao Hsiao, Chiuman Ho, Zhu Qi

In computer vision, multi-label recognition are important tasks with many real-world applications, but classifying previously unseen labels remains a significant challenge.

Classification Multi-Label Classification +1

Dual-Flattening Transformers through Decomposed Row and Column Queries for Semantic Segmentation

no code implementations22 Jan 2022 Ying Wang, Chiuman Ho, Wenju Xu, Ziwei Xuan, Xudong Liu, Guo-Jun Qi

We propose a Dual-Flattening Transformer (DFlatFormer) to enable high-resolution output by reducing complexity to $\mathcal{O}(hw(H+W))$ that is multiple orders of magnitude smaller than the naive dense transformer.

Semantic Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.