Search Results for author: Rowel Atienza

Found 12 papers, 12 papers with code

Scene Text Recognition Models Explainability Using Local Features

1 code implementation • 14 Oct 2023 • Mark Vincent Ty, Rowel Atienza

In this work, the problem of interest is Scene Text Recognition (STR) Explainability, using XAI to understand the cause of an STR model's prediction.

Scene Text Recognition

EfficientSpeech: An On-Device Text to Speech Model

1 code implementation • 23 May 2023 • Rowel Atienza

State of the art (SOTA) neural text to speech (TTS) models can generate natural-sounding synthetic voices.

Scene Text Recognition with Permuted Autoregressive Sequence Models

1 code implementation • 14 Jul 2022 • Darwin Bautista, Rowel Atienza

Context-aware STR methods typically use internal autoregressive (AR) language models (LM).

Ranked #4 on Scene Text Recognition on COCO-Text (using extra training data)

Language Modelling, Scene Text Recognition

Depth Pruning with Auxiliary Networks for TinyML

1 code implementation • 22 Apr 2022 • Josen Daniel De Leon, Rowel Atienza

Pruning is a neural network optimization technique that sacrifices accuracy in exchange for lower computational requirements.

Keyword Spotting

Improving Model Generalization by Agreement of Learned Representations from Data Augmentation

1 code implementation • 20 Oct 2021 • Rowel Atienza

Experimental results further show that unlike other regularization terms such as label smoothing, AgMax can take advantage of the data augmentation to consistently improve model generalization by a significant margin.

Data Augmentation, Object Detection

Data Augmentation for Scene Text Recognition

1 code implementation • 16 Aug 2021 • Rowel Atienza

Scene text recognition (STR) is a challenging task in computer vision due to the large number of possible text appearances in natural scenes.

Image Augmentation, Scene Text Recognition

GOO: A Dataset for Gaze Object Prediction in Retail Environments

1 code implementation • 22 May 2021 • Henri Tomas, Marcus Reyes, Raimarc Dionido, Mark Ty, Jonric Mirando, Joel Casimiro, Rowel Atienza, Richard Guinto

To this end, we present a challenging new task called gaze object prediction, where the goal is to predict a bounding box for a person's gazed-at object.

Domain Adaptation, Gaze Estimation

Vision Transformer for Fast and Efficient Scene Text Recognition

3 code implementations • 18 May 2021 • Rowel Atienza

On a comparable strong baseline method such as TRBA with accuracy of 84.3%, our small ViTSTR achieves a competitive accuracy of 82.6% (84.2% with data augmentation) at 2.4x speed up, using only 43.4% of the number of parameters and 42.2% FLOPS.

Computational Efficiency, Data Augmentation

Next-Best View Policy for 3D Reconstruction

2 code implementations • 28 Aug 2020 • Daryl Peralta, Joel Casimiro, Aldrin Michael Nilles, Justine Aletta Aguilar, Rowel Atienza, Rhandley Cajote

Our experiments show that using Scan-RL, the agent can scan houses in fewer steps and over a shorter distance than our baseline circular path.

3D Reconstruction

Pyramid U-Network for Skeleton Extraction From Shape Points

1 code implementation • IEEE 2019 CVPR Workshop 2019 • Rowel Atienza

PSPU-SkelNet is a pyramid of three U-Nets that predicts the skeleton from a given shape point cloud.

A Conditional Generative Adversarial Network for Rendering Point Clouds

1 code implementation • IEEE 2019 CVPR Workshop 2019 • Rowel Atienza

In computer graphics, point clouds from laser scanning devices are difficult to render into photo-realistic images due to lack of information they carry about color, normal, lighting, and connection between points.

Generative Adversarial Network, Surface Reconstruction

Fast Disparity Estimation using Dense Networks

1 code implementation • 19 May 2018 • Rowel Atienza

Disparity estimation is a difficult problem in stereo vision because the correspondence technique fails in images with textureless and repetitive regions.

Disparity Estimation
