Search Results for author: Yash Patel

Found 30 papers, 10 papers with code

Conformal Contextual Robust Optimization

no code implementations16 Oct 2023 Yash Patel, Sahana Rayan, Ambuj Tewari

Data-driven approaches to predict-then-optimize decision-making problems seek to mitigate the risk of uncertainty region misspecification in safety-critical settings.

Conformal Prediction Decision Making

Integrated Image and Location Analysis for Wound Classification: A Deep Learning Approach

no code implementations23 Aug 2023 Yash Patel, Tirth Shah, Mrinal Kanti Dhar, Taiyu Zhang, Jeffrey Niezgoda, Sandeep Gopalakrishnan, Zeyun Yu

The global burden of acute and chronic wounds presents a compelling case for enhancing wound classification methods, a vital step in diagnosing and determining optimal treatments.

Classification Image Classification

Diffusion Models for Probabilistic Deconvolution of Galaxy Images

1 code implementation20 Jul 2023 Zhiwei Xue, Yuhang Li, Yash Patel, Jeffrey Regier

As an alternative, we propose a classifier-free conditional diffusion model for PSF deconvolution of galaxy images.

Amortized Variational Inference with Coverage Guarantees

no code implementations23 May 2023 Yash Patel, Declan McNamara, Jackson Loper, Jeffrey Regier, Ambuj Tewari

We prove lower bounds on the predictive efficiency of the regions produced by CANVI and explore how the quality of a posterior approximation relates to the predictive efficiency of prediction regions based on that approximation.

Variational Inference

FUSegNet: A Deep Convolutional Neural Network for Foot Ulcer Segmentation

1 code implementation4 May 2023 Mrinal Kanti Dhar, Taiyu Zhang, Yash Patel, Sandeep Gopalakrishnan, Zeyun Yu

As the top decoder stage carries a limited number of feature maps, max-out scSE is bypassed there to form a shorted P-scSE.

Neural Network Training and Non-Differentiable Objective Functions

no code implementations3 May 2023 Yash Patel

An appropriate proxy has to be designed for a novel task, which may not be feasible for a non-specialist.

DocILE Benchmark for Document Information Localization and Extraction

1 code implementation11 Feb 2023 Štěpán Šimsa, Milan Šulc, Michal Uřičář, Yash Patel, Ahmed Hamdi, Matěj Kocián, Matyáš Skalický, Jiří Matas, Antoine Doucet, Mickaël Coustaty, Dimosthenis Karatzas

This paper introduces the DocILE benchmark with the largest dataset of business documents for the tasks of Key Information Localization and Extraction and Line Item Recognition.

Key Information Extraction Unsupervised Pre-training

SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation

no code implementations7 Feb 2023 Yash Patel, Yusheng Xie, Yi Zhu, Srikar Appalaraju, R. Manmatha

Instead of purely relying on the alignment from the noisy data, this paper proposes a novel loss function termed SimCon, which accounts for intra-modal similarities to determine the appropriate set of positive samples to align.

Semantic Segmentation

DocILE 2023 Teaser: Document Information Localization and Extraction

no code implementations29 Jan 2023 Štěpán Šimsa, Milan Šulc, Matyáš Skalický, Yash Patel, Ahmed Hamdi

The DocILE 2023 competition, hosted as a lab at the CLEF 2023 conference and as an ICDAR 2023 competition, will run the first major benchmark for the tasks of Key Information Localization and Extraction (KILE) and Line Item Recognition (LIR) from business documents.

Information Retrieval Retrieval

Generalized Differentiable RANSAC

2 code implementations ICCV 2023 Tong Wei, Yash Patel, Alexander Shekhovtsov, Jiri Matas, Daniel Barath

We propose $\nabla$-RANSAC, a generalized differentiable RANSAC that allows learning the entire randomized robust estimation pipeline.

Point Cloud Registration

RL Boltzmann Generators for Conformer Generation in Data-Sparse Environments

1 code implementation19 Nov 2022 Yash Patel, Ambuj Tewari

The generation of conformers has been a long-standing interest to structural chemists and biologists alike.

Contrastive Classification and Representation Learning with Probabilistic Interpretation

no code implementations7 Nov 2022 Rahaf Aljundi, Yash Patel, Milan Sulc, Daniel Olmeda, Nikolay Chumerin

In this work, we investigate the possibility of learning both the representation and the classifier using one objective function that combines the robustness of contrastive learning and the probabilistic interpretation of cross entropy loss.

Classification Contrastive Learning +1

Wound Severity Classification using Deep Neural Network

no code implementations17 Apr 2022 D. M. Anisuzzaman, Yash Patel, Jeffrey Niezgoda, Sandeep Gopalakrishnan, Zeyun Yu

This study used wound photos to construct a deep neural network-based wound severity classifier that classified them into one of three classes: green, yellow, or red.

Classification Multi-class Classification +1

Multi-modal Wound Classification using Wound Image and Location by Deep Neural Network

no code implementations14 Sep 2021 D. M. Anisuzzaman, Yash Patel, Behrouz Rostami, Jeffrey Niezgoda, Sandeep Gopalakrishnan, Zeyun Yu

This study developed a deep neural network-based multi-modal classifier using wound images and their corresponding locations to categorize wound images into multiple classes, including diabetic, pressure, surgical, and venous ulcers.

TAG

Recall@k Surrogate Loss with Large Batches and Similarity Mixup

2 code implementations CVPR 2022 Yash Patel, Giorgos Tolias, Jiri Matas

This work focuses on learning deep visual representation models for retrieval by exploring the interplay between a new loss function, the batch size, and a new regularization approach.

Image Retrieval Metric Learning +2

FEDS -- Filtered Edit Distance Surrogate

no code implementations8 Mar 2021 Yash Patel, Jiri Matas

This paper proposes a procedure to train a scene text recognition model using a robust learned surrogate of edit distance.

Scene Text Recognition

Neural Network-based Acoustic Vehicle Counting

no code implementations22 Oct 2020 Slobodan Djukanović, Yash Patel, Jiři Matas, Tuomas Virtanen

This distance is predicted from audio using a two-stage (coarse-fine) regression, with both stages realised via neural networks (NNs).

Distance regression regression

A Mobile App for Wound Localization using Deep Learning

1 code implementation15 Sep 2020 D. M. Anisuzzaman, Yash Patel, Jeffrey Niezgoda, Sandeep Gopalakrishnan, Zeyun Yu

We present an automated wound localizer from 2D wound and ulcer images by using deep neural network, as the first step towards building an automated and complete wound diagnostic system.

Learning Surrogates via Deep Embedding

no code implementations ECCV 2020 Yash Patel, Tomas Hodan, Jiri Matas

The effectiveness of the proposed technique is demonstrated in a post-tuning setup, where a trained model is tuned using the learned surrogate.

Scene Text Recognition

Saliency Driven Perceptual Image Compression

no code implementations12 Feb 2020 Yash Patel, Srikar Appalaraju, R. Manmatha

The proposed compression model incorporates the salient regions and optimizes on the proposed perceptual similarity metric.

Image Compression MS-SSIM +3

Human Perceptual Evaluations for Image Compression

no code implementations9 Aug 2019 Yash Patel, Srikar Appalaraju, R. Manmatha

Recently, there has been much interest in deep learning techniques to do image compression and there have been claims that several of these produce better results than engineered compression schemes (such as JPEG, JPEG2000 or BPG).

Image Compression MS-SSIM +1

Deep Perceptual Compression

no code implementations18 Jul 2019 Yash Patel, Srikar Appalaraju, R. Manmatha

In several cases, the MS-SSIM for deep learned techniques is higher than say a conventional, non-deep learned codec such as JPEG-2000 or BPG.

Image Compression MS-SSIM +3

Self-Supervised Visual Representations for Cross-Modal Retrieval

no code implementations31 Jan 2019 Yash Patel, Lluis Gomez, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar

Cross-modal retrieval methods have been significantly improved in last years with the use of deep neural networks and large-scale annotated datasets such as ImageNet and Places.

Cross-Modal Retrieval Image Classification +3

TextTopicNet - Self-Supervised Learning of Visual Features Through Embedding Images on Semantic Text Spaces

1 code implementation4 Jul 2018 Yash Patel, Lluis Gomez, Raul Gomez, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar

We show that adequate visual features can be learned efficiently by training a CNN to predict the semantic textual context in which a particular image is more probable to appear as an illustration.

Image Classification object-detection +3

Learning Sampling Policies for Domain Adaptation

no code implementations19 May 2018 Yash Patel, Kashyap Chitta, Bhavan Jasani

We address the problem of semi-supervised domain adaptation of classification algorithms through deep Q-learning.

Classification Domain Adaptation +3

E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text

3 code implementations30 Jan 2018 Michal Bušta, Yash Patel, Jiri Matas

An end-to-end trainable (fully differentiable) method for multi-language scene text localization and recognition is proposed.

Optical Character Recognition (OCR)

Self-supervised learning of visual features through embedding images into text topic spaces

no code implementations CVPR 2017 Lluis Gomez, Yash Patel, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar

End-to-end training from scratch of current deep architectures for new computer vision problems would require Imagenet-scale datasets, and this is not always possible.

Image Classification object-detection +3

Cannot find the paper you are looking for? You can Submit a new open access paper.