no code implementations • 26 Feb 2024 • Bidur Khanal, Prashant Shrestha, Sanskar Amgain, Bishesh Khanal, Binod Bhattarai, Cristian A. Linte
Label noise in medical image classification datasets significantly hampers the training of supervised deep learning methods, undermining their generalizability.
no code implementations • 15 Feb 2024 • Sanskar Amgain, Prashant Shrestha, Sophia Bano, Ignacio del Valle Torres, Michael Cunniffe, Victor Hernandez, Phil Beales, Binod Bhattarai
Purpose: We apply federated learning to train an OCT image classifier simulating a realistic scenario with multiple clients and statistical heterogeneous data distribution where data in the clients lack samples of some categories entirely.
no code implementations • 15 Jan 2024 • Bidur Khanal, Binod Bhattarai, Bishesh Khanal, Cristian Linte
In this study, we address three key questions: i) How does label noise impact various medical image classification datasets?
no code implementations • 11 Dec 2023 • Prashant Shrestha, Sanskar Amgain, Bidur Khanal, Cristian A. Linte, Binod Bhattarai
Medical Vision Language Pretraining (VLP) has recently emerged as a promising solution to the scarcity of labeled data in the medical domain.
no code implementations • 4 Dec 2023 • Mulham Fawakherji, Eduard Vazquez, Pasquale Giampa, Binod Bhattarai
In this study, we investigate the effectiveness of two computer vision data augmentation techniques: cutout and cutmix, for text augmentation in multi-modal person re-identification.
1 code implementation • 4 Dec 2023 • Razvan Caramalau, Binod Bhattarai, Danail Stoyanov
In this paper, we introduce Active Learning framework in Federated Learning for Target Domain Generalisation, harnessing the strength from both learning paradigms.
no code implementations • 30 Nov 2023 • Suman Sapkota, Binod Bhattarai
The recent success of multiple neural architectures like CNNs, Transformers, and MLP-Mixers motivated us to look for similarities and differences between them.
no code implementations • 31 Oct 2023 • Suman Sapkota, Binod Bhattarai
Global Neuron Importance Estimation is used to prune neural networks for efficiency reasons.
no code implementations • 14 Oct 2023 • Jacob Thrasher, Alina Devkota, Prasiddha Siwakotai, Rohit Chivukula, Pranav Poudel, Chaunbo Hu, Binod Bhattarai, Prashnna Gyawali
Recent advancements in multimodal machine learning have empowered the development of accurate and robust AI systems in the medical domain, especially within centralized database systems.
no code implementations • 8 Oct 2023 • Sandesh Pokhrel, Sanjay Bhandari, Eduard Vazquez, Yash Raj Shrestha, Binod Bhattarai
Coronary Artery Diseases(CADs) though preventable are one of the leading causes of death and disability.
no code implementations • 7 Oct 2023 • Sandesh Pokhrel, Sanjay Bhandari, Eduard Vazquez, Yash Raj Shrestha, Binod Bhattarai
Coronary Artery Diseases although preventable are one of the leading cause of mortality worldwide.
1 code implementation • 2 Oct 2023 • Anita Rau, Binod Bhattarai, Lourdes Agapito, Danail Stoyanov
Colorectal cancer remains one of the deadliest cancers in the world.
1 code implementation • 8 Aug 2023 • Bidur Khanal, Binod Bhattarai, Bishesh Khanal, Cristian A. Linte
In this work, we explore contrastive and pretext task-based self-supervised pretraining to initialize the weights of a deep learning classification model for two medical datasets with self-induced noisy labels -- NCT-CRC-HE-100K tissue histological images and COVID-QU-Ex chest X-ray images.
no code implementations • 22 Jun 2023 • Suman Sapkota, Pranav Poudel, Sudarshan Regmi, Bibek Panthi, Binod Bhattarai
In this study, we show an application of neural network pruning in polyp segmentation.
1 code implementation • 21 Jun 2023 • Bidur Khanal, Binod Bhattarai, Bishesh Khanal, Danail Stoyanov, Cristian A. Linte
Acquiring properly annotated data is expensive in the medical field as it requires experts, time-consuming protocols, and rigorous validation.
no code implementations • 14 Jun 2023 • Ronast Subedi, Rebati Raman Gaire, Sharib Ali, Anh Nguyen, Danail Stoyanov, Binod Bhattarai
This paper presents a solution to the cross-domain adaptation problem for 2D surgical image segmentation, explicitly considering the privacy protection of distributed datasets belonging to different centers.
no code implementations • 28 May 2023 • Sudarshan Regmi, Bibek Panthi, Sakar Dotel, Prashnna K. Gyawali, Danail Stoyanov, Binod Bhattarai
Indeed, the naive incorporation of feature normalization within neural networks does not guarantee substantial improvement in OOD detection performance.
no code implementations • 10 May 2023 • Rumeysa Bodur, Erhan Gundogdu, Binod Bhattarai, Tae-Kyun Kim, Michael Donoser, Loris Bazzani
We propose a novel learning method for text-guided image editing, namely \texttt{iEdit}, that generates images conditioned on a source image and a textual edit prompt.
no code implementations • 16 Apr 2023 • Jingxuan Kang, Tudor Jianu, Baoru Huang, Binod Bhattarai, Ngan Le, Frans Coenen, Anh Nguyen
In this paper, we propose a new method to translate simulation images from an endovascular simulator to X-ray images.
no code implementations • CVPR 2023 • Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Sharib Ali, Vincent Andrearczyk, Marc Aubreville, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano, Jorge Bernal, Sebastian Bodenstedt, Alessandro Casella, Veronika Cheplygina, Marie Daum, Marleen de Bruijne, Adrien Depeursinge, Reuben Dorent, Jan Egger, David G. Ellis, Sandy Engelhardt, Melanie Ganz, Noha Ghatwary, Gabriel Girard, Patrick Godau, Anubha Gupta, Lasse Hansen, Kanako Harada, Mattias Heinrich, Nicholas Heller, Alessa Hering, Arnaud Huaulmé, Pierre Jannin, Ali Emre Kavur, Oldřich Kodym, Michal Kozubek, Jianning Li, Hongwei Li, Jun Ma, Carlos Martín-Isla, Bjoern Menze, Alison Noble, Valentin Oreiller, Nicolas Padoy, Sarthak Pati, Kelly Payette, Tim Rädsch, Jonathan Rafael-Patiño, Vivek Singh Bawa, Stefanie Speidel, Carole H. Sudre, Kimberlin Van Wijnen, Martin Wagner, Donglai Wei, Amine Yamlahi, Moi Hoon Yap, Chun Yuan, Maximilian Zenk, Aneeq Zia, David Zimmerer, Dogu Baran Aydogan, Binod Bhattarai, Louise Bloch, Raphael Brüngel, Jihoon Cho, Chanyeol Choi, Qi Dou, Ivan Ezhov, Christoph M. Friedrich, Clifton Fuller, Rebati Raman Gaire, Adrian Galdran, Álvaro García Faura, Maria Grammatikopoulou, SeulGi Hong, Mostafa Jahanifar, Ikbeom Jang, Abdolrahim Kadkhodamohammadi, Inha Kang, Florian Kofler, Satoshi Kondo, Hugo Kuijf, Mingxing Li, Minh Huan Luu, Tomaž Martinčič, Pedro Morais, Mohamed A. Naser, Bruno Oliveira, David Owen, Subeen Pang, Jinah Park, Sung-Hong Park, Szymon Płotka, Elodie Puybareau, Nasir Rajpoot, Kanghyun Ryu, Numan Saeed, Adam Shephard, Pengcheng Shi, Dejan Štepec, Ronast Subedi, Guillaume Tochon, Helena R. Torres, Helene Urien, João L. Vilaça, Kareem Abdul Wahid, Haojie Wang, Jiacheng Wang, Liansheng Wang, Xiyue Wang, Benedikt Wiestler, Marek Wodzinski, Fangfang Xia, Juanying Xie, Zhiwei Xiong, Sen yang, Yanwu Yang, Zixuan Zhao, Klaus Maier-Hein, Paul F. Jäger, Annette Kopp-Schneider, Lena Maier-Hein
The "typical" lead of a winning team is a computer scientist with a doctoral degree, five years of experience in biomedical image analysis, and four years of experience in deep learning.
2 code implementations • 13 Feb 2023 • Chinedu Innocent Nwoye, Tong Yu, Saurav Sharma, Aditya Murali, Deepak Alapatt, Armine Vardazaryan, Kun Yuan, Jonas Hajek, Wolfgang Reiter, Amine Yamlahi, Finn-Henri Smidt, Xiaoyang Zou, Guoyan Zheng, Bruno Oliveira, Helena R. Torres, Satoshi Kondo, Satoshi Kasai, Felix Holm, Ege Özsoy, Shuangchun Gui, Han Li, Sista Raviteja, Rachana Sathish, Pranav Poudel, Binod Bhattarai, Ziheng Wang, Guo Rui, Melanie Schellenberg, João L. Vilaça, Tobias Czempiel, Zhenkun Wang, Debdoot Sheet, Shrawan Kumar Thapa, Max Berniker, Patrick Godau, Pedro Morais, Sudarshan Regmi, Thuy Nuong Tran, Jaime Fonseca, Jan-Hinrich Nölke, Estevão Lima, Eduard Vazquez, Lena Maier-Hein, Nassir Navab, Pietro Mascagni, Barbara Seeliger, Cristians Gonzalez, Didier Mutter, Nicolas Padoy
This paper presents the CholecTriplet2022 challenge, which extends surgical action triplet modeling from recognition to detection.
Ranked #1 on Action Triplet Detection on CholecT50 (Challenge)
1 code implementation • 4 Jan 2023 • Razvan Caramalau, Binod Bhattarai, Danail Stoyanov, Tae-Kyun Kim
We present MoBYv2AL, a novel self-supervised active learning framework for image classification.
no code implementations • 7 Nov 2022 • Abhinav Joshi, Naman Gupta, Jinang Shah, Binod Bhattarai, Ashutosh Modi, Danail Stoyanov
In order to process the multimodal information automatically and use it for an end application, Multimodal Representation Learning (MRL) has emerged as an active area of research in recent times.
1 code implementation • 10 Jul 2022 • Suman Sapkota, Binod Bhattarai
Network Morphism based Neural Architecture Search (NAS) is one of the most efficient methods, however, knowing where and when to add new neurons or remove dis-functional ones is generally left to black-box Reinforcement Learning models.
1 code implementation • 24 Jun 2022 • Sophia Bano, Alessandro Casella, Francisco Vasconcelos, Abdul Qayyum, Abdesslam Benzinou, Moona Mazher, Fabrice Meriaudeau, Chiara Lena, Ilaria Anita Cintorrino, Gaia Romana De Paolis, Jessica Biagioli, Daria Grechishnikova, Jing Jiao, Bizhe Bai, Yanyan Qiao, Binod Bhattarai, Rebati Raman Gaire, Ronast Subedi, Eduard Vazquez, Szymon Płotka, Aneta Lisowska, Arkadiusz Sitek, George Attilakos, Ruwan Wimalasundera, Anna L David, Dario Paladini, Jan Deprest, Elena De Momi, Leonardo S Mattos, Sara Moccia, Danail Stoyanov
For this challenge, we released a dataset of 2060 images, pixel-annotated for vessels, tool, fetus and background classes, from 18 in-vivo TTTS fetoscopy procedures and 18 short video clips.
1 code implementation • 11 Apr 2022 • Anita Rau, Binod Bhattarai, Lourdes Agapito, Danail Stoyanov
Deducing the 3D structure of endoscopic scenes from images is exceedingly challenging.
1 code implementation • 7 Apr 2022 • Shrawan Kumar Thapa, Pranav Poudel, Binod Bhattarai, Danail Stoyanov
Semantic segmentation of polyps and depth estimation are two important research problems in endoscopic image analysis.
1 code implementation • 2 Apr 2022 • Binod Bhattarai, Ronast Subedi, Rebati Raman Gaire, Eduard Vazquez, Danail Stoyanov
We employed our method on two powerful and widely used semantic segmentation networks: UNet and U2Net to train in a multi-task setup.
no code implementations • 8 Dec 2021 • Rumeysa Bodur, Binod Bhattarai, Tae-Kyun Kim
Recently, hierarchical networks that consist of both a global network dealing with the whole image and multiple local networks focusing on local parts are showing success.
no code implementations • 17 Nov 2021 • Jiaze Sun, Binod Bhattarai, Zhixiang Chen, Tae-Kyun Kim
Whilst both branches are required during training, the RGB branch is our primary network and the semantic branch is not needed for inference.
no code implementations • 7 Sep 2021 • Suman Sapkota, Manish Juneja, Laurynas Keleras, Pranav Kotwal, Binod Bhattarai
In this paper we present our solution to extract albedo of branded labels for e-commerce products.
1 code implementation • 16 Jun 2021 • Suman Sapkota, Binod Bhattarai
In the experiments section, we use our methods for classification tasks using an ensemble of 1-vs-all models as well as using a single multiclass model on larger-scale datasets.
1 code implementation • 7 Jun 2021 • Razvan Caramalau, Binod Bhattarai, Tae-Kyun Kim
In this paper, we present a novel pipeline for pool-based Active Learning.
no code implementations • 12 May 2021 • Suman Sapkota, Bidur Khanal, Binod Bhattarai, Bishesh Khanal, Tae-Kyun Kim
Multi-domain image-to-image translation with conditional Generative Adversarial Networks (GANs) can generate highly photo realistic images with desired target classes, yet these synthetic images have not always been helpful to improve downstream supervised tasks such as image classification.
1 code implementation • 1 Oct 2020 • Razvan Caramalau, Binod Bhattarai, Tae-Kyun Kim
We propose a Bayesian approximation to a deep learning architecture for 3D hand pose estimation.
no code implementations • 30 Sep 2020 • Rumeysa Bodur, Binod Bhattarai, Tae-Kyun Kim
We utilise this dataset to minimise the novel depth consistency loss via adversarial learning (note we do not have ground truth depth maps for generated face images) and the depth categorical loss of synthetic data on the discriminator.
1 code implementation • CVPR 2021 • Razvan Caramalau, Binod Bhattarai, Tae-Kyun Kim
We flip the label of newly queried nodes from unlabelled to labelled, re-train the learner to optimise the downstream task and the graph to minimise its modified objective.
Ranked #4 on Active Learning on CIFAR10 (10,000)
no code implementations • 11 Jun 2020 • Jiaze Sun, Binod Bhattarai, Tae-Kyun Kim
We perform augmentation by randomly sampling sensible labels from the label space of the few labelled examples available and assigning them as target labels to the abundant unlabelled examples from the same distribution as that of the labelled ones.
no code implementations • 23 Mar 2020 • Sharib Ali, Binod Bhattarai, Tae-Kyun Kim, Jens Rittscher
In this work, we propose to use a few-shot learning approach that requires less training data and can be used to predict label classes of test samples from an unseen dataset.
no code implementations • ECCV 2020 • Binod Bhattarai, Tae-Kyun Kim
Existing conditional GANs commonly encode target domain label information as hard-coded categorical vectors in the form of 0s and 1s.
no code implementations • 10 Sep 2019 • Binod Bhattarai, Seungryul Baek, Rumeysa Bodur, Tae-Kyun Kim
Unlike previous studies of randomly augmenting the synthetic data with real data, we present our simple, effective and easy to implement synthetic data sampling methods to train deep CNNs more efficiently and accurately.
no code implementations • 15 Jul 2019 • Binod Bhattarai, Rumeysa Bodur, Tae-Kyun Kim
Augmenting data in image space (eg.
1 code implementation • ECCV 2018 • Baris Gecer, Binod Bhattarai, Josef Kittler, Tae-Kyun Kim
We propose a novel end-to-end semi-supervised adversarial framework to generate photorealistic face images of new identities with wide ranges of expressions, poses, and illuminations conditioned by a 3D morphable model.
Ranked #16 on Face Verification on IJB-A
no code implementations • 1 Nov 2016 • Binod Bhattarai, Gaurav Sharma, Frederic Jurie
The challenge addressed in this paper is to design a common universal representation such that a single merged signature is transmitted to the server, whatever be the type and number of features computed by the client, ensuring nonetheless an optimal performance.
no code implementations • CVPR 2016 • Binod Bhattarai, Gaurav Sharma, Frederic Jurie
The experiments clearly demonstrate the scalability and improved performance of the proposed method on the tasks of identity and age based face image retrieval compared to competitive existing methods, on the standard datasets and with the presence of a million distractor face images.