Search Results for author: Shervin Minaee

Found 50 papers, 9 papers with code

Large Language Models: A Survey

no code implementations9 Feb 2024 Shervin Minaee, Tomas Mikolov, Narjes Nikzad, Meysam Chenaghlu, Richard Socher, Xavier Amatriain, Jianfeng Gao

Large Language Models (LLMs) have drawn a lot of attention due to their strong performance on a wide range of natural language tasks, since the release of ChatGPT in November 2022.

Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning

1 code implementation CVPR 2022 Ligong Han, Jian Ren, Hsin-Ying Lee, Francesco Barbieri, Kyle Olszewski, Shervin Minaee, Dimitris Metaxas, Sergey Tulyakov

In addition, our model can extract visual information as suggested by the text prompt, e. g., "an object in image one is moving northeast", and generate corresponding videos.

Self-Learning Text Augmentation +1

Modern Augmented Reality: Applications, Trends, and Future Directions

no code implementations18 Feb 2022 Shervin Minaee, Xiaodan Liang, Shuicheng Yan

Augmented reality (AR) is one of the relatively old, yet trending areas in the intersection of computer vision and computer graphics with numerous applications in several areas, from gaming and entertainment, to education and healthcare.

Going Deeper Into Face Detection: A Survey

no code implementations27 Mar 2021 Shervin Minaee, Ping Luo, Zhe Lin, Kevin Bowyer

In this work, we provide a detailed overview of some of the most representative deep learning based face detection methods by grouping them into a few major categories, and present their core architectural designs and accuracies on popular benchmarks.

Face Detection Image Classification

Age and Gender Prediction From Face Images Using Attentional Convolutional Network

no code implementations8 Oct 2020 Amirali Abdolrashidi, Mehdi Minaei, Elham Azimi, Shervin Minaee

We train our model in a multi-task learning fashion, and augment the feature embedding of the age classifier, with the predicted gender, and show that doing so can further increase the accuracy of age prediction.

Gender Prediction Multi-Task Learning

COVID CT-Net: Predicting Covid-19 From Chest CT Images Using Attentional Convolutional Network

no code implementations10 Sep 2020 Shakib Yazdani, Shervin Minaee, Rahele Kafieh, Narges Saeedizadeh, Milan Sonka

We also provide a visualization of the attention maps of the model for several test images, and show that our model is attending to the infected regions as intended.

Computed Tomography (CT) Specificity

COVID TV-UNet: Segmenting COVID-19 Chest CT Images Using Connectivity Imposed U-Net

1 code implementation24 Jul 2020 Narges Saeedizadeh, Shervin Minaee, Rahele Kafieh, Shakib Yazdani, Milan Sonka

Through experimental results on a relatively large-scale CT segmentation dataset of around 900 images, we show that adding this new regularization term leads to 2\% gain on overall segmentation performance compared to the U-Net model.

Computed Tomography (CT) Segmentation

Deep Learning Based Text Classification: A Comprehensive Review

2 code implementations6 Apr 2020 Shervin Minaee, Nal Kalchbrenner, Erik Cambria, Narjes Nikzad, Meysam Chenaghlu, Jianfeng Gao

Deep learning based models have surpassed classical machine learning based approaches in various text classification tasks, including sentiment analysis, news categorization, question answering, and natural language inference.

BIG-bench Machine Learning General Classification +5

Palm-GAN: Generating Realistic Palmprint Images Using Total-Variation Regularized GAN

no code implementations21 Mar 2020 Shervin Minaee, Mehdi Minaei, Amirali Abdolrashidi

We apply this framework to a popular palmprint databases, and generate images which look very realistic, and similar to the samples in this database.

Regularized Submodular Maximization at Scale

no code implementations10 Feb 2020 Ehsan Kazemi, Shervin Minaee, Moran Feldman, Amin Karbasi

In this paper, we propose scalable methods for maximizing a regularized submodular function $f = g - \ell$ expressed as the difference between a monotone submodular function $g$ and a modular function $\ell$.

Data Summarization Point Processes +1

Image Segmentation Using Deep Learning: A Survey

2 code implementations15 Jan 2020 Shervin Minaee, Yuri Boykov, Fatih Porikli, Antonio Plaza, Nasser Kehtarnavaz, Demetri Terzopoulos

Image segmentation is a key topic in image processing and computer vision with applications such as scene understanding, medical image analysis, robotic perception, video surveillance, augmented reality, and image compression, among many others.

Image Compression Image Segmentation +3

Biometrics Recognition Using Deep Learning: A Survey

1 code implementation30 Nov 2019 Shervin Minaee, Amirali Abdolrashidi, Hang Su, Mohammed Bennamoun, David Zhang

Deep learning-based models have been very successful in achieving state-of-the-art results in many of the computer vision, speech recognition, and natural language processing tasks in the last few years.

Gait Recognition speech-recognition +1

Masked-RPCA: Sparse and Low-rank Decomposition Under Overlaying Model and Application to Moving Object Detection

no code implementations17 Sep 2019 Amirhossein Khalilian-Gourtani, Shervin Minaee, Yao Wang

Robust Principal Component Analysis (RPCA) performs low-rank and sparse decomposition and accomplishes such a task when the background is stationary and the foreground is dynamic and relatively small.

Moving Object Detection object-detection

FingerNet: Pushing The Limits of Fingerprint Recognition Using Convolutional Neural Network

no code implementations28 Jul 2019 Shervin Minaee, Elham Azimi, Amirali Abdolrashidi

Fingerprint recognition has been utilized for cellphone authentication, airport security and beyond.

DeepIris: Iris Recognition Using A Deep Learning Approach

1 code implementation22 Jul 2019 Shervin Minaee, Amirali Abdolrashidi

Iris recognition has been an active research area during last few decades, because of its wide applications in security, from airports to homeland security border control.

Iris Recognition

Deep-Sentiment: Sentiment Analysis Using Ensemble of CNN and Bi-LSTM Models

no code implementations8 Apr 2019 Shervin Minaee, Elham Azimi, Amirali Abdolrashidi

With the popularity of social networks, and e-commerce websites, sentiment analysis has become a more active area of research in the past few years.

Sentiment Analysis

Finger-GAN: Generating Realistic Fingerprint Images Using Connectivity Imposed GAN

no code implementations25 Dec 2018 Shervin Minaee, Amirali Abdolrashidi

Through experimental results, we show that the generated fingerprint images have a good diversity, and are able to capture different parts of the prior distribution.

Efficient Super Resolution For Large-Scale Images Using Attentional GAN

no code implementations12 Dec 2018 Harsh Nilesh Pathak, Xinxin Li, Shervin Minaee, Brooke Cowan

At Expedia Group, we were tasked with generating images of at least 2000px for display on the website, four times greater than the sizes typically reported in the literature.

Generative Adversarial Network Image Super-Resolution +1

MTBI Identification From Diffusion MR Images Using Bag of Adversarial Visual Features

no code implementations27 Jun 2018 Shervin Minaee, Yao Wang, Alp Aygar, Sohae Chung, Xiuyuan Wang, Yvonne W. Lui, Els Fieremans, Steven Flanagan, Joseph Rath

Unlike most of previous works, which use hand-crafted features extracted from different parts of brain for MTBI classification, we employ feature learning algorithms to learn more discriminative representation for this task.

Ad-Net: Audio-Visual Convolutional Neural Network for Advertisement Detection In Videos

no code implementations22 Jun 2018 Shervin Minaee, Imed Bouazizi, Prakash Kolan, Hossein Najafzadeh

Personalized advertisement is a crucial task for many of the online businesses and video broadcasters.

Image Segmentation Using Subspace Representation and Sparse Decomposition

no code implementations6 Apr 2018 Shervin Minaee

In this dissertation, we focus on the extraction of text and graphics in mixed-content images, and design novel approaches for various aspects of this problem.

Image Segmentation Motion Segmentation +2

A Deep Unsupervised Learning Approach Toward MTBI Identification Using Diffusion MRI

no code implementations8 Feb 2018 Shervin Minaee, Yao Wang, Anna Choromanska, Sohae Chung, Xiuyuan Wang, Els Fieremans, Steven Flanagan, Joseph Rath, Yvonne W. Lui

Mild traumatic brain injury is a growing public health problem with an estimated incidence of over 1. 7 million people annually in US.

Identifying Mild Traumatic Brain Injury Patients From MR Images Using Bag of Visual Words

no code implementations18 Oct 2017 Shervin Minaee, Siyun Wang, Yao Wang, Sohae Chung, Xiuyuan Wang, Els Fieremans, Steven Flanagan, Joseph Rath, Yvonne W. Lui

Mild traumatic brain injury (mTBI) is a growing public health problem with an estimated incidence of one million people annually in US.

feature selection

Automatic Question-Answering Using A Deep Similarity Neural Network

no code implementations5 Aug 2017 Shervin Minaee, Zhu Liu

We first train this model on a large-scale public question-answering database, and then fine-tune it to transfer to the customer-care chat data.

Question Answering

Text Extraction From Texture Images Using Masked Signal Decomposition

no code implementations11 Jun 2017 Shervin Minaee, Yao Wang

Text extraction is an important problem in image processing with applications from optical character recognition to autonomous driving.

Autonomous Driving Optical Character Recognition +2

An ADMM Approach to Masked Signal Decomposition Using Subspace Representation

no code implementations25 Apr 2017 Shervin Minaee, Yao Wang

Signal decomposition is a classical problem in signal processing, which aims to separate an observed signal into two or more components each with its own property.

Subspace Learning in The Presence of Sparse Structured Outliers and Noise

no code implementations14 Mar 2017 Shervin Minaee, Yao Wang

Subspace learning is an important problem, which has many applications in image and video processing.

Clustering Image Segmentation +2

Image Segmentation Using Overlapping Group Sparsity

no code implementations23 Nov 2016 Shervin Minaee, Yao Wang

Sparse decomposition has been widely used for different applications, such as source separation, image classification and image denoising.

Clustering Image Classification +4

Image Decomposition Using a Robust Regression Approach

no code implementations13 Sep 2016 Shervin Minaee, Yao Wang

This paper considers how to separate text and/or graphics from smooth background in screen content and mixed content images and proposes an algorithm to perform this segmentation task.

Clustering Foreground Segmentation +2

Screen Content Image Segmentation Using Robust Regression and Sparse Decomposition

no code implementations8 Jul 2016 Shervin Minaee, Yao Wang

This paper considers how to separate text and/or graphics from smooth background in screen content and mixed document images and proposes two approaches to perform this segmentation task.

Image Segmentation Medical Image Segmentation +2

Palmprint Recognition Using Deep Scattering Convolutional Network

no code implementations30 Mar 2016 Shervin Minaee, Yao Wang

Many algorithms have been proposed for palmprint recognition in the past, majority of them being based on features extracted from the transform domain.

Translation

Screen Content Image Segmentation Using Sparse Decomposition and Total Variation Minimization

no code implementations7 Feb 2016 Shervin Minaee, Yao Wang

Sparse decomposition has been widely used for different applications, such as source separation, image classification, image denoising and more.

Clustering Image Classification +3

Fingerprint Recognition Using Translation Invariant Scattering Network

no code implementations11 Sep 2015 Shervin Minaee, Yao Wang

Different features and algorithms have been used for fingerprint recognition in the past.

Template Matching Translation

Screen Content Image Segmentation Using Least Absolute Deviation Fitting

no code implementations15 Jan 2015 Shervin Minaee, Yao Wang

The proposed method is designed based on the assumption that the background part of the image is smoothly varying and can be represented by a linear combination of a few smoothly varying basis functions, while the foreground text and graphics create sharp discontinuity and cannot be modeled by this smooth representation.

Clustering Foreground Segmentation +3

Multispectral Palmprint Recognition Using Textural Features

no code implementations28 Aug 2014 Shervin Minaee, Amirali Abdolrashidi

In order to utilize identification to the best extent, we need robust and fast algorithms and systems to process the data.

Highly Accurate Multispectral Palmprint Recognition Using Statistical and Wavelet Features

no code implementations16 Aug 2014 Shervin Minaee, Amirali Abdolrashidi

Palmprint is one of the most useful physiological biometrics that can be used as a powerful means in personal recognition systems.

General Classification

Multispectral Palmprint Recognition Using a Hybrid Feature

no code implementations27 Dec 2011 Sina Akbari Mistani, Shervin Minaee, Emad Fatemizadeh

Personal identification problem has been a major field of research in recent years.

A Geometric Approach For Fully Automatic Chromosome Segmentation

2 code implementations18 Dec 2011 Shervin Minaee, Mehran Fotouhi, Babak Hossein Khalaj

The next step is detection of touching and overlapping chromosomes, and the final step is separation of such chromosomes.

Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.