Search Results for author: Tomer Lancewicki

Found 6 papers, 2 papers with code

ITEm: Unsupervised Image-Text Embedding Learning for eCommerce

no code implementations • 22 Oct 2023 • Baohao Liao, Michael Kozielski, Sanjika Hewavitharana, Jiangbo Yuan, Shahram Khadivi, Tomer Lancewicki

How to teach a model to learn embedding from different modalities without neglecting information from the less dominant modality is challenging.

Multi-armed bandits for resource efficient, online optimization of language model pre-training: the use case of dynamic masking

1 code implementation • 24 Mar 2022 • Iñigo Urteaga, Moulay-Zaïdane Draïdia, Tomer Lancewicki, Shahram Khadivi

We propose a multi-armed bandit framework for the sequential selection of TLM pre-training hyperparameters, aimed at optimizing language model performance, in a resource efficient manner.

Bayesian Optimization, Decision Making (+3)

Deploying a BERT-based Query-Title Relevance Classifier in a Production System: a View from the Trenches

no code implementations • 23 Aug 2021 • Leonard Dahlmann, Tomer Lancewicki

We successfully optimize a Query-Title Relevance (QTR) classifier for deployment via a compact model, which we name BERT Bidirectional Long Short-Term Memory (BertBiLSTM).

Data Augmentation, Knowledge Distillation (+5)

Automatic and Simultaneous Adjustment of Learning Rate and Momentum for Stochastic Gradient Descent

1 code implementation • 20 Aug 2019 • Tomer Lancewicki, Selcuk Kopru

Stochastic Gradient Descent (SGD) methods are prominent for training machine learning and deep learning models.
