Search Results for author: Jonathan Katzy

Found 4 papers, 2 papers with code

An Exploratory Investigation into Code License Infringements in Large Language Model Training Datasets

1 code implementation22 Mar 2024 Jonathan Katzy, Răzvan-Mihai Popescu, Arie van Deursen, Maliheh Izadi

Based on the findings of our study, which highlights the pervasive issue of license inconsistencies in large language models trained on code, our recommendation for both researchers and the community is to prioritize the development and adoption of best practices for dataset creation and management.

Language Modelling Large Language Model

Language Models for Code Completion: A Practical Evaluation

1 code implementation25 Feb 2024 Maliheh Izadi, Jonathan Katzy, Tim van Dam, Marc Otten, Razvan Mihai Popescu, Arie van Deursen

InCoder outperformed the other models across all programming languages, highlighting the significance of training data and objectives.

Code Completion valid

On the Impact of Language Selection for Training and Evaluating Programming Language Models

no code implementations25 Aug 2023 Jonathan Katzy, Maliheh Izadi, Arie van Deursen

The recent advancements in Transformer-based Language Models have demonstrated significant potential in enhancing the multilingual capabilities of these models.

A Survey on Distributed Machine Learning

no code implementations20 Dec 2019 Joost Verbraeken, Matthijs Wolting, Jonathan Katzy, Jeroen Kloppenburg, Tim Verbelen, Jan S. Rellermeyer

The demand for artificial intelligence has grown significantly over the last decade and this growth has been fueled by advances in machine learning techniques and the ability to leverage hardware acceleration.

BIG-bench Machine Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.