Search Results for author: Jonathan Cohen

Found 10 papers, 4 papers with code

Nemotron-4 15B Technical Report

no code implementations • 26 Feb 2024 • Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Mostofa Patwary, Sandeep Subramanian, Dan Su, Chen Zhu, Deepak Narayanan, Aastha Jhunjhunwala, Ayush Dattagupta, Vibhu Jawa, Jiwei Liu, Ameya Mahabaleshwarkar, Osvald Nitski, Annika Brundyn, James Maki, Miguel Martinez, Jiaxuan You, John Kamalu, Patrick Legresley, Denys Fridman, Jared Casper, Ashwath Aithal, Oleksii Kuchaiev, Mohammad Shoeybi, Jonathan Cohen, Bryan Catanzaro

We introduce Nemotron-4 15B, a 15-billion-parameter large multilingual language model trained on 8 trillion text tokens.

Language Modelling

NeMo Guardrails: A Toolkit for Controllable and Safe LLM Applications with Programmable Rails

1 code implementation • 16 Oct 2023 • Traian Rebedea, Razvan Dinu, Makesh Sreedhar, Christopher Parisien, Jonathan Cohen

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Dialogue Management • Management

3,427 GitHub stars

Beyond Transformers for Function Learning

no code implementations • 19 Apr 2023 • Simon Segert, Jonathan Cohen

The ability to learn and predict simple functions is a key aspect of human intelligence.

Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in Transformers

1 code implementation • 1 Apr 2023 • Awni Altabaa, Taylor Webb, Jonathan Cohen, John Lafferty

An extension of Transformers is proposed that enables explicit relational reasoning through a novel module called the Abstractor.

Inductive Bias • Relational Reasoning

Generalization to Out-of-Distribution transformations

no code implementations • 29 Sep 2021 • Shanka Subhra Mondal, Zack Dulberg, Jonathan Cohen

Humans understand a set of canonical geometric transformations (such as translation, rotation and scaling) that support generalization by being untethered to any specific object.

Translation

Modelling the development of counting with memory-augmented neural networks

1 code implementation • 21 May 2021 • Zack Dulberg, Taylor Webb, Jonathan Cohen

Learning to count is an important example of the broader human capacity for systematic generalization, and the development of counting is often characterized by an inflection point when children rapidly acquire proficiency with the procedures that support this ability.

Systematic Generalization

Learning Canonical Transformations

no code implementations • 17 Nov 2020 • Zachary Dulberg, Jonathan Cohen

Humans understand a set of canonical geometric transformations (such as translation and rotation) that support generalization by being untethered to any specific object.

Translation

Novel Edge and Density Metrics for Link Cohesion

no code implementations • 6 Mar 2020 • Cetin Savkli, Catherine Schwartz, Amanda Galante, Jonathan Cohen

We present a new metric of link cohesion for measuring the strength of edges in complex, highly connected graphs.

Community Detection

Thyroid Cancer Malignancy Prediction From Whole Slide Cytopathology Images

no code implementations • 29 Mar 2019 • David Dov, Shahar Kovalsky, Jonathan Cohen, Danielle Range, Ricardo Henao, Lawrence Carin

We consider preoperative prediction of thyroid cancer based on ultra-high-resolution whole-slide cytopathology images.

Multiple Instance Learning

cuDNN: Efficient Primitives for Deep Learning

3 code implementations • 3 Oct 2014 • Sharan Chetlur, Cliff Woolley, Philippe Vandermersch, Jonathan Cohen, John Tran, Bryan Catanzaro, Evan Shelhamer

To address this problem, we have created a library similar in intent to BLAS, with optimized routines for deep learning workloads.
