no code implementations • 26 Feb 2024 • Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Mostofa Patwary, Sandeep Subramanian, Dan Su, Chen Zhu, Deepak Narayanan, Aastha Jhunjhunwala, Ayush Dattagupta, Vibhu Jawa, Jiwei Liu, Ameya Mahabaleshwarkar, Osvald Nitski, Annika Brundyn, James Maki, Miguel Martinez, Jiaxuan You, John Kamalu, Patrick Legresley, Denys Fridman, Jared Casper, Ashwath Aithal, Oleksii Kuchaiev, Mohammad Shoeybi, Jonathan Cohen, Bryan Catanzaro
We introduce Nemotron-4 15B, a 15-billion-parameter large multilingual language model trained on 8 trillion text tokens.
1 code implementation • 16 Oct 2023 • Traian Rebedea, Razvan Dinu, Makesh Sreedhar, Christopher Parisien, Jonathan Cohen
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
no code implementations • 19 Apr 2023 • Simon Segert, Jonathan Cohen
The ability to learn and predict simple functions is a key aspect of human intelligence.
1 code implementation • 1 Apr 2023 • Awni Altabaa, Taylor Webb, Jonathan Cohen, John Lafferty
An extension of Transformers is proposed that enables explicit relational reasoning through a novel module called the Abstractor.
no code implementations • 29 Sep 2021 • Shanka Subhra Mondal, Zack Dulberg, Jonathan Cohen
Humans understand a set of canonical geometric transformations (such as translation, rotation and scaling) that support generalization by being untethered to any specific object.
1 code implementation • 21 May 2021 • Zack Dulberg, Taylor Webb, Jonathan Cohen
Learning to count is an important example of the broader human capacity for systematic generalization, and the development of counting is often characterized by an inflection point when children rapidly acquire proficiency with the procedures that support this ability.
no code implementations • 17 Nov 2020 • Zachary Dulberg, Jonathan Cohen
Humans understand a set of canonical geometric transformations (such as translation and rotation) that support generalization by being untethered to any specific object.
no code implementations • 6 Mar 2020 • Cetin Savkli, Catherine Schwartz, Amanda Galante, Jonathan Cohen
We present a new metric of link cohesion for measuring the strength of edges in complex, highly connected graphs.
no code implementations • 29 Mar 2019 • David Dov, Shahar Kovalsky, Jonathan Cohen, Danielle Range, Ricardo Henao, Lawrence Carin
We consider preoperative prediction of thyroid cancer based on ultra-high-resolution whole-slide cytopathology images.
3 code implementations • 3 Oct 2014 • Sharan Chetlur, Cliff Woolley, Philippe Vandermersch, Jonathan Cohen, John Tran, Bryan Catanzaro, Evan Shelhamer
To address this problem, we have created a library similar in intent to BLAS, with optimized routines for deep learning workloads.