Search Results for author: Maximilian Golub

Found 5 papers, 2 papers with code

Shaping Human-AI Collaboration: Varied Scaffolding Levels in Co-writing with Language Models

no code implementations • 18 Feb 2024 • Paramveer S. Dhillon, Somayeh Molaei, Jiaqi Li, Maximilian Golub, Shaochun Zheng, Lionel P. Robert

Advances in language modeling have paved the way for novel human-AI co-writing experiences.

Language Modelling Sentence

Paper
Add Code

Microscaling Data Formats for Deep Learning

1 code implementation • 16 Oct 2023 • Bita Darvish Rouhani, Ritchie Zhao, Ankit More, Mathew Hall, Alireza Khodamoradi, Summer Deng, Dhruv Choudhary, Marius Cornea, Eric Dellinger, Kristof Denolf, Stosic Dusan, Venmugil Elango, Maximilian Golub, Alexander Heinecke, Phil James-Roxby, Dharmesh Jani, Gaurav Kolhe, Martin Langhammer, Ada Li, Levi Melnick, Maral Mesmakhosroshahi, Andres Rodriguez, Michael Schulte, Rasoul Shafipour, Lei Shao, Michael Siu, Pradeep Dubey, Paulius Micikevicius, Maxim Naumov, Colin Verrilli, Ralph Wittig, Doug Burger, Eric Chung

Narrow bit-width data formats are key to reducing the computational and storage costs of modern deep learning applications.

Friction

103

Paper
Code

With Shared Microexponents, A Little Shifting Goes a Long Way

no code implementations • 16 Feb 2023 • Bita Rouhani, Ritchie Zhao, Venmugil Elango, Rasoul Shafipour, Mathew Hall, Maral Mesmakhosroshahi, Ankit More, Levi Melnick, Maximilian Golub, Girish Varatkar, Lei Shao, Gaurav Kolhe, Dimitry Melts, Jasmine Klar, Renee L'Heureux, Matt Perry, Doug Burger, Eric Chung, Zhaoxia Deng, Sam Naghshineh, Jongsoo Park, Maxim Naumov

This paper introduces Block Data Representations (BDR), a framework for exploring and evaluating a wide spectrum of narrow-precision formats for deep learning.

Quantization Recommendation Systems

Paper
Add Code

Procrustes: a Dataflow and Accelerator for Sparse Deep Neural Network Training

no code implementations • 23 Sep 2020 • Dingqing Yang, Amin Ghasemazar, Xiaowei Ren, Maximilian Golub, Guy Lemieux, Mieszko Lis

The success of DNN pruning has led to the development of energy-efficient inference accelerators that support pruned models with sparse weight and activation tensors.

Paper
Add Code

Full deep neural network training on a pruned weight budget

1 code implementation • 11 Jun 2018 • Maximilian Golub, Guy Lemieux, Mieszko Lis

We introduce a DNN training technique that learns only a fraction of the full parameter set without incurring an accuracy penalty.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.