Search Results for author: Jeremy Kepner

Found 35 papers, 3 papers with code

Testing RadiX-Nets: Advances in Viable Sparse Topologies

no code implementations • 6 Nov 2023 • Kevin Kwak, Zack West, Hayden Jananthan, Jeremy Kepner

The exponential growth of data has sparked computational demands on ML research and industry use.

Lincoln AI Computing Survey (LAICS) Update

1 code implementation • 13 Oct 2023 • Albert Reuther, Peter Michaleas, Michael Jones, Vijay Gadepally, Siddharth Samsi, Jeremy Kepner

Finally, a brief description of each of the new accelerators that have been added in the survey this year is included.

From Words to Watts: Benchmarking the Energy Costs of Large Language Model Inference

no code implementations • 4 Oct 2023 • Siddharth Samsi, Dan Zhao, Joseph McDonald, Baolin Li, Adam Michaleas, Michael Jones, William Bergeron, Jeremy Kepner, Devesh Tiwari, Vijay Gadepally

Large language models (LLMs) have exploded in popularity due to their new generative capabilities that go far beyond prior state-of-the-art.

Benchmarking GSM8K +2

Are ChatGPT and Other Similar Systems the Modern Lernaean Hydras of AI?

no code implementations • 15 Jun 2023 • Dimitrios Ioannidis, Jeremy Kepner, Andrew Bowne, Harriet S. Bryant

The rise of Generative Artificial Intelligence systems ("AI systems") has created unprecedented social engagement.

Code Generation

AI Enabled Maneuver Identification via the Maneuver Identification Challenge

no code implementations • 28 Nov 2022 • Kaira Samuel, Matthew LaRosa, Kyle McAlpin, Morgan Schaefer, Brandon Swenson, Devin Wasilefsky, Yan Wu, Dan Zhao, Jeremy Kepner

Artificial intelligence (AI) has enormous potential to improve Air Force pilot training by providing actionable feedback to pilot trainees on the quality of their maneuvers and enabling instructor-less flying familiarization for early-stage trainees in low-cost simulators.

Developing a Series of AI Challenges for the United States Department of the Air Force

1 code implementation • 14 Jul 2022 • Vijay Gadepally, Gregory Angelides, Andrei Barbu, Andrew Bowne, Laura J. Brattain, Tamara Broderick, Armando Cabrera, Glenn Carl, Ronisha Carter, Miriam Cha, Emilie Cowen, Jesse Cummings, Bill Freeman, James Glass, Sam Goldberg, Mark Hamilton, Thomas Heldt, Kuan Wei Huang, Phillip Isola, Boris Katz, Jamie Koerner, Yen-Chen Lin, David Mayo, Kyle McAlpin, Taylor Perron, Jean Piou, Hrishikesh M. Rao, Hayley Reynolds, Kaira Samuel, Siddharth Samsi, Morgan Schmidt, Leslie Shing, Olga Simek, Brandon Swenson, Vivienne Sze, Jonathan Taylor, Paul Tylkin, Mark Veillette, Matthew L Weiss, Allan Wollaber, Sophia Yuditskaya, Jeremy Kepner

Through a series of federal initiatives and orders, the U.S. Government has been making a concerted effort to ensure American leadership in AI.

The MIT Supercloud Workload Classification Challenge

no code implementations • 12 Apr 2022 • Benny J. Tang, Qiqi Chen, Matthew L. Weiss, Nathan Frey, Joseph McDonald, David Bestor, Charles Yee, William Arcand, Chansup Byun, Daniel Edelman, Matthew Hubbell, Michael Jones, Jeremy Kepner, Anna Klein, Adam Michaleas, Peter Michaleas, Lauren Milechin, Julia Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Andrew Bowne, Lindsey McEvoy, Baolin Li, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi

We introduce a labelled dataset that can be used to develop new approaches to workload classification and present initial results based on existing approaches.

Classification

Naming Schema for a Human Brain-Scale Neural Network

no code implementations • 22 Sep 2021 • Morgan Schaefer, Lauren Milechin, Jeremy Kepner

Deep neural networks have become increasingly large and sparse, allowing for the storage of large-scale neural networks with decreased costs of storage and computation.

AI Accelerator Survey and Trends

1 code implementation • 18 Sep 2021 • Albert Reuther, Peter Michaleas, Michael Jones, Vijay Gadepally, Siddharth Samsi, Jeremy Kepner

Over the past several years, new machine learning accelerators were being announced and released every month for a variety of applications from speech recognition, video object detection, assisted driving, and many data center applications.

Benchmarking Computational Efficiency +4

Maneuver Identification Challenge

no code implementations • 25 Aug 2021 • Kaira Samuel, Vijay Gadepally, David Jacobs, Michael Jones, Kyle McAlpin, Kyle Palko, Ben Paulk, Sid Samsi, Ho Chit Siu, Charles Yee, Jeremy Kepner

The Maneuver Identification Challenge hosted at maneuver-id.mit.edu provides thousands of trajectories collected from pilots practicing in flight simulators, descriptions of maneuvers, and examples of these maneuvers performed by experienced pilots.

The MIT Supercloud Dataset

no code implementations • 4 Aug 2021 • Siddharth Samsi, Matthew L Weiss, David Bestor, Baolin Li, Michael Jones, Albert Reuther, Daniel Edelman, William Arcand, Chansup Byun, John Holodnack, Matthew Hubbell, Jeremy Kepner, Anna Klein, Joseph McDonald, Adam Michaleas, Peter Michaleas, Lauren Milechin, Julia Mullen, Charles Yee, Benjamin Price, Andrew Prout, Antonio Rosa, Allan Vanterpool, Lindsey McEvoy, Anson Cheng, Devesh Tiwari, Vijay Gadepally

In this paper we introduce the MIT Supercloud Dataset which aims to foster innovative AI/ML approaches to the analysis of large scale HPC and datacenter/cloud operations.

Scheduling

Mathematics of Digital Hyperspace

no code implementations • 28 Mar 2021 • Jeremy Kepner, Timothy Davis, Vijay Gadepally, Hayden Jananthan, Lauren Milechin

The GraphBLAS standard currently supports hypergraphs, hypersparse matrices, the mathematics required for semilinks, and seamlessly performs graph, network, and matrix operations.

Navigate

Survey of Machine Learning Accelerators

no code implementations • 1 Sep 2020 • Albert Reuther, Peter Michaleas, Michael Jones, Vijay Gadepally, Siddharth Samsi, Jeremy Kepner

New machine learning accelerators are being announced and released each month for a variety of applications from speech recognition, video object detection, assisted driving, and many data center applications.

BIG-bench Machine Learning object-detection +3

Accuracy and Performance Comparison of Video Action Recognition Approaches

no code implementations • 20 Aug 2020 • Matthew Hutchinson, Siddharth Samsi, William Arcand, David Bestor, Bill Bergeron, Chansup Byun, Michael Houle, Matthew Hubbell, Michael Jones, Jeremy Kepner, Andrew Kirby, Peter Michaleas, Lauren Milechin, Julie Mullen, Andrew Prout, Antonio Rosa, Albert Reuther, Charles Yee, Vijay Gadepally

Over the past few years, there has been significant interest in video action recognition systems and models.

Action Recognition Temporal Action Localization

Benchmarking network fabrics for data distributed training of deep neural networks

no code implementations • 18 Aug 2020 • Siddharth Samsi, Andrew Prout, Michael Jones, Andrew Kirby, Bill Arcand, Bill Bergeron, David Bestor, Chansup Byun, Vijay Gadepally, Michael Houle, Matthew Hubbell, Anna Klein, Peter Michaleas, Lauren Milechin, Julie Mullen, Antonio Rosa, Charles Yee, Albert Reuther, Jeremy Kepner

The large computational requirements for training deep models have necessitated the development of new methods for faster training.

Benchmarking BIG-bench Machine Learning

Layer-Parallel Training with GPU Concurrency of Deep Residual Neural Networks via Nonlinear Multigrid

no code implementations • 14 Jul 2020 • Andrew C. Kirby, Siddharth Samsi, Michael Jones, Albert Reuther, Jeremy Kepner, Vijay Gadepally

A Multigrid Full Approximation Storage algorithm for solving Deep Residual Networks is developed to enable neural network parallelized layer-wise training and concurrent computational kernel execution on GPUs.

GraphChallenge.org Sparse Deep Neural Network Performance

no code implementations • 25 Mar 2020 • Jeremy Kepner, Simon Alford, Vijay Gadepally, Michael Jones, Lauren Milechin, Albert Reuther, Ryan Robinett, Sid Samsi

The Sparse Deep Neural Network (DNN) Challenge draws upon prior challenges from machine learning, high performance computing, and visual analytics to create a challenge that is reflective of emerging sparse AI systems.

GraphChallenge.org Triangle Counting Performance

no code implementations • 18 Mar 2020 • Siddharth Samsi, Jeremy Kepner, Vijay Gadepally, Michael Hurley, Michael Jones, Edward Kao, Sanjeev Mohindra, Albert Reuther, Steven Smith, William Song, Diane Staheli, Paul Monticciolo

In 2017, 2018, and 2019 many triangle counting submissions were received from a wide range of authors and organizations.

Distributed, Parallel, and Cluster Computing Performance

Sparse Deep Neural Network Graph Challenge

no code implementations • 2 Sep 2019 • Jeremy Kepner, Simon Alford, Vijay Gadepally, Michael Jones, Lauren Milechin, Ryan Robinett, Sid Samsi

The Sparse DNN Challenge is based on a mathematically well-defined DNN inference computation and can be implemented in any programming environment.

Survey and Benchmarking of Machine Learning Accelerators

no code implementations • 29 Aug 2019 • Albert Reuther, Peter Michaleas, Michael Jones, Vijay Gadepally, Siddharth Samsi, Jeremy Kepner

Advances in multicore processors and accelerators have opened the flood gates to greater exploration and application of machine learning techniques to a variety of applications.

Performance B.8; C.4

Securing HPC using Federated Authentication

no code implementations • 20 Aug 2019 • Andrew Prout, William Arcand, David Bestor, Bill Bergeron, Chansup Byun, Vijay Gadepally, Michael Houle, Matthew Hubbell, Michael Jones, Anna Klein, Peter Michaleas, Lauren Milechin, Julie Mullen, Antonio Rosa, Siddharth Samsi, Charles Yee, Albert Reuther, Jeremy Kepner

Federated authentication can drastically reduce the overhead of basic account maintenance while simultaneously improving overall system security.

Distributed, Parallel, and Cluster Computing Cryptography and Security

Streaming 1.9 Billion Hypersparse Network Updates per Second with D4M

no code implementations • 6 Jul 2019 • Jeremy Kepner, Vijay Gadepally, Lauren Milechin, Siddharth Samsi, William Arcand, David Bestor, William Bergeron, Chansup Byun, Matthew Hubbell, Michael Houle, Michael Jones, Anne Klein, Peter Michaleas, Julie Mullen, Andrew Prout, Antonio Rosa, Charles Yee, Albert Reuther

This work describes the design and performance optimization of an implementation of hierarchical associative arrays that reduces memory pressure and dramatically increases the update rate into an associative array.

AI Enabling Technologies: A Survey

no code implementations • 8 May 2019 • Vijay Gadepally, Justin Goodwin, Jeremy Kepner, Albert Reuther, Hayley Reynolds, Siddharth Samsi, Jonathan Su, David Martinez

Artificial Intelligence (AI) has the opportunity to revolutionize the way the United States Department of Defense (DoD) and Intelligence Community (IC) address the challenges of evolving threats, data deluge, and rapid courses of action.

RadiX-Net: Structured Sparse Matrices for Deep Neural Networks

no code implementations • 30 Apr 2019 • Ryan A. Robinett, Jeremy Kepner

We further present a functional-analytic conjecture based on the longstanding observation that sparse neural network topologies can attain the same expressive power as dense counterparts.

A Billion Updates per Second Using 30,000 Hierarchical In-Memory D4M Databases

no code implementations • 3 Feb 2019 • Jeremy Kepner, Vijay Gadepally, Lauren Milechin, Siddharth Samsi, William Arcand, David Bestor, William Bergeron, Chansup Byun, Matthew Hubbell, Michael Houle, Michael Jones, Anne Klein, Peter Michaleas, Julie Mullen, Andrew Prout, Antonio Rosa, Charles Yee, Albert Reuther

Streaming updates to a large associative array requires a hierarchical implementation to optimize the performance of the memory hierarchy.

Databases Distributed, Parallel, and Cluster Computing Data Structures and Algorithms Networking and Internet Architecture

Training Behavior of Sparse Neural Network Topologies

no code implementations • 30 Sep 2018 • Simon Alford, Ryan Robinett, Lauren Milechin, Jeremy Kepner

We test pruning-based topologies, which are derived from an initially dense network whose connections are pruned, as well as RadiX-Nets, a class of network topologies with proven connectivity and sparsity properties.

Uncertainty Propagation in Deep Neural Networks Using Extended Kalman Filtering

no code implementations • 17 Sep 2018 • Jessica S. Titensky, Hayden Jananthan, Jeremy Kepner

Extended Kalman Filtering (EKF) can be used to propagate and quantify input uncertainty through a Deep Neural Network (DNN) assuming mild hypotheses on the input distribution.

Neural Network Topologies for Sparse Training

no code implementations • 14 Sep 2018 • Ryan A. Robinett, Jeremy Kepner

The sizes of deep neural networks (DNNs) are rapidly outgrowing the capacity of hardware to store and train them.

TabulaROSA: Tabular Operating System Architecture for Massively Parallel Heterogeneous Compute Engines

no code implementations • 14 Jul 2018 • Jeremy Kepner, Ron Brightwell, Alan Edelman, Vijay Gadepally, Hayden Jananthan, Michael Jones, Sam Madden, Peter Michaleas, Hamed Okhravi, Kevin Pedretti, Albert Reuther, Thomas Sterling, Mike Stonebraker

In this context, an operating system can be viewed as software that brokers and tracks the resources of the compute engines and is akin to a database management system.

Distributed, Parallel, and Cluster Computing Databases Operating Systems Performance

Sparse Deep Neural Network Exact Solutions

no code implementations • 6 Jul 2018 • Jeremy Kepner, Vijay Gadepally, Hayden Jananthan, Lauren Milechin, Sid Samsi

This work uses associative array DNNs to construct exact solutions and corresponding perturbation models to the rectified linear unit (ReLU) DNN equations that can be used to construct test vectors for sparse DNN implementations over various precisions.

Static Graph Challenge: Subgraph Isomorphism

no code implementations • 23 Aug 2017 • Siddharth Samsi, Vijay Gadepally, Michael Hurley, Michael Jones, Edward Kao, Sanjeev Mohindra, Paul Monticciolo, Albert Reuther, Steven Smith, William Song, Diane Staheli, Jeremy Kepner

The proposed Subgraph Isomorphism Graph Challenge draws upon prior challenges from machine learning, high performance computing, and visual analytics to create a graph challenge that is reflective of many real-world graph analytics processing systems.

Distributed, Parallel, and Cluster Computing Data Structures and Algorithms

Enabling Massive Deep Neural Networks with the GraphBLAS

no code implementations • 9 Aug 2017 • Jeremy Kepner, Manoj Kumar, José Moreira, Pratap Pattnaik, Mauricio Serrano, Henry Tufo

The performance of the GraphBLAS implementation is measured relative to a standard dense linear algebra library implementation.

Math

Benchmarking Data Analysis and Machine Learning Applications on the Intel KNL Many-Core Processor

no code implementations • 12 Jul 2017 • Chansup Byun, Jeremy Kepner, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Michael Jones, Anna Klein, Peter Michaleas, Lauren Milechin, Julie Mullen, Andrew Prout, Antonio Rosa, Siddharth Samsi, Charles Yee, Albert Reuther

Thus, the performance of these applications on KNL systems is of high interest to LLSC users and the broader data analysis and machine learning communities.

Performance Instrumentation and Methods for Astrophysics Distributed, Parallel, and Cluster Computing Computational Physics

Non-Negative Matrix Factorization Test Cases

no code implementations • 30 Dec 2016 • Connor Sell, Jeremy Kepner

Non-negative matrix factorization (NMF) is a problem with many applications, ranging from facial recognition to document clustering.

Clustering

Large Enforced Sparse Non-Negative Matrix Factorization

no code implementations • 18 Oct 2015 • Brendan Gavin, Vijay Gadepally, Jeremy Kepner

Non-negative matrix factorization (NMF) is a common method for generating topic models from text data.

Topic Models
