Search Results for author: Bodun Hu

Found 3 papers, 1 paper with code

FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping

no code implementations • 5 Apr 2024 • Ajay Jaiswal, Bodun Hu, Lu Yin, Yeonju Ro, Shiwei Liu, Tianlong Chen, Aditya Akella

In this work, we observed the saturation of computationally expensive feed-forward blocks of LLM layers and proposed FFN-SkipLLM, which is a novel fine-grained skip strategy of autoregressive LLMs.

Attribute Hallucination +1

MOSEL: Inference Serving Using Dynamic Modality Selection

no code implementations • 27 Oct 2023 • Bodun Hu, Le Xu, Jeongyoon Moon, Neeraja J. Yadwadkar, Aditya Akella

Rapid advancements over the years have helped machine learning models reach previously hard-to-achieve goals, sometimes even exceeding human capabilities.

ALTIS: Modernizing GPGPU Benchmarking

1 code implementation • 25 Jun 2019 • Bodun Hu, Christopher J. Rossbach

This paper presents Altis, a benchmark suite for modern GPGPU computing.

Benchmarking
