Search Results for author: Phitchaya Mangpo Phothilimthana

Found 9 papers, 4 papers with code

Accelerating Retrieval-Augmented Language Model Serving with Speculation

no code implementations • 25 Jan 2024 • Zhihao Zhang, Alan Zhu, Lijie Yang, Yihua Xu, LanTing LI, Phitchaya Mangpo Phothilimthana, Zhihao Jia

Retrieval-augmented language models (RaLM) have demonstrated the potential to solve knowledge-intensive natural language processing (NLP) tasks by combining a non-parametric knowledge base with a parametric language model.

Language Modelling Retrieval

Paper
Add Code

TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs

1 code implementation • NeurIPS 2023 • Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Kaidi Cao, Bahare Fatemi, Mike Burrows, Charith Mendis, Bryan Perozzi

TpuGraphs provides 25x more graphs than the largest graph property prediction dataset (with comparable graph sizes), and 770x larger graphs on average compared to existing performance prediction datasets on machine learning programs.

Graph Property Prediction Property Prediction

120

Paper
Code

Learning Large Graph Property Prediction via Graph Segment Training

1 code implementation • NeurIPS 2023 • Kaidi Cao, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Dustin Zelle, Yanqi Zhou, Charith Mendis, Jure Leskovec, Bryan Perozzi

Here we propose Graph Segment Training (GST), a general framework that utilizes a divide-and-conquer approach to allow learning large graph property prediction with a constant memory footprint.

Graph Property Prediction Property Prediction

Paper
Code

Optimizing Memory Mapping Using Deep Reinforcement Learning

no code implementations • 11 May 2023 • Pengming Wang, Mikita Sazanovich, Berkin Ilbeyi, Phitchaya Mangpo Phothilimthana, Manish Purohit, Han Yang Tay, Ngân Vũ, Miaosen Wang, Cosmin Paduraru, Edouard Leurent, Anton Zhernov, Po-Sen Huang, Julian Schrittwieser, Thomas Hubert, Robert Tung, Paula Kurylowicz, Kieran Milan, Oriol Vinyals, Daniel J. Mankowitz

We also introduce a Reinforcement Learning agent, mallocMuZero, and show that it is capable of playing this game to discover new and improved memory mapping solutions that lead to faster execution times on real ML workloads on ML accelerators.

Cloud Computing Decision Making +3

Paper
Add Code

GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation

1 code implementation • 8 Oct 2022 • Ondrej Sykora, Phitchaya Mangpo Phothilimthana, Charith Mendis, Amir Yazdanbakhsh

In this paper, we introduce GRANITE, a new machine learning model that estimates the throughput of basic blocks across different microarchitectures.

Multi-Task Learning

Paper
Code

Neural Architecture Search using Property Guided Synthesis

1 code implementation • 8 May 2022 • Charles Jin, Phitchaya Mangpo Phothilimthana, Sudip Roy

To enable this approach, we also propose a novel efficient synthesis procedure, which accepts a set of promising program properties, and returns a satisfying neural architecture.

Neural Architecture Search

32,835

Paper
Code

A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules

no code implementations • 7 Dec 2021 • Xinfeng Xie, Prakash Prabhu, Ulysse Beaugnon, Phitchaya Mangpo Phothilimthana, Sudip Roy, Azalia Mirhoseini, Eugene Brevdo, James Laudon, Yanqi Zhou

Partitioning ML graphs for MCMs is particularly hard as the search space grows exponentially with the number of chiplets available and the number of nodes in the neural network.

BIG-bench Machine Learning Reinforcement Learning (RL) +1

Paper
Add Code

Transferable Graph Optimizers for ML Compilers

no code implementations • NeurIPS 2020 • Yanqi Zhou, Sudip Roy, Amirali Abdolrashidi, Daniel Wong, Peter Ma, Qiumin Xu, Hanxiao Liu, Phitchaya Mangpo Phothilimthana, Shen Wang, Anna Goldie, Azalia Mirhoseini, James Laudon

Most compilers for machine learning (ML) frameworks need to solve many correlated optimization problems to generate efficient machine code.

Paper
Add Code

A Learned Performance Model for Tensor Processing Units

no code implementations • 3 Aug 2020 • Samuel J. Kaufman, Phitchaya Mangpo Phothilimthana, Yanqi Zhou, Charith Mendis, Sudip Roy, Amit Sabne, Mike Burrows

Accurate hardware performance models are critical to efficient code generation.

Code Generation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.