no code implementations • LREC 2022 • Nancy Ide, Keith Suderman, Jingxuan Tu, Marc Verhagen, Shanan Peters, Ian Ross, John Lawson, Andrew Borg, James Pustejovsky
This paper provides an overview of the xDD/LAPPS Grid framework and provides results of evaluating the AskMe retrievalengine using the BEIR benchmark datasets.
no code implementations • 30 Aug 2020 • John Lawson
Automatically tuning parallel compute kernels allows libraries and frameworks to achieve performance on a wide range of hardware, however these techniques are typically focused on finding optimal kernel parameters for particular input sizes and parameters.
no code implementations • 15 Mar 2020 • John Lawson
Automated tuning of compute kernels is a popular area of research, mainly focused on finding optimal kernel parameters for a problem with fixed input sizes.
no code implementations • 10 Apr 2019 • John Lawson, Mehdi Goli, Duncan McBain, Daniel Soutar, Louis Sugy
Over recent years heterogeneous systems have become more prevalent across HPC systems, with over 100 supercomputers in the TOP500 incorporating GPUs or other accelerators.
no code implementations • 8 Apr 2019 • Rod Burns, John Lawson, Duncan McBain, Daniel Soutar
There are a number of approaches available to developers for utilizing GPGPU technologies such as SYCL, OpenCL and CUDA, however many applications require the same low level mathematical routines.