no code implementations • 11 Feb 2022 • Junkyeong Choi, Hyucksung Kwon, Woongkyu Lee, Jungwook Choi, Jieun Lim
In this method, we devise a search space that explores the thread tile and warp sizes to increase the data reuse despite a large matrix operand of reduced-precision MMA.
1 code implementation • 2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC) 2022 • Woongkyu Lee, Hyucksung Kwon, Jungwook Choi
However, the computation-demanding nature of DNNs, along with the time-consuming fusion of video and thermal camera frames, raises hurdles for the cost-effective deployment of such AI thermometer systems.