Search Results for author: Weiyang Wang

Found 7 papers, 1 papers with code

How to Build Low-cost Networks for Large Language Models (without Sacrificing Performance)?

no code implementations22 Jul 2023 Weiyang Wang, Manya Ghobadi, Kayvon Shakeri, Ying Zhang, Naader Hasani

We show that LLMs exhibit a unique communication pattern where only small groups of GPUs require high-bandwidth communication to achieve near-optimal training performance.

Blocking Language Modelling +1

Efficient Direct-Connect Topologies for Collective Communications

no code implementations7 Feb 2022 Liangyu Zhao, Siddharth Pal, Tapan Chugh, Weiyang Wang, Jason Fantl, Prithwish Basu, Joud Khoury, Arvind Krishnamurthy

Our algorithms start from small, optimal base topologies and associated communication schedules and use a set of techniques that can be iteratively applied to derive much larger topologies and schedules.

On the FRB luminosity function -- II. Event rate density

1 code implementation10 Mar 2020 Rui Luo, Yunpeng Men, Kejia Lee, Weiyang Wang, D. R. Lorimer, Bing Zhang

Assuming a Schechter luminosity function form, we infer (at the 95% confidence level) that the characteristic FRB event rate density at the upper cut-off luminosity $L^*=2. 9_{-1. 7}^{+11. 9}\times10^{44}\,\rm erg\, s^{-1}$ is $\phi^*=339_{-313}^{+1074}\,\rm Gpc^{-3}\, yr^{-1}$, the power-law index is $\alpha=-1. 79_{-0. 35}^{+0. 31}$, and the lower cut-off luminosity is $L_0\le9. 1\times10^{41}\,\rm erg\, s^{-1}$.

High Energy Astrophysical Phenomena Cosmology and Nongalactic Astrophysics

Distributed Low Precision Training Without Mixed Precision

no code implementations18 Nov 2019 Zehua Cheng, Weiyang Wang, Yan Pan, Thomas Lukasiewicz

However, most low precision training solution is based on a mixed precision strategy.

Model Compression

Segmentation is All You Need

no code implementations30 Apr 2019 Zehua Cheng, Yuxiang Wu, Zhenghua Xu, Thomas Lukasiewicz, Weiyang Wang

Region proposal mechanisms are essential for existing deep learning approaches to object detection in images.

Face Detection Head Detection +5

FoxNet: A Multi-face Alignment Method

no code implementations22 Apr 2019 Yuxiang Wu, Zehua Cheng, Bin Huang, Yiming Chen, Xinghui Zhu, Weiyang Wang

Multi-face alignment aims to identify geometry structures of multiple faces in an image, and its performance is essential for the many practical tasks, such as face recognition, face tracking, and face animation.

Clustering Face Alignment +1

Quantum anomalous Hall effect in stable 1T-YN$_2$ monolayer with a large nontrivial band gap and high Chern number

no code implementations6 Jul 2017 Xiangru Kong, Linyang Li, Ortwin Leenaerts, Weiyang Wang, Xiong-Jun Liu, François M. Peeters

The quantum anomalous Hall (QAH) effect is a topologically nontrivial phase, characterized by a non-zero Chern number defined in the bulk and chiral edge states in the boundary.

Materials Science

Cannot find the paper you are looking for? You can Submit a new open access paper.