Leveraging NVLINK and asynchronous data transfer to scale beyond the memory capacity of GPUs
David Appelhans, Bob Walkup
2017
ScalA/SC 2017