Synchronization-reducing and Communication-reducing Algorithms and Programming Models for Large-scale Simulations (January 9-13, 2012)


Organizing Committee
Images

 

[Image courtesy of Hatem Ltaief]
Directed Acyclic Graph for the execution of a symmetric generalized eigensolver for a dense 4x4 matrix

 


Description

As concurrency in scientific computing pushes beyond a million threads and performance of individual threads becomes less reliable for hardware-related reasons, attention of mathematicians, computer scientists, and supercomputer users and suppliers inevitably focuses on reducing communication and synchronization bottlenecks. Though convenient for succinctness, reproducibility, and stability, instruction ordering in contemporary codes is commonly overspecified. This workshop attempts to outline evolution of simulation codes from today's infra-petascale to the ultra-exascale and to encourage importation of ideas from other areas of computer science into numerical algorithms, new invention, and programming model generalization.