Arm-based supercomputer prototype to be deployed at Sandia National Laboratories by DOE
ALBUQUERQUE, N.M. — Microprocessors designed by Arm are ubiquitous in automobile electronics, cellphones and other embedded applications, but until recently they have not provided the performance necessary to make them practical for high-performance computing.
A computer-automated design conception of Sandia National Laboratories’ Astra supercomputer, used to work out the floor layout of the supercomputer’s compute, cooling, network and data storage cabinets. (Illustration courtesy of Hewlett Packard Enterprise.)
Astra, one of the first supercomputers to use processors based on the Arm architecture in a large-scale high-performance computing platform, is expected to be deployed in late summer at Sandia National Laboratories.
The U.S. Department of Energy’s National Nuclear Security Administration today announced that Astra, the first of a potential series of advanced architecture prototype platforms, will be deployed as part of the Vanguard program. This program will evaluate the feasibility of emerging high-performance computing architectures as production platforms to support NNSA’s mission to maintain and enhance the safety, security and effectiveness of the U.S. nuclear stockpile.
Astra will be based on the recently announced Cavium Inc. ThunderX2 64-bit Armv8 microprocessor. The platform will consist of 2,592 compute nodes, of which each is 28-core, dual-socket capable, and will be at a theoretical peak of more than 2.3 petaflops, equivalent to 2.3 quadrillion floating-point operations (FLOPS), or calculations, per second. While being the fastest is not one of the goals of Astra or the Vanguard program in general, a single Astra node is roughly one hundred times faster than a modern Arm-based cellphone.
“One of the important questions Astra will help us answer is how well does the peak performance of this architecture translate into real performance for our mission applications,” says Mark Anderson, program director for NNSA’s Advanced Simulation and Computing program, which funds Astra.
A first step for Vanguard
Scott Collis, director of Sandia’s Center for Computing Research, says: “Emerging architectures come with many challenges. Since the NNSA has not previously deployed high-performance computing platforms based on Arm processors, there are gaps in the software that must be addressed before considering this technology for future platforms much larger in scale than Astra.”
As part of a multiple lab partnership, researchers anticipate continually improving Astra and future platforms.
“Sandia researchers partnering with counterparts at Los Alamos and Lawrence Livermore national laboratories expect to develop an improved software-and-tools environment that will enable mission codes to make increasingly effective use of Astra as well as future leadership-class platforms,” says Ken Alvin, senior manager of Sandia’s extreme scale computing group. “The Vanguard program is designed to allow the NNSA to take prudent risks in exploring emerging technologies and broadening our future computing options.”
Astra will be installed at Sandia in an expanded part of the building that originally housed the innovative Red Storm supercomputer.
The Astra platform will be deployed in partnership with Westwind Computer Products Inc. and Hewlett Packard Enterprise.
“Astra, like Red Storm, will require a very intimate collaboration between Sandia and commercial partners,” says James Laros, Vanguard project lead. “In this case, all three NNSA defense labs will work closely with Westwind, HP Enterprise, Arm, Cavium, and the wider high-performance computing community to achieve a successful outcome of this project.”
Astra takes its name from the Latin phrase “per aspera ad astra,” which translates as “through difficulties to the stars.”
“The development of a scalable Arm platform based on HP Enterprise’s Apollo 70 systems will become a key resource to expand the Arm high-performance computing ecosystem. Westwind is honored to be entrusted by Sandia, in its continued commitment to developing small businesses here in New Mexico, to implement such an important project.”
— Steve Hull, president of Westwind Computer Product
“By delivering the world’s largest Arm-based supercomputer featuring the HPE Apollo 70 platform, a purpose-built architecture that includes advanced performance and storage capabilities, we are enabling the U.S. Department of Energy and National Nuclear Security Administration to power innovative solutions for energy and national security uses.”
— Mike Vildibill, vice president of the Advanced Technology Group at HPE
“Arm has been deeply engaged with Sandia National Laboratories working to comprehend and deliver on the needs of the high-performance computing community. We are eager to support the Vanguard program as a key milestone deployment for Arm and our partners, delivering on a shared vision to spur innovation in this critical domain.”
— Drew Henry, senior vice president and general manager of Arm’s Infrastructure Business Line
“Cavium is pleased to partner with Sandia National Laboratories to enable the Armv8-based high-performance computing cluster as part of the Vanguard program. Vanguard is an additional proof point regarding readiness and maturity of ThunderX2 processors for large-scale deployments and will further accelerate the entire computing ecosystem on the Arm server architecture.”
— Gopal Hegde, vice president and general manager of the Data Center Processor Group at Cavium
Sandia National Laboratories is a multimission laboratory operated by National Technology and Engineering Solutions of Sandia LLC, a wholly owned subsidiary of Honeywell International Inc., for the U.S. Department of Energy’s National Nuclear Security Administration. Sandia Labs has major research and development responsibilities in nuclear deterrence, global security, defense, energy technologies and economic competitiveness, with main facilities in Albuquerque, New Mexico, and Livermore, California.
Sandia news media contact: