China Sunway Supercomputer Team Claims 174 Trillion Parameter AI Model

A team of researchers from China used the Sunway supercomputer to train an AI model with 174 trillion parameters called ‘bagualu,’ which means “alchemist’s pot.” The AI parameters are comparable to the number of synapses (1000 trillion in the human brain). However, human synapses and AI parameters are not equivalent. In 2020, Microsoft trained a…
China Sunway Supercomputer Team Claims 174 Trillion Parameter AI Model


A team of researchers from China used the Sunway supercomputer to train an AI model with 174 trillion parameters called ‘bagualu,’ which means “alchemist’s pot.” The AI parameters are comparable to the number of synapses (1000 trillion in the human brain). However, human synapses and AI parameters are not equivalent.

In 2020, Microsoft trained a natural language model using 17 billion parameters.

In2021, Google announced an AI model trained with 1.6 trillion parameters.

The Sunway supercomputer has a speed of a billion operations per second, or 5.3 floating-point operations per second (exaflops). It is using FP32 single precision operations. According to the researchers, it has 37 million CPU cores — four times as many as Frontier — and nine petabytes of memory. They also claim the 96,000 semi-independent computer systems, called nodes, resemble the power of a human brain. Communications between these nodes take place at a speed of more than 23 petabytes per second.

Here is a research paper describing the pre-training of the BaGuaLu model.

BaGuaLu: Targeting Brain Scale Pretrained Models with over 37 Million Cores

Abstract


Large-scale pretrained AI models have shown state-of-theart accuracy in a series of important applications. As the size of pretrained AI models grows dramatically each year in an effort to achieve higher accuracy, training such models requires massive computing and memory capabilities, which accelerates the convergence of AI and HPC. However, there are still gaps in deploying AI applications on HPC systems, which need application and system co-design based on specific hardware features.

To this end, this paper proposes BaGuaLu, the first work targeting training brain scale models on an entire exascale supercomputer, the New Generation Sunway Supercomputer. By combining hardware-specific intra-node optimization and hybrid parallel strategies, BaGuaLu enables decent performance and scalability on unprecedentedly large models.

The evaluation shows that BaGuaLu can train 14.5-trillion parameter models with a performance of over 1 EFLOPS


using mixed-precision and has the capability to train 174-trillion-parameter models, which rivals the number of synapses in a human brain.

Read More

Total
0
Shares
Leave a Reply

Your email address will not be published.

Related Posts
Where Do Battery Materials Come From?
Read More

Where Do Battery Materials Come From?

There are many common materials in batteries like iron, those materials are sourced locally from where the batteries are made. Most of the world’s batteries are made in China. Therefore, most of the materials would be sourced in China. China also has quite a bit of lithium which is one of the primary more scarce…
Fact Mix 850: FUMU
Read More

Fact Mix 850: FUMU

FUMU dedicates his whiplash-inducing Fact mix to “those living with inattentiveness and hyperactivity,” tracing the global response to the east African singeli movement. Over the last four years FUMU (Fuck Up Move Up) has emerged as one of the deadliest operators within Andrew Lyster‘s YOUTH enclave, releasing a steady drip of killer productions, shooting frazzled…
Sasha Smirnova & nara is neus play with memory and digital reproduction in Don’t You Recall?
Read More

Sasha Smirnova & nara is neus play with memory and digital reproduction in Don’t You Recall?

A somnambulant trip through fictional landscapes created with 3D scans of physical locations, set against the sounds of electroacoustic musician nara is neus. Don’t You Recall? originated as a live audiovisual performance at the 2021 edition of MIRA Festival, Barcelona’s foremost digital arts festival, which saw electroacoustic musician nara is neus join forces with visual…