Industry first: end-to-end training of a large AI model completed from 0 to 1 on domestic GPUs

Source: Contributed
2024-05-27 13:52:16

Moore Threads and Wuwen Xinqiong (Infinigence AI) jointly announced today that the two companies have completed training of "MT-infini-3B", a large model trained end to end on a thousand-card cluster of domestic full-function GPUs. The cluster is built from Moore Threads' domestic full-function MTT S4000 GPUs and runs on Wuwen Xinqiong's AIStudio PaaS platform.

According to the announcement, training MT-infini-3B took 13.2 days in total, and the entire run was stable and uninterrupted, with cluster training stability reaching 100% and the scaling efficiency of thousand-card training exceeding 90% relative to a single machine. The companies say this "fully verified the reliability of the KUAE thousand-card intelligent computing cluster for large-model training, and pioneered a new paradigm of deep cooperation between domestic large language models and domestic GPU thousand-card intelligent computing clusters" in the industry.
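For readers unfamiliar with the metric, scaling efficiency is commonly defined as a cluster's actual speedup divided by its ideal linear speedup. The announcement does not publish the underlying throughput figures, so the short sketch below uses purely illustrative values to show how such a number is typically computed.

# Minimal sketch of how scaling efficiency is commonly calculated.
# The throughput figures are illustrative placeholders, not data from the
# MT-infini-3B run, which were not disclosed in the announcement.

def scaling_efficiency(single_gpu_throughput: float,
                       cluster_throughput: float,
                       num_gpus: int) -> float:
    """Actual speedup divided by ideal linear speedup (1.0 = perfect scaling)."""
    ideal_throughput = single_gpu_throughput * num_gpus
    return cluster_throughput / ideal_throughput

# Hypothetical example: 1,000 GPUs delivering 920x a single GPU's throughput
# corresponds to 92% scaling efficiency, i.e. "exceeding 90%".
print(f"{scaling_efficiency(1.0, 920.0, 1000):.0%}")  # -> 92%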



The trained MT-infini-3B currently ranks first in performance among models of the same scale. Compared with models of comparable size trained on mainstream international hardware, MT-infini-3B leads on three benchmark sets: C-Eval, MMLU, and CMMLU.

[Chart: MT-infini-3B benchmark performance]


Xia Lixue, co-founder and CEO of Wuwen Xinqiong, said: "The ultimate goal of the coordinated development of domestic large models and domestic chip software and hardware is to build a mature ecosystem. Wuwen Xinqiong is building an 'M × N' middle-layer product between 'M models' and 'N chips' to achieve efficient, unified deployment of multiple large-model algorithms on multiple chips. Moore Threads is the first domestic GPU company to integrate with Wuwen Xinqiong and carry out thousand-card-scale large-model training, and the MT-infini-3B run is the industry's first case of end-to-end large-model training, from 0 to 1, on domestic GPU chips."

Zhang Jianzhong, founder and CEO of Moore Threads, said: "Wuwen Xinqiong's from-scratch large-model training on the KUAE thousand-card intelligent computing cluster is not only strong validation of Moore Threads' technical strength, but also closes the loop for large-model training on a domestic stack. Built on full-function GPUs, the Moore Threads KUAE thousand-card intelligent computing cluster provides a full-stack solution integrating software and hardware, with comprehensive advantages in compatibility, stability, and scalability. We are committed to becoming solid and reliable advanced infrastructure for large-model training in the AGI era."

Moore Threads and Wuwen Xinqiong had previously established a deep strategic partnership. Wuwen Xinqiong's large-model development and service platform "Infini-AI" and Moore Threads' KUAE thousand-card intelligent computing cluster have completed system-level integration and adaptation, allowing the platform to flexibly call on the KUAE cluster's capacity for large-model training, fine-tuning, and inference tasks. Going forward, the two companies will carry out further adaptation and testing to promote the rapid development and adoption of domestic large-model technology and contribute to the growth of China's artificial intelligence industry.

About Wuwen Xinqiong (Infinigence AI)

Wuwen Xinqiong relies on industry-leading, proven AI computing optimization capabilities and computing-power solutions to pursue the best possible efficiency in deploying large models. It builds "M × N" middle-layer products between "M models" and "N chips" to achieve efficient, unified deployment of multiple large-model algorithms on multiple chips, links the upstream and downstream of the industry, jointly builds the model infrastructure of the AGI era, and accelerates the adoption of AGI across thousands of industries.

About Moore Threads

Moore Threads is a high-tech integrated circuit company focused on the design of full-function GPU chips. It provides powerful computing acceleration for a broad range of technology ecosystem partners and is committed to building a metacomputing platform that delivers diversified computing power for the next generation of the Internet.
