Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 History  



1.1  Early development  





1.2  Launch  





1.3  Updates  







2 Technical specifications  





3 See also  





4 References  














Huawei PanGu







 

Edit links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


Huawei PanGu
Developer(s)Huawei
Initial release3.0, July 7, 2023; 12 months ago (2023-07-07)
Stable release

5.0 / June 21, 2024; 25 days ago (2024-06-21)

Available inChinese, English, Russian
TypeLarge language model
LicenseProprietary

Huawei PanGu, PanGu, PanGu-ΣorPanGu-π (Chinese: 盘古大模型; pinyin: pángǔ dà móxíng) is a multimodal large language model developed by Huawei. It was announced on July 7, 2023, positioned as a contender to other multimodal large language models.[1]

The name of the large learning language model, PanGu, was derived from the Chinese mythology and folklore of Pangu, a primordial character related to the creation of the world.[2]

History

[edit]

Early development

[edit]

In April 2023, Huawei released a paper detailing the development of PanGu-Σ, a colossal language model featuring 1.085 trillion parameters. Developed within Huawei's MindSpore 5 framework, PanGu-Σ underwent training for over 100 days on a cluster system equipped with 512 Ascend 910 AI accelerator chips, processing 329 billion tokens in more than 40 natural and programming languages.[3]

PanGu-Σ incorporates Random Routed Experts (RRE) and the Transformer decoder architecture, allowing easy extraction of sub-models for various applications like conversation, translation, code production, and natural language interpretation. The model achieves 6.3 times faster training throughput compared to MoE models with the same hyper-parameters. In the Chinese domain, it outperforms previous state-of-the-art models across 16 tasks in a zero-shot setting. Trained on datasets from 40 domains, including Chinese, English, Bilingual, and code, PanGu-Σ excels in few-shot natural-language understanding, open-domain discussion, question answering, machine translation, and code creation.[4][5]

Launch

[edit]

During the Huawei Developer Conference on July 7, 2023, Huawei introduced PanGu 3.0, a large language model (LLM), tailored for sectors like government, finance, manufacturing, mining, and meteorology utilizing Huawei Cloud [zh] solutions. In the subsequent month, Huawei launched the Celia Virtual Assistant with advanced AI features, capable of generating long text replies based on user voice commands and set to release with HarmonyOS 4.0 for eligible devices.[6][7]

The LLM was designed for enterprises seeking advantages in the AI industry, focusing on task execution over creative work, unlike traditional models used for general purposes like chatbots, poetry, and visual content creation.[8]

Using the same technology as ChatGPT, Huawei's LLM features a hierarchical architecture, allowing customers to adapt the model to various tasks and train it on their own datasets, making it versatile across various industries.[9]

Updates

[edit]

On August 5, 2023, Huawei partnered with European Centre for Medium-Range Weather Forecasts (ECMWF) to launch a global weather forecasting AI model. This model used Huawei Cloud solutions and the PanGu-Weather Model with MindSpore. It is accessible on the ECMWF website and aims to provide accurate weather data.[10][11]

On December 19, 2023, Huawei announced its financial services on the PanGu-powered AI Finance platform for the global market. The tech giant introduced this product at the 2023 Huawei Cloud Fintech Summit, aiming to reshape the digital finance industry with efficient features to boost Fintech firms worldwide. The platform incorporated a variety of advanced technologies, including AI, big data analytics, and blockchain.[12]

On June 21, 2024 at HDC 2024, Huawei announced upgraded PanGu 5.0 alongside HarmonyOS NEXT. This version integrated with Harmony Intelligence, which features a smarter Celia (Xiaoyi) and focuses on generative AI updates to its LLM platform for creating new content, such as text, code, or images. Aiming to make PanGu accessible to a wide range of developers and businesses, it offered scalable options: smaller models requiring less computational power for those with limited resources, and larger models with increased capacities for complex tasks requiring more processing power.[13]

Technical specifications

[edit]

PanGu Large Model 3.0, designed for industry use, was structured with a 5+N+X three-tier architecture.[14]

The updated Huawei PanGu Model 5.0 by Huawei Cloud business division offered three key features: adaptability for different business scenarios, multi-style modeling, and advanced intelligence. Huawei divided the AI model platform into four series, each with different parameter scales:[15]

See also

[edit]

References

[edit]
  1. ^ "Reshaping Industries with AI: Huawei Cloud launches Pangu Models 3.0 and Ascend AI Cloud services". CITI Newsroom. CITI Newsroom. Retrieved February 13, 2024.
  • ^ Nair, Arya M. (July 8, 2023). "Huawei rolls out latest version of its deep learning AI model, Pangu - GCC Business News". Retrieved May 29, 2024.
  • ^ Upadhyay, Shyam Nandan. "Huawei Researchers Develop LLM With 1.085 Trillion Parameters". AnalyticsIndiaMag. AnalyticsIndiaMag. Retrieved February 13, 2024.
  • ^ "Huawei Researchers Unveil Pangu-Σ: Trillion-Parameter Language Model with Sparse Architecture". Multiplatform.ai. Multiplatform.ai. Retrieved February 13, 2024.
  • ^ Tickoo, Aneesh. "Huawei Researchers Develop Pangu-Σ: A Large Language Model With Sparse Architecture And 1.085 Trillion Parameters". marktechpost.com. marktechpost.com. Retrieved February 13, 2024.
  • ^ "Huawei Pangu AI models for Government, finance, manufacturing, mining, meteorology". HC Newsroom. HC Newsroom. Retrieved February 13, 2024.
  • ^ Sarkar, Amy. "Huawei launches Voice Assistant with large Pangu AI model". HC Newsroom. HC Newsroom. Retrieved February 13, 2024.
  • ^ "Revolutionizing Global AI Landscape: Huawei's PanGu Megamodel Set to Transform Industries Worldwide". LinkedIn. Grosso Link Sàrl. Retrieved February 13, 2024.
  • ^ Jarrett, Miranda. "Huawei to revolutionise applications of AI with new Pangu model". Dao Insights. Dao Insights. Retrieved February 13, 2024.
  • ^ Li, Deng. "Huawei Pangu-Weather Model debuts European ECMWF website". HC Newsroom. HC Newsroom. Retrieved February 13, 2024.
  • ^ Mishra, Yash. "Huawei Cloud will build large-scale high-precision regional weather forecast Pangu model". HC Newsroom. HC Newsroom. Retrieved February 13, 2024.
  • ^ Birch, Scott. "Huawei Cloud and Pangu AI model reshaping finance industry". FinTech Magazine. FinTech Magazine. Retrieved February 13, 2024.
  • ^ Writer, Staff (June 22, 2024). "Huawei Unveils New Harmony OS And AI Model In Continued Drive For Tech Self-reliance". Elnion. Retrieved July 7, 2024.
  • ^ "Huawei launches latest AI model, Pangu 3.0". Business Today (Malaysia). Business Today (Malaysia). Retrieved February 13, 2024.
  • ^ Matsui, Emiko (June 21, 2024). "Huawei Cloud unveils Pangu Large Model 5.0". Huawei Central. Retrieved July 7, 2024.


  • Retrieved from "https://en.wikipedia.org/w/index.php?title=Huawei_PanGu&oldid=1233864026"

    Categories: 
    2023 software
    Huawei products
    Large language models
    Multimodal interaction
    Hidden categories: 
    Articles with short description
    Short description matches Wikidata
    Use American English from July 2023
    All Wikipedia articles written in American English
    Use list-defined references from July 2023
    Use mdy dates from August 2023
    Articles containing simplified Chinese-language text
     



    This page was last edited on 11 July 2024, at 08:58 (UTC).

    Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Mobile view



    Wikimedia Foundation
    Powered by MediaWiki