DeepSeek releases Prover-V2 model with 671 billion parameters
2025-04-30 18:37:18

DeepSeek today released a new model, DeepSeek-Prover-V2-671B, on the AI open-source community Hugging Face. The model reportedly ships in the more efficient safetensors file format and supports multiple compute precisions, enabling faster and more resource-efficient training and deployment. With 671 billion parameters, it appears to be an upgraded version of the Prover-V1.5 mathematical model released last year.

Architecturally, the model is built on DeepSeek-V3 and adopts a mixture-of-experts (MoE) design, with 61 Transformer layers and a hidden dimension of 7,168. It also supports ultra-long contexts, with a maximum position embedding of 163,840, enabling it to handle lengthy, complex mathematical proofs. The weights use FP8 quantization, which reduces model size and improves inference efficiency.
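For readers who want to check these architecture figures themselves, a minimal sketch follows, assuming the checkpoint is published under the deepseek-ai organization on Hugging Face and that its custom configuration loads with trust_remote_code; it reads the repository's config via the transformers library:

```python
from transformers import AutoConfig

# Assumed repo id: the article names the model DeepSeek-Prover-V2-671B,
# and DeepSeek checkpoints are typically published under "deepseek-ai".
repo_id = "deepseek-ai/DeepSeek-Prover-V2-671B"

# DeepSeek models ship custom model code, hence trust_remote_code=True.
config = AutoConfig.from_pretrained(repo_id, trust_remote_code=True)

# Cross-check the figures quoted above.
print(config.num_hidden_layers)        # expected: 61
print(config.hidden_size)              # expected: 7168
print(config.max_position_embeddings)  # expected: 163840
```

Fetching only the config this way avoids downloading the multi-hundred-gigabyte weight shards.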