![]() |
|
![]() |
|
![]() |
|
![]() |
|
![]() |
|
![]() |
|
![]() |
|
![]() |
|
![]() |
|
![]() |
|
![]() |
|
![]() |
|
![]() |
|
![]() |
|
![]() |
|
加密货币新闻
Introducing Phi-4-Reasoning-Plus: A Compact, High-Performing Open-Weight Language Model for Reasoning Across Domains
2025/05/01 23:02
Microsoft Research today announced the release of Phi-4-Reasoning-Plus, a compact yet high-performing open-weight language model designed for structured reasoning across domains like math, coding, science, and logic.
This upgraded 14-billion-parameter model builds on the architecture of the original Phi-4. It's densely packed and decoder-only, prioritizing quality over sheer size. Trained on 16 billion tokens—over half of them unique—the model blends synthetic and curated web data to attain a level of performance that rivals or even surpasses much larger models.
Despite its relatively modest size, Phi-4-Reasoning-Plus outperforms 70B+ models like DeepSeek-R1-Distill on challenging benchmarks. On the AIME 2025 math exam, it achieves a higher “pass@1” rate across all 30 problems compared to heavyweight competitors—nearly reaching the performance of DeepSeek-R1's full 671B parameter version.
The model's training pipeline combines supervised fine-tuning with reinforcement learning:
* Supervised fine-tuning utilized curated chain-of-thought datasets with special tags to segregate intermediate reasoning from final answers—enhancing transparency and coherence.
* A second RL phase, using just 6,400 math problems and Microsoft's Group Relative Policy Optimization (GRPO) algorithm, boosted the model's depth, accuracy, and formatting consistency.
Phi-4-Reasoning-Plus natively supports 32k-token context lengths (up to 64k in tests), making it ideal for heavy text tasks like legal reasoning, financial analysis, or technical Q&A—especially when memory or latency are critical.
It integrates seamlessly with popular inference frameworks such as Hugging Face Transformers, vLLM, llama.cpp, and Ollama. It's released under the permissive MIT license, allowing commercial use, fine-tuning, and distillation without any restrictions.
Designed for modular AI pipelines and interpretable outputs, Phi-4-Reasoning-Plus is a strong fit for teams managing AI deployment, orchestration, or compliance. Its structured output format aids explainability, while its performance under resource constraints enables scalable real-time reasoning.
Microsoft has conducted extensive safety testing, including red teaming and evaluations via tools like Toxigen. These measures render it more suitable for enterprise use in regulated industries.
Phi-4-Reasoning-Plus marks a growing trend: small, efficient models that overachieve. For technical leaders balancing performance, cost, and control, it provides a powerful, open, and adaptable reasoning engine—capable of enterprise integration without the hefty infrastructure footprint of mega-models.
免责声明:info@kdj.com
所提供的信息并非交易建议。根据本文提供的信息进行的任何投资,kdj.com不承担任何责任。加密货币具有高波动性,强烈建议您深入研究后,谨慎投资!
如您认为本网站上使用的内容侵犯了您的版权,请立即联系我们(info@kdj.com),我们将及时删除。
-
- 比特币,企鹅和模因硬币:加密镇的狂野骑行
- 2025-07-31 15:55:54
- 深入研究比特币企鹅(Bpengu)的热潮,由矮胖的企鹅的Pengu领导的企鹅模因硬币浪潮以及更广泛的模因硬币市场趋势。
-
- 鲸鱼运动和山寨币:购买压力加热!
- 2025-07-31 15:54:08
- Dogecoin鲸鱼正在做出大动作,一家名为Ether Machine的公司购买了很多ETH。这对市场意味着什么?让我们潜入!
-
-
-
- XRP,AI和价格预测:解码加密未来
- 2025-07-31 13:17:06
- AI模型正在介意XRP的潜力,但是监管障碍和市场波动使价格预测成为狂野的旅程。 XRP会反抗期望吗?
-
- XRP投资:专家意见和爆炸性增长的潜力
- 2025-07-31 13:06:57
- 分析有关XRP投资的专家意见,探索价格预测,并检查可能推动大幅增长的因素。
-
- XRP价格:鲸鱼购买和象征性勺子 - 接下来是什么?
- 2025-07-31 13:00:57
- XRP价格的稳定约为3.00美元。鲸鱼购买和国库计划是否足以加剧集会?获取令牌勺和专家分析。
-
- 成像网络,RLUSD付款和分散应用程序:Web3的新时代?
- 2025-07-31 13:00:20
- 探索成像网络,RLUSD付款和分散应用程序之间的协同作用,以塑造Web3的未来。
-
- 模因硬币:长期购买和持有?解码炒作
- 2025-07-31 12:45:00
- 导航2025年的模因硬币狂热:发现长期模因硬币投资的主要见解和趋势。购买和持有是明智的吗?