市值: $2.1255T 4.27%
體積(24小時): $93.4122B 20.04%
  • 市值: $2.1255T 4.27%
  • 體積(24小時): $93.4122B 20.04%
  • 恐懼與貪婪指數:
  • 市值: $2.1255T 4.27%
加密
主題
加密植物
資訊
加密術
影片
頭號新聞
加密
主題
加密植物
資訊
加密術
影片
bitcoin
bitcoin

$87959.907984 USD

1.34%

ethereum
ethereum

$2920.497338 USD

3.04%

tether
tether

$0.999775 USD

0.00%

xrp
xrp

$2.237324 USD

8.12%

bnb
bnb

$860.243768 USD

0.90%

solana
solana

$138.089498 USD

5.43%

usd-coin
usd-coin

$0.999807 USD

0.01%

tron
tron

$0.272801 USD

-1.53%

dogecoin
dogecoin

$0.150904 USD

2.96%

cardano
cardano

$0.421635 USD

1.97%

hyperliquid
hyperliquid

$32.152445 USD

2.23%

bitcoin-cash
bitcoin-cash

$533.301069 USD

-1.94%

chainlink
chainlink

$12.953417 USD

2.68%

unus-sed-leo
unus-sed-leo

$9.535951 USD

0.73%

zcash
zcash

$521.483386 USD

-2.87%

加密貨幣新聞文章

NVIDIA RUBIN CPX:用大環境革命推理性能AI

2025/09/09 23:00

探索NVIDIA Rubin CPX如何轉化大量上下文AI工作負載的推理性能,提供無與倫比的效率和ROI。

NVIDIA RUBIN CPX:用大環境革命推理性能AI

The AI landscape is rapidly evolving, with inference becoming the new frontier. NVIDIA's Rubin CPX GPU is designed to meet the demands of long-context AI workloads with greater efficiency and ROI.

AI景觀正在迅速發展,推斷成為新的邊界。 NVIDIA的Rubin CPX GPU旨在滿足效率更高和ROI的長篇小說AI工作負載的需求。

The Rise of Long-Context AI

長篇文化AI的興起

Modern AI models are now capable of multi-step reasoning and long-horizon context, enabling them to tackle complex tasks. Processing massive context has become increasingly critical, particularly in areas like software development and video generation. These applications demand sustained coherence and memory across millions of tokens, pushing the boundaries of current infrastructure.

現在,現代的AI模型能夠進行多步推理和長遠的環境,從而使它們能夠解決複雜的任務。處理大量環境已經變得越來越關鍵,尤其是在軟件開發和視頻生成等領域。這些應用要求在數百萬個令牌上持續連貫和內存,從而突破了當前基礎設施的界限。

NVIDIA's SMART Framework and Disaggregated Inference

NVIDIA的智能框架和分類推理

To address this shift, the NVIDIA SMART framework optimizes inference across scale, performance, architecture, ROI, and the broader ecosystem. Disaggregated inference enables the context and generation phases to be processed independently, optimizing compute and memory resources. This improves throughput, reduces latency, and enhances overall resource utilization.

為了解決這一轉變,NVIDIA智能框架優化了規模,性能,體系結構,ROI和更廣泛的生態系統的推理。分解推理可以獨立處理上下文和生成階段,從而優化計算和內存資源。這可以改善吞吐量,減少延遲並增強整體資源利用率。

Introducing NVIDIA Rubin CPX

介紹NVIDIA RUBIN CPX

NVIDIA is introducing the Rubin CPX GPU, a purpose-built solution designed to deliver high-throughput performance for high-value, long-context inference workloads. Built with the Rubin architecture, it features 30 petaFLOPs of NVFP4 compute, 128 GB of GDDR7 memory, and 3x attention acceleration. Optimized for processing long sequences, Rubin CPX enhances throughput and responsiveness, maximizing ROI for large-scale generative AI workloads.

NVIDIA推出了Rubin CPX GPU,這是一種專門構建的解決方案,旨在提供高價值,長篇文化推理工作負載的高通量性能。它由Rubin建築建造,具有30個PETAFLOPS NVFP4計算,128 GB的GDDR7內存和3倍的注意加速度。魯賓CPX優化用於處理長序列,可增強吞吐量和響應性,最大程度地提高大規模生成AI工作負載的ROI。

The NVIDIA Vera Rubin NVL144 CPX Rack

NVIDIA VERA RUBIN NVL144 CPX機架

Rubin CPX works in tandem with NVIDIA Vera CPUs and Rubin GPUs for generation-phase processing, forming a complete, high-performance disaggregated serving solution. The NVIDIA Vera Rubin NVL144 CPX rack integrates 144 Rubin CPX GPUs, 144 Rubin GPUs, and 36 Vera CPUs to deliver 8 exaFLOPs of NVFP4 compute and 100 TB of high-speed memory.

Rubin CPX與NVIDIA VERA CPU和Rubin GPU一起工作,用於生成期處理,形成了完整的高性能分解分類解決方案。 NVIDIA VERA RUBIN NVL144 CPX機架集成了144個Rubin CPX GPU,144 Rubin GPU和36個Vera CPU,以提供8個Exaflops NVFP4計算和100 TB的高速記憶。

Real-World Impact and ROI

現實世界的影響和投資回報率

At scale, the platform can deliver a 30x to 50x return on investment, translating to as much as $5B in revenue from a $100M CAPEX investment. By combining disaggregated infrastructure, acceleration, and full-stack orchestration, Vera Rubin NVL144 CPX redefines what’s possible for enterprises building the next generation of generative AI applications.

在大規模上,該平台可以提供30倍至50倍的投資回報率,從1億美元的資本支出投資中的收入高達5B美元。通過結合分解的基礎架構,加速和全棧編排,Vera Rubin NVL144 CPX重新定義了企業構建下一代生成AI應用程序的可能性。

Conclusion

結論

The NVIDIA Rubin CPX GPU and the NVIDIA Vera Rubin NVL144 CPX rack represent a new standard for full-stack AI infrastructure, creating new possibilities for workloads like advanced software coding and generative video. It's an exciting time to be in AI, and NVIDIA is leading the charge!

NVIDIA RUBIN CPX GPU和NVIDIA VERA RUBIN NVL144 CPX機架代表了全堆AI基礎架構的新標準,為高級軟件編碼和生成視頻等工作負載創造了新的可能性。這是進入AI的激動人心的時刻,Nvidia正在領導這一指控!

原始來源:nvidia

免責聲明:info@kdj.com

所提供的資訊並非交易建議。 kDJ.com對任何基於本文提供的資訊進行的投資不承擔任何責任。加密貨幣波動性較大,建議您充分研究後謹慎投資!

如果您認為本網站使用的內容侵犯了您的版權,請立即聯絡我們(info@kdj.com),我們將及時刪除。

2026年07月03日 其他文章發表於