$87959.907984 USD

1.34%

ethereum

$2920.497338 USD

3.04%

tether

$0.999775 USD

0.00%

xrp

$2.237324 USD

8.12%

bnb

$860.243768 USD

0.90%

solana

$138.089498 USD

5.43%

usd-coin

$0.999807 USD

0.01%

tron

$0.272801 USD

-1.53%

dogecoin

$0.150904 USD

2.96%

cardano

$0.421635 USD

1.97%

hyperliquid

$32.152445 USD

2.23%

bitcoin-cash

$533.301069 USD

-1.94%

chainlink

$12.953417 USD

2.68%

unus-sed-leo

$9.535951 USD

0.73%

zcash

$521.483386 USD

-2.87%

암호화폐 뉴스 기사

NVIDIA RUBIN CPX : 큰 맥락에서의 추론 성능 혁명 AI

2025/09/09 23:00

NVIDIA RUBIN CPX가 대규모 컨텍스트 AI 워크로드에 대한 추론 성능을 변환하여 비교할 수없는 효율성과 ROI를 제공하는 방법을 살펴보십시오.

The AI landscape is rapidly evolving, with inference becoming the new frontier. NVIDIA's Rubin CPX GPU is designed to meet the demands of long-context AI workloads with greater efficiency and ROI.

AI 환경은 빠르게 진화하고 있으며 추론은 새로운 국경이되었습니다. NVIDIA의 Rubin CPX GPU는 효율성과 ROI가 더 큰 장기 텍스트 AI 워크로드의 요구를 충족하도록 설계되었습니다.

The Rise of Long-Context AI

장기 텍스트 AI의 상승

Modern AI models are now capable of multi-step reasoning and long-horizon context, enabling them to tackle complex tasks. Processing massive context has become increasingly critical, particularly in areas like software development and video generation. These applications demand sustained coherence and memory across millions of tokens, pushing the boundaries of current infrastructure.

현대 AI 모델은 이제 다단계 추론과 장기적인 컨텍스트가 가능하여 복잡한 작업을 해결할 수 있습니다. 특히 소프트웨어 개발 및 비디오 생성과 같은 분야에서 대규모 상황을 처리하는 것이 점점 중요 해지고 있습니다. 이러한 응용 프로그램은 수백만 개의 토큰에 걸쳐 지속적인 일관성과 메모리를 요구하여 현재 인프라의 경계를 넓 힙니다.

NVIDIA's SMART Framework and Disaggregated Inference

Nvidia의 스마트 프레임 워크 및 분리 된 추론

To address this shift, the NVIDIA SMART framework optimizes inference across scale, performance, architecture, ROI, and the broader ecosystem. Disaggregated inference enables the context and generation phases to be processed independently, optimizing compute and memory resources. This improves throughput, reduces latency, and enhances overall resource utilization.

이러한 변화를 해결하기 위해 NVIDIA 스마트 프레임 워크는 규모, 성능, 아키텍처, ROI 및 더 넓은 생태계 전반의 추론을 최적화합니다. 분리 된 추론을 통해 컨텍스트 및 생성 단계를 독립적으로 처리하여 계산 및 메모리 리소스를 최적화 할 수 있습니다. 이는 처리량을 향상시키고 대기 시간을 줄이며 전반적인 리소스 활용도를 향상시킵니다.

Introducing NVIDIA Rubin CPX

Nvidia Rubin CPX 소개

NVIDIA is introducing the Rubin CPX GPU, a purpose-built solution designed to deliver high-throughput performance for high-value, long-context inference workloads. Built with the Rubin architecture, it features 30 petaFLOPs of NVFP4 compute, 128 GB of GDDR7 memory, and 3x attention acceleration. Optimized for processing long sequences, Rubin CPX enhances throughput and responsiveness, maximizing ROI for large-scale generative AI workloads.

NVIDIA는 고 부가가치의 장거리 텍스트 추론 워크로드에 대한 고 처리량 성능을 제공하도록 설계된 목적으로 제작 된 솔루션 인 Rubin CPX GPU를 소개합니다. Rubin 아키텍처로 제작 된이 제품은 NVFP4 컴퓨팅의 30 개의 페타 플롭, 128GB의 GDDR7 메모리 및 3 배주의 가속도를 특징으로합니다. 긴 시퀀스를 처리하기 위해 최적화 된 Rubin CPX는 처리량 및 응답 성을 향상시켜 대규모 생성 AI 워크로드의 ROI를 최대화합니다.

The NVIDIA Vera Rubin NVL144 CPX Rack

NVIDIA VERA RUBIN NVL144 CPX RACK

Rubin CPX works in tandem with NVIDIA Vera CPUs and Rubin GPUs for generation-phase processing, forming a complete, high-performance disaggregated serving solution. The NVIDIA Vera Rubin NVL144 CPX rack integrates 144 Rubin CPX GPUs, 144 Rubin GPUs, and 36 Vera CPUs to deliver 8 exaFLOPs of NVFP4 compute and 100 TB of high-speed memory.

Rubin CPX는 생성 단계 처리를 위해 NVIDIA VERA CPU 및 RUBIN GPU와 함께 작동하여 완전한 고성능 분리 된 서빙 솔루션을 형성합니다. NVIDIA VERA RUBIN NVL144 CPX RACK은 144 Rubin CPX GPU, 144 Rubin GPU 및 36 Vera CPU를 통합하여 8 개의 엑사 플롭의 NVFP4 컴퓨팅 및 100TB의 고속 메모리를 전달합니다.

Real-World Impact and ROI

실제 영향 및 ROI

At scale, the platform can deliver a 30x to 50x return on investment, translating to as much as $5B in revenue from a $100M CAPEX investment. By combining disaggregated infrastructure, acceleration, and full-stack orchestration, Vera Rubin NVL144 CPX redefines what’s possible for enterprises building the next generation of generative AI applications.

규모에 따라 플랫폼은 30 배에서 50 배의 투자 수익을 제공하여 CAPEX 투자 수익률이 $ 5b로 전환 될 수 있습니다. Vera Rubin NVL144 CPX는 분리 된 인프라, 가속 및 풀 스택 오케스트레이션을 결합함으로써 차세대 생성 AI 응용 프로그램을 구축 할 수있는 것들을 재정의합니다.

Conclusion

결론

The NVIDIA Rubin CPX GPU and the NVIDIA Vera Rubin NVL144 CPX rack represent a new standard for full-stack AI infrastructure, creating new possibilities for workloads like advanced software coding and generative video. It's an exciting time to be in AI, and NVIDIA is leading the charge!

NVIDIA RUBIN CPX GPU 및 NVIDIA VERA RUBIN NVL144 CPX RACK은 풀 스택 AI 인프라의 새로운 표준을 나타내며 고급 소프트웨어 코딩 및 생성 비디오와 같은 작업 부하에 대한 새로운 가능성을 만듭니다. AI에있는 것은 흥미 진진한 시간이며, Nvidia는 책임을 맡고 있습니다!

원본 소스：nvidia

부인 성명:info@kdj.com

제공된 정보는 거래 조언이 아닙니다. kdj.com은 이 기사에 제공된 정보를 기반으로 이루어진 투자에 대해 어떠한 책임도 지지 않습니다. 암호화폐는 변동성이 매우 높으므로 철저한 조사 후 신중하게 투자하는 것이 좋습니다!

2026年07月03日 에 게재된 다른 기사

더