Meta held its first-ever event for AI developers, LlamaCon, at the company’s headquarters in Menlo Park, where it announced that it was ready to compete with ChatGPT from OpenAI, as well as Google, AWS, and AI-as-a-service startups.
Meta Founder and CEO Mark Zuckerberg was joined by Co-Founder and CEO of Databricks, Ali Ghodsi.
This is really a big deal for Meta and the AI industry as the maker of the popular open-source Llama LLM seeks to directly monetize the incredible adoption Meta has realized. Developers just access the model from the cloud; no hardware or software to install.
But it is also a big deal for Cerebras and Groq, the two startups selected by Meta for serving fast tokens, many times faster than a GPU. (Nvidia, Cerebras and Groq are all clients of Cambrian-AI Research.) Meta did not disclose pricing, as access to the API is currently in preview, and access to Groq and Cerebras is only available by request. This is the first time either startup has landed a foothold at a hyper-scale Cloud Service Provider (CSP). And Meta has made it super easy to use; developers just select Groq or Cerebras in the API call.
Cerebras is the industry's fastest inference processor by far (~18X), but Groq is also 5-fold faster than any GPU.
“Cerebras is proud to make Llama API the fastest inference API in the world,” said Andrew Feldman, CEO and co-founder of Cerebras. “Developers building agentic and real-time apps need speed. With Cerebras on Llama API, they can build AI systems that are fundamentally out of reach for leading GPU-based inference clouds.”
Llama on Cerebras is far faster than on Google TPUs or Nvidia GPUs.
Andrew’s point is important. Inference at some 100 tokens per second is already faster than a human can read, so “one-shot” inference requests for a service like ChatGPT run just fine on GPUs. But multi-model agents and reasoning models can increase computational requirements by some 100-fold, opening an opportunity for faster inference from companies like Cerebras and Groq. Meta did not mention the third fast-inference company, SambaNova, but indicated that it is open to other compute options in the future.
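To put rough numbers on that argument, here is a back-of-the-envelope sketch in Python. It uses only the article's ballpark figures (about 100 tokens per second on a GPU, roughly 5X for Groq and 18X for Cerebras, and a 100-fold increase for agentic workloads). The 500-token answer length is an assumption made for illustration, as is treating the 100-fold compute increase as a 100-fold increase in generated tokens; none of this is measured data.

```python
# Back-of-the-envelope latency estimate; all figures are approximate.
GPU_TOKENS_PER_SECOND = 100.0   # article's ballpark for GPU serving
SPEEDUP_VS_GPU = {              # article's approximate multiples
    "GPU": 1.0,
    "Groq": 5.0,
    "Cerebras": 18.0,
}
ONE_SHOT_TOKENS = 500           # assumed length of a single chat answer
AGENTIC_MULTIPLIER = 100        # article's "some 100-fold", applied to tokens (assumption)


def generation_seconds(tokens: int, tokens_per_second: float) -> float:
    """Seconds needed to generate `tokens` at a steady output rate."""
    return tokens / tokens_per_second


def compare(tokens: int) -> dict:
    """Estimated generation time in seconds on each backend for a workload."""
    return {
        name: generation_seconds(tokens, GPU_TOKENS_PER_SECOND * speedup)
        for name, speedup in SPEEDUP_VS_GPU.items()
    }


if __name__ == "__main__":
    for label, tokens in [
        ("one-shot", ONE_SHOT_TOKENS),
        ("agentic", ONE_SHOT_TOKENS * AGENTIC_MULTIPLIER),
    ]:
        for name, seconds in compare(tokens).items():
            print(f"{label:9s} {name:9s} {seconds:8.1f} s")
```

Under these assumptions a one-shot answer takes about 5 seconds on a GPU, which keeps pace with a reader, while the 100-fold agentic workload takes over 8 minutes on a GPU versus under half a minute at the 18X rate.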
It will be interesting to see how well these two new options fare in the tokens-as-a-service world.