|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Salesforce AI Research揭示了CODA-1.7B,这是一种基于扩散的代码语言模型,为高效和高质量的代码生成设定了新的标准。

Salesforce AI is shaking things up with CoDA-1.7B, a nifty diffusion-based language model for code! It generates code by cleverly denoising whole sequences with bidirectional context, updating multiple tokens in parallel. Let's dive in!
Salesforce AI正在用Coda-1.7b(一种基于漂亮的扩散语言模型的代码模型)来震动事情!它通过巧妙地使用双向上下文来生成代码,并并行更新多个令牌。让我们潜入!
Understanding CoDA-1.7B's Architecture and Training
了解CODA-1.7B的架构和培训
CoDA adapts a 1.7B-parameter backbone to discrete diffusion for text. Imagine masked sequences being iteratively denoised using full-sequence attention, enabling native infilling and non-autoregressive decoding. The model card documents a three-stage pipeline: pre-training with bidirectional masking, supervised post-training, and progressive denoising at inference. Plus, they've included reproducible scripts for TPU pre-training, GPU fine-tuning, and evaluation. Talk about thorough!
CODA适应1.7B参数的骨干,以使文本离散扩散。想象一下,掩盖的序列是使用全面的注意力迭代地否定的,实现了天然填充和非自动回形解码。该模型卡记录了三阶段的管道:双向掩蔽,监督后训练和推理时进行性降解的预训练。另外,它们还包括可重现的TPU预培训,GPU微调和评估的脚本。谈论彻底!
How Does CoDA-1.7B Perform on Benchmarks?
CODA-1.7B如何在基准上执行?
On standard code-gen suites, CoDA-1.7B-Instruct reports some impressive numbers:
在标准代码套件上,Coda-1.7b-Instruct报告了一些令人印象深刻的数字:
- HumanEval: 54.3%
- HumanEval+: 47.6%
- MBPP: 47.2%
- MBPP+: 63.2%
- EvalPlus aggregate: 55.4% (pass@1)
For context, the model card compares against diffusion baselines, including Dream-7B-Instruct (57.9% HumanEval), indicating CoDA’s 1.7B footprint is competitive with some 7B diffusion models while using fewer parameters. Efficiency is key, folks!
对于上下文,模型卡与扩散基线的比较,包括Dream-7b-Instruct(57.9%Humaneval),表明CODA的1.7B足迹在使用更少的参数时与某些7B扩散模型具有竞争力。效率是关键,伙计们!
Inference Behavior: Tuning Latency and Quality
推理行为:调整潜伏期和质量
Generation cost is governed by the number of diffusion steps. CoDA exposes knobs such as STEPS, ALG="entropy", ALG_TEMP, and block length to tune latency/quality trade-offs. Because tokens are updated in parallel under full attention, CoDA targets lower wall-clock latency at small scale compared with larger diffusion models, at comparable step budgets. It's all about finding that sweet spot!
发电成本受扩散步骤的数量约束。 CODA暴露了旋钮,例如步骤,alg =“熵”,alg_temp和块长度,以调整潜伏/质量的权衡。由于令牌在全面关注下并行更新,因此与较大的扩散模型相比,在可比的步骤预算中,尾co的目标是小规模的较小壁式延迟。这一切都是关于找到那个甜蜜的地方!
Deployment and Licensing: Easy Access and Usage
部署和许可:易于访问和使用
The repository provides a FastAPI server with OpenAI-compatible APIs and an interactive CLI for local inference. Instructions include environment setup and a start_server.sh launcher. Model cards and a Hugging Face collection centralize artifacts. The checkpoints are published under CC BY-NC 4.0 on Hugging Face. Open and accessible – just how we like it!
该存储库为FastAPI服务器提供了与OpenAI兼容的API和用于本地推理的交互式CLI。说明包括环境设置和start_server.sh启动器。型号卡和拥抱的面部收集集中文物。检查站在CC BY-NC 4.0下发布,在拥抱脸上。开放且易于访问 - 我们喜欢它!
Our Take: CoDA-1.7B is a Game Changer
我们的看法:CODA-1.7B是一个改变游戏规则的人
CoDA-1.7B stands as a clean reference for discrete-diffusion code generation at small scale: 1.7B parameters, bidirectional denoising with parallel token updates, and a reproducible pipeline from pre-training to SFT and serving. The reported pass@1 results—HumanEval 54.3, HumanEval+ 47.6, MBPP 47.2, MBPP+ 63.2, EvalPlus aggregate 55.4—place it competitive with some 7B diffusion baselines (e.g., Dream-7B HumanEval 57.9) while using fewer parameters. Inference latency is explicitly governed by step count and decoding knobs (STEPS, entropy-style guidance), which is operationally useful for tuning throughput/quality. Plus, the release includes weights on Hugging Face and a FastAPI server/CLI for local deployment. What's not to love?
CODA-1.7B是小规模离散 - 扩散代码生成的简洁参考:1.7b参数,双向Denoising具有平行令牌更新,以及从预训练到SFT和SFT和服务的可再现管道。报告的通过@1结果 - Humaneval 54.3,HumaneVal+ 47.6,MBPP 47.2,MBPP+ 63.2,EvalPlus Crengate 55.4-与一些7b扩散基准(例如,Dream-dream-7b HumaneVal 57.9)竞争,同时使用较少的参数。推理潜伏期由步骤计数和解码旋钮明确控制(步骤,熵式指导),这在操作上对于调整吞吐量/质量很有用。另外,该版本包括拥抱面的权重和用于本地部署的FastAPI服务器/CLI。什么不爱?
So, there you have it! Salesforce AI's CoDA-1.7B is making waves in the world of code generation. Who knew denoising could be so cool? Keep coding and stay curious!
所以,你有! Salesforce AI的CODA-1.7B正在代码生成世界中浪潮。谁知道denoising可能这么酷?保持编码并保持好奇!
免责声明:info@kdj.com
所提供的信息并非交易建议。根据本文提供的信息进行的任何投资,kdj.com不承担任何责任。加密货币具有高波动性,强烈建议您深入研究后,谨慎投资!
如您认为本网站上使用的内容侵犯了您的版权,请立即联系我们(info@kdj.com),我们将及时删除。
-
- 比特币、eCash 分叉和空投动态:深入探讨加密货币的最新争议
- 2026-05-03 00:52:02
- 探索最近的 eCash 分叉、其作为高风险空投的分类,以及对比特币和加密生态系统的更广泛影响。
-
-
- 美联储维持利率稳定,地缘政治紧张局势引发比特币价格下跌
- 2026-05-01 04:04:38
- 美联储维持利率的决定,加上中东冲突,影响了比特币的价格。分析近期趋势和市场反应。
-
-
-
-
-
-
































