|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Salesforce AI Research揭示了CODA-1.7B,這是一種基於擴散的代碼語言模型,為高效和高質量的代碼生成設定了新的標準。

Salesforce AI is shaking things up with CoDA-1.7B, a nifty diffusion-based language model for code! It generates code by cleverly denoising whole sequences with bidirectional context, updating multiple tokens in parallel. Let's dive in!
Salesforce AI正在用Coda-1.7b(一種基於漂亮的擴散語言模型的代碼模型)來震動事情!它通過巧妙地使用雙向上下文來生成代碼,並並行更新多個令牌。讓我們潛入!
Understanding CoDA-1.7B's Architecture and Training
了解CODA-1.7B的架構和培訓
CoDA adapts a 1.7B-parameter backbone to discrete diffusion for text. Imagine masked sequences being iteratively denoised using full-sequence attention, enabling native infilling and non-autoregressive decoding. The model card documents a three-stage pipeline: pre-training with bidirectional masking, supervised post-training, and progressive denoising at inference. Plus, they've included reproducible scripts for TPU pre-training, GPU fine-tuning, and evaluation. Talk about thorough!
CODA適應1.7B參數的骨幹,以使文本離散擴散。想像一下,掩蓋的序列是使用全面的注意力迭代地否定的,實現了天然填充和非自動回形解碼。該模型卡記錄了三階段的管道:雙向掩蔽,監督後訓練和推理時進行性降解的預訓練。另外,它們還包括可重現的TPU預培訓,GPU微調和評估的腳本。談論徹底!
How Does CoDA-1.7B Perform on Benchmarks?
CODA-1.7B如何在基准上執行?
On standard code-gen suites, CoDA-1.7B-Instruct reports some impressive numbers:
在標準代碼套件上,Coda-1.7b-Instruct報告了一些令人印象深刻的數字:
- HumanEval: 54.3%
- HumanEval+: 47.6%
- MBPP: 47.2%
- MBPP+: 63.2%
- EvalPlus aggregate: 55.4% (pass@1)
For context, the model card compares against diffusion baselines, including Dream-7B-Instruct (57.9% HumanEval), indicating CoDA’s 1.7B footprint is competitive with some 7B diffusion models while using fewer parameters. Efficiency is key, folks!
對於上下文,模型卡與擴散基線的比較,包括Dream-7b-Instruct(57.9%Humaneval),表明CODA的1.7B足跡在使用更少的參數時與某些7B擴散模型具有競爭力。效率是關鍵,伙計們!
Inference Behavior: Tuning Latency and Quality
推理行為:調整潛伏期和質量
Generation cost is governed by the number of diffusion steps. CoDA exposes knobs such as STEPS, ALG="entropy", ALG_TEMP, and block length to tune latency/quality trade-offs. Because tokens are updated in parallel under full attention, CoDA targets lower wall-clock latency at small scale compared with larger diffusion models, at comparable step budgets. It's all about finding that sweet spot!
發電成本受擴散步驟的數量約束。 CODA暴露了旋鈕,例如步驟,alg =“熵”,alg_temp和塊長度,以調整潛伏/質量的權衡。由於令牌在全面關注下並行更新,因此與較大的擴散模型相比,在可比的步驟預算中,尾co的目標是小規模的較小壁式延遲。這一切都是關於找到那個甜蜜的地方!
Deployment and Licensing: Easy Access and Usage
部署和許可:易於訪問和使用
The repository provides a FastAPI server with OpenAI-compatible APIs and an interactive CLI for local inference. Instructions include environment setup and a start_server.sh launcher. Model cards and a Hugging Face collection centralize artifacts. The checkpoints are published under CC BY-NC 4.0 on Hugging Face. Open and accessible – just how we like it!
該存儲庫為FastAPI服務器提供了與OpenAI兼容的API和用於本地推理的交互式CLI。說明包括環境設置和start_server.sh啟動器。型號卡和擁抱的面部收集集中文物。檢查站在CC BY-NC 4.0下發布,在擁抱臉上。開放且易於訪問 - 我們喜歡它!
Our Take: CoDA-1.7B is a Game Changer
我們的看法:CODA-1.7B是一個改變遊戲規則的人
CoDA-1.7B stands as a clean reference for discrete-diffusion code generation at small scale: 1.7B parameters, bidirectional denoising with parallel token updates, and a reproducible pipeline from pre-training to SFT and serving. The reported pass@1 results—HumanEval 54.3, HumanEval+ 47.6, MBPP 47.2, MBPP+ 63.2, EvalPlus aggregate 55.4—place it competitive with some 7B diffusion baselines (e.g., Dream-7B HumanEval 57.9) while using fewer parameters. Inference latency is explicitly governed by step count and decoding knobs (STEPS, entropy-style guidance), which is operationally useful for tuning throughput/quality. Plus, the release includes weights on Hugging Face and a FastAPI server/CLI for local deployment. What's not to love?
CODA-1.7B是小規模離散 - 擴散代碼生成的簡潔參考:1.7b參數,雙向Denoising具有平行令牌更新,以及從預訓練到SFT和SFT和服務的可再現管道。報告的通過@1結果 - Humaneval 54.3,HumaneVal+ 47.6,MBPP 47.2,MBPP+ 63.2,EvalPlus Crengate 55.4-與一些7b擴散基準(例如,Dream-dream-7b HumaneVal 57.9)競爭,同時使用較少的參數。推理潛伏期由步驟計數和解碼旋鈕明確控制(步驟,熵式指導),這在操作上對於調整吞吐量/質量很有用。另外,該版本包括擁抱面的權重和用於本地部署的FastAPI服務器/CLI。什麼不愛?
So, there you have it! Salesforce AI's CoDA-1.7B is making waves in the world of code generation. Who knew denoising could be so cool? Keep coding and stay curious!
所以,你有! Salesforce AI的CODA-1.7B正在代碼生成世界中浪潮。誰知道denoising可能這麼酷?保持編碼並保持好奇!
免責聲明:info@kdj.com
所提供的資訊並非交易建議。 kDJ.com對任何基於本文提供的資訊進行的投資不承擔任何責任。加密貨幣波動性較大,建議您充分研究後謹慎投資!
如果您認為本網站使用的內容侵犯了您的版權,請立即聯絡我們(info@kdj.com),我們將及時刪除。
-
- 比特幣、eCash 分叉和空投動態:深入探討加密貨幣的最新爭議
- 2026-05-03 00:52:02
- 探索最近的 eCash 分叉、其作為高風險空投的分類,以及對比特幣和加密生態系統的更廣泛影響。
-
-
- 聯準會維持利率穩定,地緣政治緊張局勢引發比特幣價格下跌
- 2026-05-01 04:04:38
- 聯準會維持利率的決定,加上中東衝突,影響了比特幣的價格。分析近期趨勢和市場反應。
-
-
-
-
-
-
































