bitcoin
bitcoin

$107957.245065 USD

0.19%

ethereum
ethereum

$2508.355924 USD

-1.20%

tether
tether

$1.000227 USD

0.00%

xrp
xrp

$2.316526 USD

-0.45%

bnb
bnb

$665.985271 USD

0.37%

solana
solana

$172.342327 USD

-1.37%

usd-coin
usd-coin

$0.999629 USD

-0.02%

dogecoin
dogecoin

$0.222496 USD

-2.48%

cardano
cardano

$0.740686 USD

-1.75%

tron
tron

$0.269423 USD

-1.18%

sui
sui

$3.604351 USD

-1.17%

hyperliquid
hyperliquid

$33.793015 USD

4.53%

chainlink
chainlink

$15.353547 USD

-1.83%

avalanche
avalanche

$22.811071 USD

-1.87%

stellar
stellar

$0.285294 USD

-1.28%

Cryptocurrency News Video

A Causal World Model Underlying Next Token Prediction: Exploring GPT in a Controlled Environment

May 25, 2025 at 12:46 am Statistical Machine Learning

A Causal World Model Underlying Next Token Prediction: Exploring GPT in a Controlled Environment Raanan Y. Rohekar, Yaniv Gurwicz, Sungduk Yu, Estelle Aflalo, Vasudev Lal Do generative pre-trained transformer (GPT) models, trained only to predict the next token, implicitly learn a world model from which a sequence is generated one token at a time? We address this question by deriving a causal interpretation of the attention mechanism in GPT, and suggesting a causal world model that arises from this interpretation. Furthermore, we propose that GPT models, at inference time, can be utilized for zero-shot causal structure learning for input sequences and present a confidence score. Empirical evaluation is conducted in a controlled environment using the setup and rules of the Othello and Chess strategy games. A GPT, pre-trained on real-world games played with the intention of winning, is tested on out-of-distribution synthetic data consisting of sequences of random legal moves. We find that the GPT model is likely to generate legal next moves for out-of-distribution sequences for which a causal structure is encoded in the attention mechanism with high confidence. In cases for which the GPT model generates illegal moves it also fails to capture any causal structure.
Video source:Youtube

Disclaimer:info@kdj.com

The information provided is not trading advice. kdj.com does not assume any responsibility for any investments made based on the information provided in this article. Cryptocurrencies are highly volatile and it is highly recommended that you invest with caution after thorough research!

If you believe that the content used on this website infringes your copyright, please contact us immediately (info@kdj.com) and we will delete it promptly.

Other videos published on May 25, 2025