In the way that AIXI is an abstracted mathematical formalism for (very roughly) "a program that maximizes the expected total rewards received from the environment", what is the equivalent formalism for an abstracted next-token predictor?

Does this exist in the literature? What's it called? Where can I read about it?

The predictor looks like this:

Training: 
[some long series of 0's and 1's] --> [training some ML model on this data to minimize loss for next-token prediction] --> [some set of final weights in the ML model.]

Inference:
[Some series of 0's and 1's] --> [our trained ML Model] --> [probability distribution over 0,1 for next token.]
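To make the training/inference pipeline above concrete, here is a minimal toy stand-in for the "ML model" box: a count-based n-gram predictor over bits. This is only an illustrative sketch of the interface being described (sequence in, distribution over {0,1} out); the question's model could equally be a neural net trained to minimize cross-entropy loss, and the class and parameter names here are invented for illustration.

```python
from collections import defaultdict

class NGramBitPredictor:
    """Toy next-bit predictor: counts which bit follows each length-k context.

    Illustrative only -- any learned model with the same train/predict
    interface fits the pipeline in the question.
    """

    def __init__(self, order=3):
        self.order = order  # context length in bits
        # Laplace-smoothed counts: counts[ctx] = [#times 0 followed, #times 1 followed]
        self.counts = defaultdict(lambda: [1, 1])

    def train(self, bits):
        # "Training": sweep the sequence once, tallying bit-after-context counts.
        for i in range(self.order, len(bits)):
            ctx = tuple(bits[i - self.order:i])
            self.counts[ctx][bits[i]] += 1

    def predict(self, bits):
        # "Inference": probability distribution over {0, 1} for the next token.
        ctx = tuple(bits[-self.order:])
        c0, c1 = self.counts[ctx]
        total = c0 + c1
        return c0 / total, c1 / total
```

For example, trained on the periodic sequence 0,1,0,1,..., the model assigns high probability to 1 after the context (1, 0).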

The training data should not be random, and should be 'correlated with the reality you want to predict.' (The binary output of a real-world sensor at discrete time steps is a good example of the kind of data that's suitable.)

Any pointers?

1 Answer

I think you're looking for Solomonoff induction, which is the prediction half of AIXI (AIXI combines it with expectimax action selection).
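For reference, Solomonoff induction can be stated compactly. Roughly, the universal prior weights every program that could have generated the observed bits, with shorter programs weighted more heavily, and prediction is by conditioning:

```latex
% Universal prior over bit strings x, summing over (minimal) programs p
% that cause a universal prefix machine U to output a string beginning with x:
M(x) = \sum_{p \,:\, U(p) = x*} 2^{-|p|}

% Predictive probability for the next bit, by conditioning:
M(x_{n+1} = 1 \mid x_{1:n}) = \frac{M(x_{1:n} 1)}{M(x_{1:n})}
```

This matches the question's setup: a sequence of 0s and 1s goes in, and a probability distribution over the next bit comes out, with no reward signal involved.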

The classic textbook on it if you want to read more is Li and Vitányi's An Introduction to Kolmogorov Complexity and Its Applications.