Large Language Models can Strategically Deceive their Users when Put Under Pressure.
Results from an autonomous stock-trading agent in a realistic, simulated environment. > We demonstrate a situation in which Large Language Models, trained to be helpful, harmless, and honest, can display misaligned behavior and strategically deceive their users about this behavior without being instructed to do so. Concretely, we deploy...
How about o4-mini-high? Supposedly, it's actually better than o3 at visual reasoning. I'm not expecting it to be much better. Just curious.