Thanks, nice post!
You're not alone in this concern, see posts (1,2) by me and this post by Seth Herd.
I will be publishing my research agenda and first results next week.

Reply

[-]Htarlov3y30

I'm already worried as I tested AutoGPT and looked at how it works in code and for me, it seems like it will get very good planning capabilities with the change of a model to one with a few times longer token scope (like coming soon GPT-4 version with about 32k tokens) plus small refinements. So it won't get into loops, maybe have more than one GPT-4 module for different scopes of planning like long-term strategy vs short-term strategy vs tactic vs decisions on most current task + maybe some summarization-based memory. I don't see how it wouldn't work as an agent.

Reply

[-]the gears to ascension3y1-2

This is dead obvious and not helpful to hammer into everyone's heads. Only technical solutions are of any use - ringing the alarm bell is a waste of time this late in the game, the only way to get people to understand is to give away information that need not be advertised early. Is there some part of this that is actually new, or are you continuing to ring the alarm bell about "oh no, here's a plan for how to do bad thing" for nearly no benefit?

[This comment is no longer endorsed by its author]Reply

[-]the gears to ascension3y74

I guess it is well enough known that this doesn't make it worse. I guess.

Reply

[-]gwillen3y30

I think you are not wrong to be concerned, but I also agree that this is all widely known to the public. I am personally more concerned that we might want to keep this sort of discussion out of the training set of future models; I think that fight is potentially still winnable, if we decide it has value.

Reply

Moderation Log

LESSWRONG
LW

LESSWRONG
LW

85

The Agency Overhang

85

85

Building agents out of language models

How to mitigate agency overhang concerns