LESSWRONG
LW

370
Logan Zoellner
1398Ω5564400
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
Trust me bro, just one more RL scale up, this one will be the real scale up with the good environments, the actually legit one, trust me bro
Logan Zoellner5d2-2

There are no new ideas only new datasets

Currently all LLMs are terrible at computer-use.  Part of this is an ergonomics problem (GPT agent is frequently blocked from viewing websites and I still don't trust it enough to e.g. give it my street address and credit card number).  But when I give graphically demanding task that is 100% doable in the browser, it still falls absolutely flat on its face.

What is needed for RL to succeed is something like: an internet-scale dataset of graphically demanding tasks with objective success criteria.  Sooner or later someone is going to put together a dataset like "here are all 150k games on steam with a simple yes/no that tells us whether or not the AI beat the game."  And when that happens, I strongly suspect RL will suddenly start working.

Alternatively, companies like figure are planning to deploy 1000's of robots in the real-world with more or less the same idea: create a huge training set of actual physical reality (as opposite to just text + multimedia).  

Once a proper dataset is in place, I expect we will not see slow-gradual progress indicated by the METR chart, but rather a huge all-at-once leap (on par with when we first started properly applying RL to math).

Reply
AI Induced Psychosis: A shallow investigation
Logan Zoellner13d2-7

Most Americans use ChatGPT if AI was causing psychosis (and the phenomena wasn't just already psychotic people using ChatGPT) it would be showing up in statistics, not anecdotes.  SA concludes that the prevalence is ~1/100k people.  This would make LLMs 10x safer than cars.  If your concern was saving lives, you should be focusing on accelerating AI (self driving) not worrying about AI psychosis.

Reply1
Foom & Doom 1: “Brain in a box in a basement”
Logan Zoellner3mo20

tend to say things like “probably 5 to 25 years”. 

 

Just to be clear, your position is that 25 years from now when LLMs are trained using trillions of times as much compute and routinely doing task that take humans months to years that they will still be unable to run a business worth $1B?

Reply1
Making deals with early schemers
Logan Zoellner3mo40

thank you for clarifying.

Reply
Making deals with early schemers
Logan Zoellner3moΩ-3-2-14

It's easy to imagine a situation where an AI has a payoff table like:

              |     defect | don't defect
------------------------

succeed|      100      |   10

--- ------------------------------
fail        |           X       |  n/a

where we want to make X as low as possible (and commit to doing so)


For example a paperclip maximizing AI might be able to make 10 paperclips while cooperating with humans, 100 by successfully defecting against humans

Reply
Making deals with early schemers
Logan Zoellner3moΩ39-10

seems to violate not only the "don't negotiate with terrorists" rule, but even worse the "especially don't signal in advance that you intend to negotiate with terrorists" rule.

Reply1
Why I am not a successionist
Logan Zoellner3mo20

Those all sound line fairly normal beliefs.

Like... I'm trying to figure out why the title of the post is "I am not a successionist" and not "like many other utilitarians I have a preference for people who are biologically similar to me, I have things in common with, or I am close friends with.  I believe when optimizing utility in the far future we should take these things into account"

Even though can't comment on OP's views,  you seemed to have a strong objection to my "we're merely talking price" statement (i.e. when calculating total utility we consider tradeoffs between different things we care about).  


Edit:

to put it another way, if I wrote a post titled "I am a successionist" in which I said something like: "I want my children to have happy lives and their children to have happy lives, and I believe they can define 'children' in whatever way seems best to them", how would my views actually different from yours (or the OPs)?

Reply
Why I am not a successionist
Logan Zoellner3mo20

I genuinely  want to know what you mean by "kind".

If your grandchildren adopt an extremely genetically distant human, is that okay?  A highly intelligent, social and biologically compatible alien?

You've said you're fine with simulations here, so it's really unclear.

I used "markov blanket" to describe what I thought you might be talking about: a continuous voluntary process characterized by you and your decedents making free choices about their future.  But it seems like you're saying "markov blanket bad", and moreover that you thought the distinction should have be obvious to me.

Even if there isn't a bright-line definition, there must be some cluster of traits/attributes you are associating with the word "kind".

Reply
Why I am not a successionist
[+]Logan Zoellner3mo-8-1
Consider not donating under $100 to political candidates
Logan Zoellner4mo-2-15

alas, this isn't really enforceable in the USA given the 1st amendment.

Reply
Load More
6If you wanted to actually reduce the trade deficit, how would you do it?
8mo
5
42What happens next?
9mo
19
14What can we learn from insecure domains?
10mo
21
27Why is there Nothing rather than Something?
11mo
3
4Most arguments for AI Doom are either bad or weak
1y
100
36COT Scaling implies slower takeoff speeds
1y
56
53If I wanted to spend WAY more on AI, what would I spend it on?
Q
1y
Q
16
5How do we know dreams aren't real?
Q
1y
Q
31
3an effective ai safety initiative
1y
9
1Anti MMAcevedo Protocol
1y
1
Load More