p.b.
Comments

Sorted by
Newest
No wikitag contributions to display.
6p.b.'s Shortform
1y
7
Which side of the AI safety community are you in?
p.b.6d20

For me the linked site with the statement doesn't load. And this was also the case when I first tried to access it yesterday. Seems less than ideal. 

Jacob_Hilton's Shortform
p.b.8d20

Thanks!

Jacob_Hilton's Shortform
p.b.8d20

How does this coefficient relate to the maximal slope (i.e. at the 50%-x)?

The "Length" of "Horizons"
p.b.9d20

Very possible. 

I plan to watch this a bit longer and also analyse how the trend changes with repo size. 

The "Length" of "Horizons"
p.b.14d4-3

METR time horizons tie into AI 2027 in a very narrow way: as a trend not necessarily in coding or software engineering skills in general, but in machine learning engineering specifically. I think that is hard to attack except by claiming that the trend will taper off. AI 2027 does not require unrealistic generalisation.

The reason why I think that time horizons are much more solid evidence of AI progress than earlier benchmarks is that the calculated time horizons explain the trends in AI-assisted coding over the last few years very well. For example, it's not by chance that "vibe coding" became a thing when it became a thing.

I have computed time horizon trends for more general software engineering tasks (i.e. with a bigger context) and my preliminary results point towards a logistic trend, i.e. the exponential is already tapering off. However, I am still pretty uncertain about that. 
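One simple way to check whether a time-horizon trend is still exponential or already tapering is to compare the doubling time fitted on the early half of the data with the doubling time fitted on the late half. The following is a minimal sketch of that idea, not the analysis actually used for the results above: the data is synthetic, and the function names (`fit_line`, `doubling_time`, `early_late_doubling`) and all parameter values are assumptions chosen for illustration.

```python
import math


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b * x; returns (a, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def doubling_time(years, horizons):
    """Doubling time in years of an exponential fitted to the horizons."""
    _, slope = fit_line(years, [math.log2(h) for h in horizons])
    return 1.0 / slope


def early_late_doubling(years, horizons):
    """Doubling times fitted separately on the first and second half."""
    half = len(years) // 2
    early = doubling_time(years[:half], horizons[:half])
    late = doubling_time(years[half:], horizons[half:])
    return early, late


def logistic(t, cap, rate, mid):
    """Logistic curve that saturates at `cap`, with midpoint `mid`."""
    return cap / (1.0 + math.exp(-rate * (t - mid)))


# Synthetic, made-up data (NOT real METR measurements).
years = [2019 + 0.5 * i for i in range(14)]
exp_horizons = [2.0 ** (t - 2019) for t in years]  # doubles every year
log_horizons = [logistic(t, 64.0, math.log(2), 2024.0) for t in years]

exp_early, exp_late = early_late_doubling(years, exp_horizons)
log_early, log_late = early_late_doubling(years, log_horizons)

print(f"exponential: early {exp_early:.2f}y, late {exp_late:.2f}y")
print(f"logistic:    early {log_early:.2f}y, late {log_late:.2f}y")
```

For a pure exponential the two doubling times agree; for a logistic curve the late doubling time is noticeably longer. With few and noisy data points this difference can easily be within the noise, which is one reason to stay uncertain about such a result.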

No, That's Not What the Flight Costs
p.b.21d30

And why would anybody do that?

adamzerner's Shortform
p.b.1mo20

I think babysitting a baby is not very informative about whether you would enjoy having kids. Having a kid is first and foremost about having the deepest and most meaningful emotional connection of your life. 

Take that away and you just don't have a sensible test run. It's like finding out whether you like hiking by going up and down the stairs of your apartment building all morning. 

Having kids is like having parents, except the emotional connection is stronger in the other direction. Would you rather have grown up in an orphanage if that had meant more time for your hobbies and other goals? 

Does My Appearance Primarily Matter for a Romantic Partner?
p.b.1mo105

I think the most important thing has not been mentioned yet: 

How you dress and take care of yourself is the very first and often only impression of how much you have your shit together. Having your shit together - doing the things you need to do in time and doing them well - is the most important trait in a long-term partner. 

johnswentworth's Shortform
p.b.1mo41

If the one clearly fucked up receptor copy is sufficient for your "symptoms", it seems pretty likely that one of your parents should have them too. I think there is no reason to expect a de novo mutation to be particularly likely in your case (unlike in cases that lead to severe dysfunction). And of course you can check for that by sequencing your parents.

So my money would be on the second copy also being sufficiently messed up that you have basically no fully functioning oxytocin receptors. If you have siblings and you are the only odd one in the family, you could make a pretty strong case for both copies being messed up, by showing that you are the only one with the combination of frameshift in one copy and particular SNPs in the other. (If you are not the only odd one you can make an even stronger case). 

henryaj's Shortform
p.b.2mo35

Seems a lot harder to write a post a day if one is not holed up in Lighthaven. 

15What LLMs lack
5mo
5
4On AI personhood
6mo
7
57Evidence against Learned Search in a Chess-Playing Neural Network
1y
3
4Reasoning is not search - a chess example
1y
3
9Broadly human level, cognitively complete AGI
1y
0
27The Limitations of GPT-4
2y
12
37Dall-E 3
2y
9
14Improving Mathematical Reasoning with Process Supervision
2y
3
50Gemini will bring the next big timeline update
2y
6