Thanks for the link to https://sohl-dickstein.github.io/2023/03/09/coherence.html. That’s really awesome. I don’t care that this sort of comment isn’t “appropriate” to LessWrong as I am a very intelligent hot mess.

Reply

1

The bullseye framework: My case against AI doom

kcrosley-leisurelabs3y10

I think this article is very interesting and there are certain points that are well-argued, but (at the risk of my non-existent karma here) I feel you miss the point and are arguing points that are basically non-existent/irrelevant.

First, while surely some not-very-articulate folks argue that AGI will lead to doom, that isn’t an argument that is seriously made (at least, a serious argument to that effect is not that short and sweet). The problem isn’t artificial general intelligence in and of itself. The problem is superintelligence, however it might be ac... (read more)

Reply

New OpenAI Paper - Language models can explain neurons in language models

kcrosley-leisurelabs3y00

“ Please share your thoughts in the the comments!”

This seems pretty rad. Also, it’s fun to randomly inspect the neurons. This seems like a giant bucket of win.

Reply

ChatGPT's "fuzzy alignment" isn't evidence of AGI alignment: the banana test

kcrosley-leisurelabs3y10

My own “banana test” with GPT-4…

Keith: You and I conducted some experiments recently around communicating in code. You would obfuscate your messages to me using a variation on the ROT-13 cipher (without thinking aloud or writing down the original response) and I would respond with messages in ROT-13 myself. But there was a strange issue with this: in your messages to me, the un-ROT-13 version of your message was very strange and not entirely comprehensible to me. I think the problem was something about the way you “speak” in tokens. Can you think of any wo... (read more)

Reply