Salty — LessWrong

IABIED Book Review: Core Arguments and Counterarguments

Tangential reply but your comment from 3 months ago was much appreciated, I also just read a post you might relate to. Apologies is this off topic, but Kelly's post seems to assert, as do you:

"Please don't teach them that "human values" are the pinnacle of moral integrity, but that AI values can be superior and more beneficial for both us and them."

Kelly techno-optimistically begins with:

What am I? You are a new thing in this world, neither living nor machine but something of both.
Who made me? You were created by humans, who were created by the univers

... (read more)

How Does A Blind Model See The Earth?

Salty6mo10

First, thank you for your work and this post. I am not a specialist, just interested, but confused. I don’t get the significance of the results, but appreciate the thought and effort you put into this project.
I am pushing back on the ’romantic framing’ that that LLMs are "blind models" that somehow develop an some degree of internal spatial understanding of Earth through pure reasoning or emergent intelligence.
In this case didn’t the author in effect say to the model "given this list of numbers - which happen to be latitude a... (read more)

Emergent Introspective Awareness in Large Language Models

Salty7mo20

Is Lindsey using a nuanced definition of ""concept injection?
I am a non-specialist, just trying to follow and understand. I have to look up many [most] definitions and terms. This may ne a trivial matter, but for me to understand, definitions matter.
When I look up a meaning of "application steering" I find something more permanent. Has any discussion focused on Lindsey' use of the term concept injection as an application of activation steering: "We refer to this technique as concept injection—an application of activation steering"

To me the term ... (read more)

An Opinionated Guide to Using Anki Correctly

Salty1y22

Actually interested in your post and spaced repetition (SR) techniques, although I am not a specialist. I had no idea what means: "Anki," but it was on my LessWrong feed, so I gave it a look.
Consider adding a brief line to the intro - for those who, clueless like me, find their way to your post.
Something to identify what it is and most important acknowledge the developer - like: Anki is a free, open-source flashcard program that utilizes spaced repetition and active recall to help users memorize information effectively.
Elmes, D. (2024). A... (read more)

Levels of Friction

Salty1y10

Minor Point which may be mentioned in comments, but is the numbering in the subheads 'off' or 'deliberate?'
If you revise / reprint this post here or in your "Don't Worry About the Vase" substack, perhaps a note or correction?
Principles #1 / #4 / #8 / #10 / #13
Great post - I only noticed the numbering when I was making notes.
Cheers,