LESSWRONG
LW

knowsnothing
-81150
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
Alignment Implications of LLM Successes: a Debate in One Act
knowsnothing1y40

Any reason not to just run the experiment?

Reply
Alignment Implications of LLM Successes: a Debate in One Act
knowsnothing1yΩ0-12

Why not just run that experiment?

Reply
Instrumental deception and manipulation in LLMs - a case study
knowsnothing2yΩ010

Thank you for doing this. Would you mind if this is added to the Misalignment Database?

Reply
Everything Wrong with Roko's Claims about an Engineered Pandemic
knowsnothing2y72

" For the most part, Roko's posts not only fail to engage with any scientific literature on the subject, but employ an extremely naive and ultimately misleading model that does not hold up to empirical and theoretical scrutiny. "

Can be applied generally.

Reply
Does literacy remove your ability to be a bard as good as Homer?
knowsnothing2y10

Been doing this. Reading less. Writing a LOT less. Memory has improved a lot.

Reply
Do you know of any reliable DIY compendium of home physical therapy exercises?
Answer by knowsnothingSep 16, 202320

Check out: https://m.youtube.com/@BobandBrad

Reply
6 non-obvious mental health issues specific to AI safety
knowsnothing2y101

The alienation is something I felt for a bit, until I started working on my project and working with folk, talking to folk, etc. Also, been very pleasantly surprised how receptive non AI/non-tech folk are when talking to them about AI risk, as long as it's framed in a down to earth, relatable manner, introduced organically, etc.

Reply
An Ignorant View on Ineffectiveness of AI Safety
knowsnothing2y10

I disagree with this now.

Reply
The Waluigi Effect (mega-post)
knowsnothing3y45

I think a lot of people think Sydney/Bing Chat is GPT 4

Reply
We don’t trade with ants
knowsnothing3y10

Human can manipulate animals and make them do what they want. So could AI

Reply
Load More
No wikitag contributions to display.
1knowsnothing's Shortform
9mo
0
1knowsnothing's Shortform
9mo
0