LESSWRONG
LW

Gunnar_Zarncke
10638Ω27142392032
Message
Dialogue
Subscribe

Software engineering, parenting, cognition, meditation, other
Linkedin, Facebook, Admonymous (anonymous feedback)

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
So You Think You've Awoken ChatGPT
Gunnar_Zarncke3d30

Angus: 

This person founded an investment firm that manages 2B in assets. Apparently no-one is safe from LLM-induced psychosis 😭

Reply
On the functional self of LLMs
Gunnar_Zarncke3d1-1

I don't fully understand it is easier for many LessWrongers to reinvent their own version

Well, the best way to understand something is often to (re)derive it. And the best way to make sure you have actually understood it is to explain it to somebody. Reproducing research is also a good idea. This process also avoids or uncovers errors in the original research. Sure, the risk is that your new explanation is less understandable than the official one, but that seems more like a feature than a bug to me: It might be more understandable to some people. Diversity of explanations.

Reply
On the functional self of LLMs
Gunnar_Zarncke3d20

Most brains simulate just one character (cf Player vs. Character: A Two-Level Model of Ethics), and use the life-long data about it, but brains are capable of simulating more characters - usually this is a mental health issue, but you can also think about some sort of deep sleeper agent who half-forgot his original identity. 

This seems like you'd support Steven Byrnes' Intuitive Self-Models model.

Reply
[Intuitive self-models] 1. Preliminaries
Gunnar_Zarncke11d140

The sequence has been reviewed by Scott Alexander in Practically-A-Book Review: Byrnes on Trance.

Reply
Raemon's Shortform
Gunnar_Zarncke11d71

I wonder whether this tweet by Yudkowsky is related.

Reply
RohanS's Shortform
Gunnar_Zarncke11d20

Intuitively, when I'm more tired or most stressed. I would guess that is most likely in the morning - if often have to get up earlier than I like. This excludes getting woken up unexpectedly in the middle of the night, which is known to mess with people's minds.

I tried to use my hourly Anki performance, but it seems very flat, except indeed for a dip a 6 AM, but that could be lack of data (70 samples).

 

Reply
Screwtape's Shortform
Gunnar_Zarncke11d21

Reminds me loosely of The Honest Broker.

Reply1
Gunnar_Zarncke's Shortform
Gunnar_Zarncke12d40

Yes! That's the one. Thank you.

Reply
Gunnar_Zarncke's Shortform
Gunnar_Zarncke12d40

I'm looking for a video of AI gone wrong illustrating AI risk and unusual persuasion. It starts with a hall with blinking computers where an AI voice is manipulating a janitor and it ends with a plane crashing and other emergencies. I think it was made between 2014 and 2018 and linked on LW but I can't google, perplex or o3 it. And ideas?

Reply
On the functional self of LLMs
Gunnar_Zarncke13d60

Are you implying that there is a connection between A Three-Layer Model of LLM Psychology and active inference or do you offer that just as two lenses into LLM identity? If it is the former, can you say more?

Reply
Load More
8Gunnar_Zarncke's Shortform
5y
175
Theory of Mind
10mo
(+250)
Pareto Efficiency
1y
(+52/-52)
Pareto Efficiency
1y
(+52)
Pareto Efficiency
1y
(+392)
Babble and Prune
2y
(+1264)
Has Diagram
2y
(+163)
Simulation
2y
(+9/-10)
Simulation
2y
(+443/-24)
Simulation
2y
(+174/-3)
Simulation
2y
(+646)
Load More
11[Linkpost] How Am I Getting Along with AI?
2d
0
9Hybrid model reveals people act less rationally in complex games, more predictably in simple ones
11d
0
52Project Vend: Can Claude run a small shop?
20d
7
13[Linkpost] The lethal trifecta for AI agents: private data, untrusted content, and external communication
1mo
3
34Unexpected Conscious Entities
2mo
6
13[Linkpost] The value of initiating a pursuit in temporal decision-making
4mo
0
81Mistral Large 2 (123B) seems to exhibit alignment faking
Ω
4mo
Ω
4
156Reducing LLM deception at scale with self-other overlap fine-tuning
Ω
4mo
Ω
43
63RL, but don't do anything I wouldn't do
7mo
5
13[Linkpost] Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms
8mo
0
Load More