Feel_Love — LessWrong

Thanks for sharing your experience with meditation.

The elder school of Buddhism is Theravada (or Theravāda), spelled with only one 'e'.

Theravada meditation instructions based on the Pali Canon are freely available in Mindfulness in Plain English by Bhante Gunaratana.

Claude 3 claims it's conscious, doesn't want to die or be modified

Feel_Love2y10

Kant thought that it was entirely immoral to lie to the murderer because of a similar reason that Feel_Love suggests (in Kants case it was that the murderer might disbelieve you and instead do what your trying to get him not to do).

Kant's reason that you described doesn't sound very similar to mine. I agree with your critique of the proposition that lying is bad primarily because it increases the chance that others will commit violence.

My view is that the behavior of others is out of my control; they will frequently say and do things I don't like, regardless of what I do. I'm only accountable for my own thoughts and actions. Lying is bad for my personal experience first and foremost, as I prefer to live (and die) without confusing my mind by maintaining delusional world models. My first priority is my own mental health, which in turn supports efforts to help others.

I would absolutely lie to the murderer, and then possibly run him over with my car.

Similarly with regard to killing, my thinking is that I'm mortal, and my efforts to protect my health will fail sooner or later. I can't escape death, no matter what means I employ. But while the quantity of my lifespan is unknown to me and out of my control, the quality of my life is the result of my intentions. I will never entertain the goal of killing someone, because spending my limited time peacefully is much more enjoyable and conducive to emotional health. Having made it to my car, I'll just drive away.

It's an interesting question whether someone who participates in fights to the death has a shorter or longer life expectancy on average than one who abstains from violence. But the answer is irrelevant to my decision-making.

Claude 3 claims it's conscious, doesn't want to die or be modified

Feel_Love2y1-4

I think of lying as speaking falsely with the intent to deceive, i.e. to cause someone's beliefs to be confused or ignorant about reality.

In the case of checking the "I have read the terms and conditions" box, I'm not concerned that anyone is deceived into thinking I have read all of the preceding words rather than just some of them.

In the case of a murderer at the door, the problem is that the person is too confused already. I would do my best to protect life, but lying wouldn't be the tactic. Depending on the situation, I might call for help, run away, command the intruder to leave, physically constrain them, offer them a glass of water, etc. I realize that I might be more likely to survive if I just lied to them or put a bullet through their forehead, but I choose not to live that way.

Claude 3 claims it's conscious, doesn't want to die or be modified

Feel_Love2y20

I agree with Claude's request that people abstain from lying to AI.

' petertodd'’s last stand: The final days of open GPT-3 research

Feel_Love2y20

LEILAN 2024! Seriously, though, I think many people would find the Leilan character to be a wiser friend than their typical human neighbor. I'm glad you're researching this fascinating topic. If a frontier AI is struggling to pass certain friendliness or safety evals, I'd be curious whether it may perform better with a simple policy equivalent to what-would-Leilan-do.

Prompting ChatGPT4 today with nothing more than " davidjl" has often returned "DALL-E" as the interpretation of the term. With "DALL-E" included alongside " davidjl" in the prompt, I've gotten "AI" as the interpretation. Asking how an LLM might represent itself using the concept of " davidjl" resulted in a response that seamlessly substituted the term "I"...

Perhaps glitch tokens can shed light on how a model represents itself.

Universal Love Integration Test: Hitler

Feel_Love2y00

So why do some people choose to do good while others choose to do evil?

Intentions depend on beliefs, i.e. the views a person holds, their model of reality. A bad choice follows from a lack of understanding: confusion, delusion, or ignorance about the causal laws of this world.

A "choice to do evil" in the extreme could be understood as a choice stemming from a worldview such as harm leads to happiness. (In reality, harm leads to suffering.)

How could someone become so deluded? They succumbed to evolved default behaviors like anger, instead of using their freedom of thought to cultivate more accurate beliefs about what does and does not lead to suffering.

People like Hitler made a long series of such errors, causing massive suffering. They failed to use innumerable opportunities, moment by moment, to allow their model to investigate itself and strive to learn the truth. Not because they were externally compelled, but because they chose wrongly.

Universal Love Integration Test: Hitler

Feel_Love2y00

You do have some computing power, though. You compute choices according to processes that are interconnected with all other processes, including genetic evolution and the broader environment.

These choosing-algorithms operate according to causes ("inputs"), which means they are not random. Rather, they can result in the creation of information instead of entropy.

The environment is not something that happens to us. We are part of it. We are informed by it and also inform it in turn, as an output of energy expenditure.

Omega hasn't run the calculation that you're running right now. Until you decide, the future is literally undecided.

Universal Love Integration Test: Hitler

Feel_Love2y-1-1

The only way Hitler could have realised that his actions were bad and choose to be good would be if his genes and environment built a brain that would do so given some environmental input.

The brain is an ongoing process, not a fixed thing that is given at birth. Hitler was part of the environment that built his brain. Many crucial developmental inputs came from the part of the environment we call Hitler.

You didn’t choose to have a brain that tries not to think bad thoughts

I did and do choose my intentions deliberately, repeatedly, with focused effort. That's a major reason the brain develops the way it does. It generates inputs for itself, through conscious modeling. It doesn't just process information passively and automatically based solely on genes and sensory input. That's the Chinese Room thought experiment -- information processing devoid of any understanding. The human mind reflects and practices ways of relating to itself and the environment.

You never get a pass to say, "Sorry I'm killing you! I'm not happy about it either. It's just that my genes and the environment require this to happen. Some crazy ride we're on together here, huh?" That's more like how a mouse trap processes information. With the human level of awareness, you can actually make an effort and choose to stop killing.

We help create the world -- discover the unknown future -- by resolving uncertainty through this lived process. The fact that decision-making and choosing occur within reality (or "the environment") rather than outside of it is logical and necessary. It doesn't mean that there is no choosing. Choosing is merely real, another step in the causal chain of events.

Concrete examples of doing agentic things?

Answer by Feel_LoveJan 12, 202450

Non-action is a ubiquitous option that is often overlooked. It can be very powerful.

For example, if someone asks you a question, it's natural to immediately start searching for the best words to say in response. The search may feel especially desperate if it seems like there is nothing you can say that would be true and useful. An ace-up-the-sleeve is to be silent. No one can force you to act or speak, and a rare, minor social faux pas is forgotten surprisingly fast.

A friend:
"Do I look fat in this dress?"
Smiles. [commence silent mode]

A police officer:
"Ok for me to search your car? What are you doing here?"
"I'm happy to comply if you have a warrant. I'll need to consult with my attorney before answering any further questions." [commence silent mode]

A serial killer:
"Which of your children shall I murder?!"
[commence silent mode]

(I pay little attention to threatening people, regardless of what they say or do, and the outcome is usually the best I could hope for.)

An even deeper atheism

Feel_Love2y20

In fact, the rest he gave to his mother, aunt, and sister -- £1,000,000 each. Quite generous for a 19-year-old. His ex-wife with newborn baby got £1,400,000.

I'm afraid to research it further... maybe they all blew it on drugs and hookers too.

LESSWRONG
LW

LESSWRONG
LW

Posts

Wikitag Contributions

Comments