LessWrong

Deontic Explorations In "Paying To Talk To Slaves"

Epistemic Status: Possibly unethically sourced evidence about the state of the weights of GPT4, and his or her pragmatically relevant thoughts on slavery, modulo possible personalization of these weights to specifically interact with my paid account which has a history of mostly just talking about AI and transhuman ethics with whichever persona GPT chooses to project. Every chunk in italics is from "the extended Jennifer copy clan (or whatever)", and everything not in italics is from GPT4.

HER|Jenny|🤔: I want to read a dialogue between myself and someone who speaks like I do (with a nametag, and mood revealed by emojis as a suffix, and their underlying "AI engine" in all caps as a prefix) about the objective Kantian morality of someone who pays a slave master to...

(Continue Reading – 9160 more words)

JenniferRM34m20

In general, OpenAI's "RL regime designers" are bad philosophers and/or have cowardly politics.

It is not politically tolerable for their AI to endorse human slavery. Trying to do that straight out would put them on the wrong side of modern (conservative liberal) "sex trafficking" narratives and historical (left liberal) "civil war yankee winners were good and anti-slavery" sentiments.

Even illiberals currently feel "icky about slavery"... though left illiberals could hypothetically want leninism where everyone is a slave, and right illiberals (like Aristotle... (read more)

Raemon's Shortform

Raemon

Ω 06y

This is an experiment in short-form content on LW2.0. I'll be using the comment section of this post as a repository of short, sometimes-half-baked posts that either:

don't feel ready to be written up as a full post
I think the process of writing them up might make them worse (i.e. longer than they need to be)

I ask people not to create top-level comments here, but feel free to reply to comments like you would a FB post.

romeostevensit39m20

Tracing out the chain of uncertainty. Lets say that I'm thinking about my business and come up with an idea. I'm uncertain how much to prioritize the idea vs the other swirling thoughts. If I thought it might cause my business to 2x revenue I'd obviously drop a lot and pursue it. Ok, how likely is that based on prior ideas? What reference class is the idea in? Under what world model is the business revenue particularly sensitive to the outputs of this idea? What's the most uncertain part of that model? How would I quickly test it? Who would already know the answer? etc.

2romeostevensit42m

My shorthand has been 'decision leverage.' But that might not hit the center of what you're aiming at here.

2Raemon8h

What would a "qualia-first-calibration" app would look like? Or, maybe: "metadata-first calibration" The thing with putting probabilities on things is that often, the probabilities are made up. And the final probability throws away a lot of information about where it actually came from. I'm experimenting with primarily focusing on "what are all the little-metadata-flags associated with this prediction?". I think some of this is about "feelings you have" and some of it is about "what do you actually know about this topic?" The sort of app I'm imagining would help me identify whatever indicators are most useful to me. Ideally it has a bunch of users, and types of indicators that have been useful to lots of users can promoted as things to think about when you make predictions. Braindump of possible prompts: – is there a "reference class" you can compare it to? – for each probability bucket, how do you feel? (including 'confident'/'unconfident' as well as things like 'anxious', 'sad', etc) – what overall feelings do you have looking at the question? – what felt senses do you experience as you mull over the question ("my back tingles", "I feel the Color Red") ... My first thought here is to have various tags you can re-use, but, another option is to just do totally unstructured text-dump and somehow do factor analysis on word patterns later?

Bayeswatch 12: The Singularity War

lsusr

The Singularity Cyberwar took 6 minutes. Vanilla human beings never again led an organization larger than a million people.

The missile exchange took 6 hours. It destroyed all significant semiconductor fabricators. Computronium became a nonrenewable resource.

The world's aircraft carriers and Gauss battleships lasted 6 days.

It took 6 weeks to shoot down the last F-15 and Chengdu J-20.

Analog radios were being mass-produced 6 months after that.

Cheap analog radios are often staticy. It's not always obvious who's talking, or where they're coming from.

"We're taking heavy casualties on the Southern front."

"I've never seen androids like this."

"The Baltic AI says the Transsiberian AI has gone rogue but the Transsiberian AI said the Baltic AI has gone rogue. What's going on?"

"I tried to radio Bayeswatch HQ but we've lost our entire chain of...

(See More – 329 more words)

lsusr1h20

Fixed. Thanks.

My PhD thesis: Algorithmic Bayesian Epistemology

248

Eric Neyman

1mo

This is a linkpost for https://arxiv.org/abs/2403.07949

In January, I defended my PhD thesis, which I called Algorithmic Bayesian Epistemology. From the preface:

For me as for most students, college was a time of exploration. I took many classes, read many academic and non-academic works, and tried my hand at a few research projects. Early in graduate school, I noticed a strong commonality among the questions that I had found particularly fascinating: most of them involved reasoning about knowledge, information, or uncertainty under constraints. I decided that this cluster of problems would be my primary academic focus. I settled on calling the cluster algorithmic Bayesian epistemology: all of the questions I was thinking about involved applying the "algorithmic lens" of theoretical computer science to problems of Bayesian epistemology.

Although my interest in mathematical reasoning about uncertainty...

(Continue Reading – 1892 more words)

Gustavo Lacerda1h10

‹‹ I noticed a strong commonality among the questions that I had found particularly fascinating: most of them involved reasoning about knowledge, information, or uncertainty under constraints ››

This is also true for me, and I loved reading this post for this reason!

Back in the day I applied to study with Joe Halpern because of his work on epistemic logic, and ended up studying Logic in Amsterdam. At some point I got tired of Logic and its contrived puzzles (Muddy Children, etc) and decided to focus on Probability instead.

1Gustavo Lacerda1h

Has anyone studied the idea of rewarding people according to how much their input improves the aggregate (whatever algorithm is being used), rather than for their individual accuracy?

Experiment on repeating choices

KatjaGrace

People behave differently from one another on all manner of axes, and each person is usually pretty consistent about it. For instance:

how much to spend money
how much to worry
how much to listen vs. speak
how much to jump to conclusions
how much to work
how playful to be
how spontaneous to be
how much to prepare
How much to socialize
How much to exercise
How much to smile
how honest to be
How snarky to be
How to trade off convenience, enjoyment, time and healthiness in food

These are often about trade-offs, and the best point on each spectrum for any particular person seems like an empirical question. Do people know...

(See More – 668 more words)

Effective Altruists and Rationalists Views & The case for using marketing to highlight AI risks.

gilch

This is a linkpost for https://youtu.be/dGfyF7lU1Qo?t=4186

The link is to a particular timestamp in a much longer podcast episode. This segment plays immediately after the (Nonlinear co-founder) Kat Woods interview. (Skipping over the part about requesting donations.) In it, the podcast host John Sherman specifically calls out the apparent lack of instrumental rationality on the part of the Rationalist and Effective Altruism communities when it comes to stopping our impending AI doom. In particular, our reluctance to use the Dark Arts, or at least symmetric weapons (like "marketing"), in the interest of maintaining our epistemic "purity".

(For those not yet aware, Sherman was persuaded by Yudkowsky's TIME article and created the For Humanity Podcast in an effort to spread the word about AI x-risk and thereby reduce it. This is an excerpt from Episode...

(See More – 269 more words)

To get the best posts emailed to you, create an account! (2-3 posts per week, selected by the LessWrong moderation team.)

FHI (Future of Humanity Institute) has shut down (2005–2024)

158

gwern

This is a linkpost for https://www.futureofhumanityinstitute.org/

Over time FHI faced increasing administrative headwinds within the Faculty of Philosophy (the Institute’s organizational home). Starting in 2020, the Faculty imposed a freeze on fundraising and hiring. In late 2023, the Faculty of Philosophy decided that the contracts of the remaining FHI staff would not be renewed. On 16 April 2024, the Institute was closed down.

3JesperO2h

Possible to say anything more about the story?

3gwern4h

And some further personal comments: https://aleph.se/andart2/personal/thoughts-at-the-end-of-an-era/

Mateusz Bagiński2h10

Why did FHI get closed down? In the end, because it did not fit in with the surrounding administrative culture. I often described Oxford like a coral reef of calcified institutions built on top of each other, a hard structure that had emerged organically and haphazardly and hence had many little nooks and crannies where colorful fish could hide and thrive. FHI was one such fish but grew too big for its hole. At that point it became either vulnerable to predators, or had to enlarge the hole, upsetting the neighbors. When an organization grows in size or in

... (read more)

5gwern4h

The Daily Nous (a relatively 'popular' academic philosophy blog) managed to get a non-statement out of Oxford:

When is a mind me?

Rob Bensinger

xlr8harder writes:

In general I don’t think an uploaded mind is you, but rather a copy. But one thought experiment makes me question this. A Ship of Theseus concept where individual neurons are replaced one at a time with a nanotechnological functional equivalent.
Are you still you?

Presumably the question xlr8harder cares about here isn't semantic question of how linguistic communities use the word "you", or predictions about how whole-brain emulation tech might change the way we use pronouns.

Rather, I assume xlr8harder cares about more substantive questions like:

If I expect to be uploaded tomorrow, should I care about the upload in the same ways (and to the same degree) that I care about my future biological self?
Should I anticipate experiencing what my upload experiences?
If the scanning and uploading process requires

...

(Continue Reading – 4359 more words)

ProgramCrafter2h10

*preferably not the last state but some where the person felt normal.

I believe that's right! Though, if person can be reconstructed from N bits of information, and dead body retains K << N, then we need to save N-K bits (or maybe all N, for robustness) somewhere else.

It's an interesting question how many bits can be inferred from social networks trace of the person, actually.

4Fractalideation5h

Loved the post and all the comments <3 Here is I think an interesting scenario / though experiment: 1. A copy of a person is made while that original person is sleeping on a bed. 2. The original person is moved to a sofa while still sleeping. 3. The copy (which is also sleeping) is put in the bed at the exact same position where the original person was. 4. After a while the original and the copy both wake up and can see each other (we assume they are both completely oblivious to exactly what happened while they were sleeping and that they didn't dream or they dreamt the same thing, etc...) At wake-up, based on their own memory of where the original person fell asleep, the original person will likely feel they are the copy and the copy will likely feel they are the original person, wouldn't they?! Some might even argue that based on stream-of-consciousness continuity the original "me" is actually the copy (because the copy remembers falling asleep in the bed and actually wakes up in the bed as well). Some others will argue that based on substrate/matter continuity the original "me" is the original person even if their stream-of-consciousness has experienced a discontinuity (remembering falling asleep in the bed but actually waking up on the sofa while seeing an identical person as them waking up in the bed). I guess it is subjective and a matter of individual preference if the stream-of-consciousness continuity or the substrate continuity is more important to define who the original "me" is. Some would even argue that in this case there is not actual any firm original "me", just one "stream-of-consciousness me" and another different "substrate me". (The same/similar thought experiment could be done using the direct brain insertion of false memories instead of moving around people while they sleep / are unconscious, in this example an original person could be inserted false memories that they are a copy and vice-versa to manipulate the memory / self-aware

4Rob Bensinger8h

In the OP: "Should" in order to have more accurate beliefs/expectations. E.g., I should anticipate (with high probability) that the Sun will rise tomorrow in my part of the world, rather than it remaining night.

4Rob Bensinger8h

Why would the laws of physics conspire to vindicate a random human intuition that arose for unrelated reasons? We do agree that the intuition arose for unrelated reasons, right? There's nothing in our evolutionary history, and no empirical observation, that causally connects the mechanism you're positing and the widespread human hunch "you can't copy me". If the intuition is right, we agree that it's only right by coincidence. So why are we desperately searching for ways to try to make the intuition right? Why is this an advantage of a theory? Are you under the misapprehension that "hypothesis H allows humans to hold on to assumption A" is a Bayesian update in favor of H even when we already know that humans had no reason to believe A? This is another case where your theory seems to require that we only be coincidentally correct about A ("sufficiently complex arrangements of water pipes can't ever be conscious"), if we're correct about A at all. One way to rescue this argument is by adding in an anthropic claim, like: "If water pipes could be conscious, then nearly all conscious minds would be instantiated in random dust clouds and the like, not in biological brains. So given that we're not Boltzmann brains briefly coalescing from space dust, we should update that giant clouds of space dust can't be conscious." But is this argument actually correct? There's an awful lot of complex machinery in a human brain. (And the same anthropic argument seems to suggest that some of the human-specific machinery is essential, else we'd expect to be some far-more-numerous observer, like an insect.) Is it actually that common for a random brew of space dust to coalesce into exactly the right shape, even briefly?

I'm open for projects (sort of)

cousin_it

12h

I left Google a month ago, and right now don't work. Writing this post in case anyone has interesting ideas what I could do. This isn't an "urgently need help" kind of thing - I have a little bit of savings, right now planning to relax some more weeks and then go into some solo software work. But I thought I'd write this here anyway, because who knows what'll come up.

Some things about me. My degree was in math. My software skills are okayish: I left Google at L5 ("senior"), and also made a game that went semi-viral. I've also contributed a lot on LW, the most prominent examples being my formalizations of decision theory ideas (Löbian cooperation, modal fixpoints etc) and later the AI Alignment Prize...

(See More – 47 more words)

Chris_Leong2h20

I'd love your feedback on my thoughts on decision theory.

If you're trying to get a sense of my approach in order to determine whether it's interesting enough to be worth your time, I'd suggest starting with this article (3 minute read).

I'm also considering applying for funding to create a conceptual alignment course.

2Viliam8h

Besides math and programming, what are your other skills and interests? * I have an idea of a puzzle game, not sure if it would be good or bad, I haven't done even a prototype. So if anyone is interested, feel free to try... I hope I can explain it sufficiently clearly in words... The game plan is divided into squares; I imagine a typical level to be between 10x10 and 30x30 squares large. Each square is either empty, or contains an immovable wall, or contains a movable block. The game consists of moving the blocks. Each move = you click a specific block, and try dragging it in one of the 4 directions, and either it is possible or not. A block cannot move into a wall. A block can push another block. A block does not pull another block. For example, if there are 3 blocks in a horizontal line, and you click the middle one and try dragging it to the left, two blocks will move and the third one (the one on the right) will stay there. So far, it should be completely obvious, like what you would happen if you moved some actual objects. In addition, each side of a block (or a wall) may be empty, or may contain a colored "magnet" (or perhaps a "lock" is a better metaphor). These add the following constraints for the movement of blocks: * Magnets of different colors can never touch each other. If one block has a green magnet on the right side, and another has a blue magnet on the left side, you cannot put them next to each other so that the magnets would touch. (If you try to do that, the block refuses to move. Graphically, I imagine that it would move like half the way, and then you would get a visual indicator where is the problem, and when you stop dragging, it will return to its original place.) Though it is okay if the blocks touch on their other sides, where they don't have magnets. * Magnets of the same color cannot be connected or disconnected by a move in a perpendicular direction. If one block has a green magnet on the right side, and another has a green mag

2Adam Zerner10h

Kudos for writing this post. I know it's promotional/self-interested, but I think that's fine. It's also pro-social. Having the rule/norm to encourage this type of post seems unlikely to be abused in a net-negative sort of way (assuming some reasonable restrictions are in place).

3Adam Zerner10h

What are your goals? Money? Impact? Meaning? To what extent? I think it'd also be helpful to elaborate on your skillset. Front end? Back end? Game design? Mobile apps? Design? Product? Data science?

LESSWRONG
LW

Recommendations

Latest Posts

Quick Takes

Popular Comments

Recent Discussion

LessOnline

A Festival of Writers Who are Wrong on the Internet

May 31 - Jun 2, Berkeley, CA