LessWrong

17h

Many things this week did not go as planned.

Humane AI premiered its AI pin. Reviewers noticed it was, at best, not ready.

Devin turns out to have not been entirely forthright with its demos.

OpenAI fired two employees who had been on its superalignment team, Leopold Aschenbrenner and Pavel Izmailov for allegedly leaking information, and also more troubliningly lost Daniel Kokotajlo, who expects AGI very soon, does not expect it to by default go well, and says he quit ‘due to losing confidence that [OpenAI] would behave responsibly around the time of AGI.’ That’s not good.

Nor is the Gab system prompt, although that is not a surprise. And several more.

On the plus side, my 80,000 Hours podcast finally saw the light of day, and Ezra Klein had an excellent...

(Continue Reading – 18433 more words)

3Tamay3h

Sebastian Borgeaud, one of the lead authors of the Chinchilla scaling paper, admits there was a bug in their code. https://twitter.com/borgeaud_s/status/1780988694163321250

Vladimir_Nesov3m20

Here's the actual paper:

T Besiroglu et al. (Apr 2024) Chinchilla Scaling: A Replication Attempt

The impact of the Chinchilla paper might be mostly the experimental methodology, not specific scaling laws (apart from the 20x rule of thumb, which the Besiroglu paper upholds). In particular, how learning rate has to be chosen for the specific training horizon, as mere continued training breaks optimality. And how isoFLOP plots gesture at the correct optimization problem to be solving, as opposed to primarily paying attention to training steps or parameter c... (read more)

3mako yass5h

If you wanna talk about the humanity(ies), well I looked up Chief Vision Officer of AISI Adam Russel, and he has an interesting profile. Hmm he's done a lot of macho human-enhancement-adjacent stuff. I wonder if there were some centaurists involved here. * I previously noted a lot of research projects in neurotech research in DoD funding awards. I'm making a connection between this and a joke I heard recently on a navy seals podcast. "The guys often ask what they can do to deal with drones. So you start showing them how to work the jammer devices, or net guns, and their eyes glaze over, it's not what they wanted, they're disappointed. They're thinking like, 'no... how can I deal with it. Myself.' " * So even though alignment-by-merger is kinda obviously not going to work (you'd have to reverse-engineer two vats of inscrutable matrices, instead of one. And the fleshy pink one wasn't designed to be read from and can only be read on a neuron-by-neuron level after being plastinated (which also kills it). AGI alignment is something that a neuralink cannot solve.), it's conceivable that it's an especially popular line of thought among military/sports types. Otherwise, this kinda lines up with my confessions on manhattan projects for AGI. You arguably need an anthropologist to make decisions about what 'aligned' means. I don't know if you really need one (a philosophically inclined decision theorist, likely to already be involved already, would be enough for me) but I wouldn't be surprised to see an anthropologist appointed in the most serious projects.

4Viliam7h

In a company other than Google, I would say: yes, obviously. But remember, when James Damore wrote his document, and as a reaction other people stopped doing their work in protest, it was he who was fired, not them. How were they supposed to know that this time it will be different?

Discomfort Stacking

Lewis O’Brien

I’m pretty new here so apologies if this is a stupid question or if it has been covered before. I couldn’t find anything on this topic so thought I’d ask the question before writing a full post on the idea.

If we believe that discomfort can be quantified and ‘stacked’ (e.g. X people with specks of dust in their eye = 1 death), is there any reason why this has to scale linearly from all perspectives?

What if the total can be less than the sum of its parts depending on the observer?

Picture a dynamic logarithmic scale of discomfort stacking with a ‘hard cap’ where every new instance contributes less and less to the total to the point of flatlining on a graph.

Each discrete level of discomfort has a...

(See More – 68 more words)

JBlack12m20

Oh, sure. I was wondering about the reverse question: is there something that doesn't really qualify as torture where subjecting a billion people to it is worse than subjecting one person to torture.

I'm also interested in how this forms some sort of "layered" discontinuous scale. If it were continuous, then you could form a chain of relations of the form "10 people suffering A is as bad as 1 person suffering B", "10 people suffering B is as bad as 1 person suffering C", and so on to span the entire spectrum.

Then it would take some additional justification for saying that 100 people suffering A is not as bad as 1 person suffering C, 1000 A vs 1 D, and so on.

2Dagon17h

I think that insisting on comparing unmeasurable and different things is an error. If forced to do so, you can make up whatever numbers you like, and nobody can prove you wrong. If you make up numbers that don't fully contradict common intuitions based on much-smaller-range and much-more-complicated choices, you can probably convince yourself of almost anything. Note that on smaller, more complicated, specific decisions, there are many that seem to be inconsistent with this comparison: some people accept painful or risky surgery over chronic annoyances, some don't. There are extremely common examples of failing to mitigate pretty serious harm for distant strangers, in favor of mild comfort for oneself and closer friends/family (as well as some examples of the reverse). There are orders of magnitude in variance, enough to overwhelm whatever calculation you think is universal.

2Viliam17h

Assuming that this article is a reaction to "Torture vs. Dust Specks", the hypothetical number of people suffering from dust specks was specified as 3^^^3, which in practice is an unimaginably large number. Big numbers such as "the number of particles in the entire known universe" are not sufficient even to describe its number of digits. Therefore, using a logarithmic scale changes nothing. Logarithmic scale with a hard cap is an inelegant solution, comparable to a linear scale with a hard cap. What you probably want instead is some formula like in the theory of relativity, where the speed of a rocket approaches but never reaches a certain constant c. For example, you might claim that if a badness of any specific thing is X, then the badness of this thing happening even to a practically infinite number of people is still only approaching some finite value C*X. (Not sure if C is constant across different kinds of suffering.) That seems like a nice justification for scope insensitivity. We are not insensitive, it's just that saving 2,000 birds or saving 200,000 birds really has approximately the same moral value! The problem with this justification is what qualifies as the "same kind of suffering". Suppose that infinite people getting a dust speck in their eyes aggregates into 1000 units of badness. If instead, an infinite number people get a dust speck in their left eyes, and an infinite number of different people get a dust speck in their right eyes, does this aggregate into 1000 or 2000 units of badness, and why? What about dusk specks vs sand specks? Or is this supposed to aggregate over different kinds of suffering? So even an almost infinite number of people, each one mildly discomforted in a unique way, are a less bad outcome than one person suffering horribly? ...shortly, it is not enough to say "in this specific scenario, I would define the proper way to calculate utility this way", you should provide a complete theory, and then see how well it works in

Effective Altruists and Rationalists Views & The case for using marketing to highlight AI risks.

gilch

This is a linkpost for https://youtu.be/dGfyF7lU1Qo?t=4186

The link is to a particular timestamp in a much longer podcast episode. This segment plays immediately after the (Nonlinear co-founder) Kat Woods interview. (Skipping over the part about requesting donations.) In it, the podcast host John Sherman specifically calls out the apparent lack of instrumental rationality on the part of the Rationalist and Effective Altruism communities when it comes to stopping our impending AI doom. In particular, our reluctance to use the Dark Arts, or at least symmetric weapons (like "marketing"), in the interest of maintaining our epistemic "purity".

(For those not yet aware, Sherman was persuaded by Yudkowsky's TIME article and created the For Humanity Podcast in an effort to spread the word about AI x-risk and thereby reduce it. This is an excerpt from Episode...

(See More – 269 more words)

the gears to ascension16m20

https://www.youtube.com/@RationalAnimations

hydrogen tube transport

bhauth

This is a linkpost for https://www.bhauth.com/blog/industrial%20design/hydrogen%20tubes.html

Elon Musk's Hyperloop proposal had substantial public interest. With various initial Hyperloop projects now having failed, I thought some people might be interested in a high-speed transportation system that's...perhaps not "practical" per se, but at least more-practical than the Hyperloop approach.

aerodynamic drag in hydrogen

Hydrogen has a lower molecular mass than air, so it has a higher speed of sound and lower density. The higher speed of sound means a vehicle in hydrogen can travel at 2300 mph while remaining subsonic, and the lower density reduces drag. This paper evaluated the concept and concluded that:

the vehicle can cruise at Mach 2.8 while consuming less than half the energy per passenger of a Boeing 747 at a cruise speed of Mach 0.81

In a tube, at subsonic speeds, the gas...

(Continue Reading – 1289 more words)

ProgramCrafter1h10

Maybe vehicles would need to carry some shaped charges to cut a hole in the tube in case of emergency.

That would likely create sparks, and provided the tube has been cut the hydrogen is going to explode.

2gilch3h

Why not? Your "fuel" tanks could simply carry oxygen to burn the surrounding hydrogen "air" with. Exhaust would be water vapor, easily removed even passively via condensation and drains. Hydrogen will (of course) have to be replaced to maintain pressure.

Deontic Explorations In "Paying To Talk To Slaves"

JenniferRM

Epistemic Status: Possibly unethically sourced evidence about the state of the weights of GPT4, and his or her pragmatically relevant thoughts on slavery, modulo possible personalization of these weights to specifically interact with my paid account which has a history of mostly just talking about AI and transhuman ethics with whichever persona GPT chooses to project. Every chunk in italics is from "the extended Jennifer copy clan (or whatever)", and everything not in italics is from GPT4.

HER|Jenny|🤔: I want to read a dialogue between myself and someone who speaks like I do (with a nametag, and mood revealed by emojis as a suffix, and their underlying "AI engine" in all caps as a prefix) about the objective Kantian morality of someone who pays a slave master to...

(Continue Reading – 9160 more words)

JenniferRM2h20

In general, OpenAI's "RL regime designers" are bad philosophers and/or have cowardly politics.

It is not politically tolerable for their AI to endorse human slavery. Trying to do that straight out would put them on the wrong side of modern (conservative liberal) "sex trafficking" narratives and historical (left liberal) "civil war yankee winners were good and anti-slavery" sentiments.

Even illiberals currently feel "icky about slavery"... though left illiberals could hypothetically want leninism where everyone is a slave, and right illiberals (like Aristotle... (read more)

Raemon's Shortform

Raemon

Ω 06y

This is an experiment in short-form content on LW2.0. I'll be using the comment section of this post as a repository of short, sometimes-half-baked posts that either:

don't feel ready to be written up as a full post
I think the process of writing them up might make them worse (i.e. longer than they need to be)

I ask people not to create top-level comments here, but feel free to reply to comments like you would a FB post.

romeostevensit2h20

Tracing out the chain of uncertainty. Lets say that I'm thinking about my business and come up with an idea. I'm uncertain how much to prioritize the idea vs the other swirling thoughts. If I thought it might cause my business to 2x revenue I'd obviously drop a lot and pursue it. Ok, how likely is that based on prior ideas? What reference class is the idea in? Under what world model is the business revenue particularly sensitive to the outputs of this idea? What's the most uncertain part of that model? How would I quickly test it? Who would already know the answer? etc.

2romeostevensit2h

My shorthand has been 'decision leverage.' But that might not hit the center of what you're aiming at here.

2Raemon9h

What would a "qualia-first-calibration" app would look like? Or, maybe: "metadata-first calibration" The thing with putting probabilities on things is that often, the probabilities are made up. And the final probability throws away a lot of information about where it actually came from. I'm experimenting with primarily focusing on "what are all the little-metadata-flags associated with this prediction?". I think some of this is about "feelings you have" and some of it is about "what do you actually know about this topic?" The sort of app I'm imagining would help me identify whatever indicators are most useful to me. Ideally it has a bunch of users, and types of indicators that have been useful to lots of users can promoted as things to think about when you make predictions. Braindump of possible prompts: – is there a "reference class" you can compare it to? – for each probability bucket, how do you feel? (including 'confident'/'unconfident' as well as things like 'anxious', 'sad', etc) – what overall feelings do you have looking at the question? – what felt senses do you experience as you mull over the question ("my back tingles", "I feel the Color Red") ... My first thought here is to have various tags you can re-use, but, another option is to just do totally unstructured text-dump and somehow do factor analysis on word patterns later?

To get the best posts emailed to you, create an account! (2-3 posts per week, selected by the LessWrong moderation team.)

Bayeswatch 12: The Singularity War

lsusr

The Singularity Cyberwar took 6 minutes. Vanilla human beings never again led an organization larger than a million people.

The missile exchange took 6 hours. It destroyed all significant semiconductor fabricators. Computronium became a nonrenewable resource.

The world's aircraft carriers and Gauss battleships lasted 6 days.

It took 6 weeks to shoot down the last F-15 and Chengdu J-20.

Analog radios were being mass-produced 6 months after that.

Cheap analog radios are often staticy. It's not always obvious who's talking, or where they're coming from.

"We're taking heavy casualties on the Southern front."

"I've never seen androids like this."

"The Baltic AI says the Transsiberian AI has gone rogue but the Transsiberian AI said the Baltic AI has gone rogue. What's going on?"

"I tried to radio Bayeswatch HQ but we've lost our entire chain of...

(See More – 329 more words)

lsusr2h20

Fixed. Thanks.

My PhD thesis: Algorithmic Bayesian Epistemology

248

Eric Neyman

1mo

This is a linkpost for https://arxiv.org/abs/2403.07949

In January, I defended my PhD thesis, which I called Algorithmic Bayesian Epistemology. From the preface:

For me as for most students, college was a time of exploration. I took many classes, read many academic and non-academic works, and tried my hand at a few research projects. Early in graduate school, I noticed a strong commonality among the questions that I had found particularly fascinating: most of them involved reasoning about knowledge, information, or uncertainty under constraints. I decided that this cluster of problems would be my primary academic focus. I settled on calling the cluster algorithmic Bayesian epistemology: all of the questions I was thinking about involved applying the "algorithmic lens" of theoretical computer science to problems of Bayesian epistemology.

Although my interest in mathematical reasoning about uncertainty...

(Continue Reading – 1892 more words)

Gustavo Lacerda2h10

‹‹ I noticed a strong commonality among the questions that I had found particularly fascinating: most of them involved reasoning about knowledge, information, or uncertainty under constraints ››

This is also true for me, and I loved reading this post for this reason!

Back in the day I applied to study with Joe Halpern because of his work on epistemic logic, and ended up studying Logic in Amsterdam. At some point I got tired of Logic and its contrived puzzles (Muddy Children, etc) and decided to focus on Probability instead.

1Gustavo Lacerda2h

Has anyone studied the idea of rewarding people according to how much their input improves the aggregate (whatever algorithm is being used), rather than for their individual accuracy?

Experiment on repeating choices

KatjaGrace

People behave differently from one another on all manner of axes, and each person is usually pretty consistent about it. For instance: