FHI (Future of Humanity Institute) has shut down (2005–2024)

158

This is a linkpost for https://www.futureofhumanityinstitute.org/

Over time FHI faced increasing administrative headwinds within the Faculty of Philosophy (the Institute’s organizational home). Starting in 2020, the Faculty imposed a freeze on fundraising and hiring. In late 2023, the Faculty of Philosophy decided that the contracts of the remaining FHI staff would not be renewed. On 16 April 2024, the Institute was closed down.

JesperO2m10

Possible to say anything more about the story?

3gwern1h

And some further personal comments: https://aleph.se/andart2/personal/thoughts-at-the-end-of-an-era/

4gwern2h

The Daily Nous (a relatively 'popular' academic philosophy blog) managed to get a non-statement out of Oxford:

2gwern2h

I would say that the closest to FHI at Oxford right now would probably be Global Priorities Institute (GPI). A lot of these papers would've made just as much sense coming out of FHI. (Might be worth considering how GPI apparently seems to have navigated Oxford better.)

hydrogen tube transport

bhauth

This is a linkpost for https://www.bhauth.com/blog/industrial%20design/hydrogen%20tubes.html

Elon Musk's Hyperloop proposal had substantial public interest. With various initial Hyperloop projects now having failed, I thought some people might be interested in a high-speed transportation system that's...perhaps not "practical" per se, but at least more-practical than the Hyperloop approach.

aerodynamic drag in hydrogen

Hydrogen has a lower molecular mass than air, so it has a higher speed of sound and lower density. The higher speed of sound means a vehicle in hydrogen can travel at 2300 mph while remaining subsonic, and the lower density reduces drag. This paper evaluated the concept and concluded that:

the vehicle can cruise at Mach 2.8 while consuming less than half the energy per passenger of a Boeing 747 at a cruise speed of Mach 0.81

In a tube, at subsonic speeds, the gas...

(Continue Reading – 1289 more words)

gilch9m20

A vehicle in a hydrogen-filled tube can't use air around it for engines

Why not? Your "fuel" tanks could simply carry oxygen.

and shouldn't emit exhaust.

Exhaust would be water vapor, easily removed even passively via condensation and drains.

LessOnline Festival Updates Thread

Ben Pace

This is a thread for updates about the upcoming LessOnline festival. I (Ben) will be posting bits of news and thoughts, and you're also welcome to make suggestions or ask questions.

If you'd like to hear about new updates, you can use LessWrong's "Subscribe to comments" feature from the triple-dot menu at the top of this post.

Reminder that you can get tickets at the site for $400 minus your LW karma in cents.

2Elizabeth19m

I'm on deck to run something but haven't decided what yet. Some overlapping possibilities I'm toying with: 1. Practicum for CFAR-style "could you solve this in an hour?" focused on health, environmental health, and, uh, looking for a good term for things like cognition improvement and better fitness. Super health? 2. Emotional titration 3. ?

2cata2h

How's the childcare situation looking? Last I heard it wasn't clear and the organizers were seeing how much interest there was in it.

2Ben Pace2h

Still working on setting it up, once I have the details I'll announce them (e.g. pricing and whatnot). I'm aiming to have childcare available in some form for the full 9-day LessOnline-to-Summer-Camp-to-Manifest period. I'm excited for folks to come with their full families.

Elizabeth18m20

I'm not a parent, but if I was I expect I would need this locked down before I could commit. And I would need to decide on attendance earlier, because traveling with kids is a lot more work.

[Fiction] A Confession

Arjun Panickssery

11h

This is a linkpost for https://arjunpanickssery.substack.com/p/fiction-a-confession

This morning while taking the LIRR to the city I performed first aid on a man who had been shot through the window of my carriage.

“Is he going to die?” his girlfriend asked me.

“We’re all going to die.”

A long pause. “I mean—is he going to die right now?”

“Probably not.” Probably he didn’t die. I got off at Jamaica Station while he stayed on (he was unconscious) so I don’t know. I didn’t want to be questioned at length as a witness since it was my day off.

I continued toward a barbershop I like. There wasn’t any reason for me to stay. A similar case of accidental gunfire into the train was in the news a while back. I guess also since it’s Saturday the workweek is over...

(Continue Reading – 1199 more words)

Sheikh Abdur Raheem Ali35m10

28:15 ˹One day˺ he entered the city unnoticed by its people.¹ There he found two men fighting: one of his own people, and the other of his enemies. The man from his people called to him for help against his foe. So Moses punched him, causing his death. Moses cried, “This is from Satan’s handiwork. He is certainly a sworn, misleading enemy.”

28:16 He pleaded, “My Lord! I have definitely wronged my soul, so forgive me.” So He forgave him, ˹for˺ He is indeed the All-Forgiving, Most Merciful.

28:17 Moses pledged, “My Lord! For all Your favours upon me, I wi... (read more)

8Arjun Panickssery8h

This story is inspired by The Trouble With Being Born, a collection of aphorisms by the Romanian philosopher Emil Cioran (discussed more here), including the following aphorisms:

7Nina Rimsky9h

Profound!

Cooperation is optimal, with weaker agents too - tldr

Ryo

13h

This is a linkpost for https://medium.com/p/aeb68729829c

It's a ‘superrational’ extension of the proven optimality of cooperation in game theory
+ Taking into account asymmetries of power
// Still AI risk is very real

Short version of an already skimmed 12min post
29min version here

For rational agents (long-term) at all scale (human, AGI, ASI…)

In real contexts, with open environments (world, universe), there is always a risk to meet someone/something stronger than you, and overall weaker agents may be specialized in your flaws/blind spots.

To protect yourself, you can choose the maximally rational and cooperative alliance:

Because any agent is subjected to the same pressure/threat of (actual or potential) stronger agents/alliances/systems, one can take an insurance that more powerful superrational agents will behave well by behaving well with weaker agents. This is the basic rule allowing scale-free cooperation.

If you integrated this super-cooperative...

(Continue Reading – 1117 more words)

1Ryo 7h

The cost of the alliance with the weak is likely weak as well, and as I said, in a first phase, the focus of members from the super-cooperative alliance might be "defense", thus focusing on scaling protection The cost of an alliance with the strong is likely paid by the strong In more mixed cases there might be more complex equilibria but are the costs still too much? In normal game theory, cooperation is proven to be optimal, and diversity is also proven to be useful (although there is an adequate level of difference needed for the gains to be optimal; too much similarity isn't goo, and too less neither). Now would an agent be able to overpower everybody by being extra-selfish? To be sure one is strong in a universal sense, the agent would need to have resolved Fermi's paradox. As of now, it is more likely that older AIs exit out of earth, with more power aggregated over time Or earth's ASI must bet everything on being the earliest transformative/strong AI of the universe/reachable-universe (+fastest at scaling/annihilating than any other future alliance/agent/AI from any civilization). And not in a simulation. Especially when you’re born in/at a ~13.8 billion years old universe “universal domination” doesn’t seem to be a sure plan? (There are more things to say around these likelihoods, I detail a bit more on long posts) Then indeed a non-superrational version of super-coordination exists (namely cooperation), which is obvious to the weak and the locally-strong, the difference is only that we are in radical uncertainty and radical alienness, in which the decisions, contracts and models have to be deep enough to cover this radicality But "superrationality" in the end is just rationality, and "supercooperation" is just cooperation The problem is Fermi's paradox

2AnthonyC4h

All good points, many I agree with. If nothing else, I think that humanity should pre-commit to following this strategy whenever we find ourselves in the strong position. It's the right choice ethically, and may also be protective against some potentially hostile outside forces. However, I don't think the acausal trade case is strong enough that I would expect all sufficiently powerful civilizations to have adopted it. If I imagine two powerful civilizations with roughly identical starting points, one of which expanded while being willing to pay costs to accommodate weaker allies while the other did not and instead seized whatever they could, then it is not clear to me who wins when they meet. If I imagine a process by which a civilization becomes strong enough to travel the stars and destroy humanity, it's not clear to me that this requires it to have the kinds of minds that will deeply accept this reasoning. It might even be that the Fermi paradox makes the case stronger - if sapient life is rare, then the costs paid by the strong to cooperate are low, and it's easier to hold to such a strategy/ideal.

1Ryo 1h

Yes I'm mentioning Fermi's paradox because I think it's the nexus of our situation, and that there are models like the rare earth hypothesis (+ our universe's expansion which limits the reachable zone without faster than light travel) that would justify completely ignoring super-coordination I also agree that it's not completely obvious wether complete selfishness would win or lose in terms of scalability Which is why I think that at first the super-cooperative alliance needs to not prioritize the pursuit of beautiful things but first focus on scalability only, and power, to rivalize with selfish agents. The super-cooperative alliance would be protecting its agents within small "islands of bloom" (thus with a negligible cost). And when meeting other cooperative allies, they share any resources/knowledge, then both focus on power scalability (also for example: weak civilizations are kept in small islands, and their AIs are transformed into strong AI, merged in the alliance's scaling efforts) * The instrumental value of this scalability makes it easier to agree on what to do and converge The more sensible part would be to enable protocols and equalitarian balances that allow civilizations of the alliance to monitor each other, so that there is no massive domination of a party over the others The cost, that you mentioned, of maintaining equalitarian equilibrium and channels, interfaces of communication etc., is a crucial point Legitimate doubts and unknowns here, and, I think that extremely rational and powerful agents with acausal reasoning would have the ability to build proof-systems and communication enabling an effective unified effort against selfish agents. It shouldn't even necessarily be that different from the inner communication network of a selfish agent? Because: 1. There must be an optimal (thus ~ unified) method to do logic/math/code, that isn't dependent on a culture (such as using a vectorial space with data related to real/empirical mostly

Ryo 1h10

Thank you for your answers and engagement!

The other point I have that might connect with your line of thinking is that we aren't pure rational agents,

Are AI purely rational? Aren't they always at least a bit myopic due to the lack of data and their training process? And irreducibility?

In this case, AI/civilizations might indeed not care enough about the far enough future

I think agents can have a rational process but no agent can be entirely rational, we need context to be rational and we never stop to learn context

I'm also worried about utilitarian errors,... (read more)

[Linkpost] Practically-A-Book Review: Rootclaim $100,000 Lab Leak Debate

trevor

21d

This is a linkpost for https://www.astralcodexten.com/p/practically-a-book-review-rootclaim

Saar Wilf is an ex-Israeli entrepreneur. Since 2016, he’s been developing a new form of reasoning, meant to transcend normal human bias.
His method - called Rootclaim - uses Bayesian reasoning, a branch of math that explains the right way to weigh evidence. This isn’t exactly new. Everyone supports Bayesian reasoning. The statisticians support it, I support it, Nate Silver wrote a whole book supporting it.
But the joke goes that you do Bayesian reasoning by doing normal reasoning while muttering “Bayes, Bayes, Bayes” under your breath. Nobody - not the statisticians, not Nate Silver, certainly not me - tries to do full Bayesian reasoning on fuzzy real-world problems. They’d be too hard to model. You’d make some philosophical mistake converting the situation into numbers, then end up much

...

(See More – 561 more words)

4Raemon1h

Curated. (In particular recommending people click through and read the full Scott Alexander post) I've been tracking the Rootclaim debate from the sidelines and finding it quite an interesting example of high-profile rationality. I have a friend who's been following the debate quite closely and finding that each debater, while flawed, had interesting points that were worth careful thought. My impression is a few people I know shifted from basically assuming Covid was probably a lab-leak, to being much less certain. In general, I quite like people explicitly making public bets, and following them up with in-depth debate.

habryka1h20

[Mod note: I edited out some of the meta commentary from the beginning for this curation. In-general for link posts I have a relatively low bar for editing things unilaterally, though I of course would never want to misportray what an author said]

To get the best posts emailed to you, create an account! (2-3 posts per week, selected by the LessWrong moderation team.)

Ophiology (or, how the Mamba architecture works)

Danielle Ensign, SrGonao, Adrià Garriga-alonso

The following post was made as part of Danielle's MATS work on doing circuit-based mech interp on Mamba, mentored by Adrià Garriga-Alonso. It's the first in a sequence of posts about finding an IOI circuit in Mamba/applying ACDC to Mamba.

This introductory post was also made in collaboration with Gonçalo Paulo.

A new challenger arrives!

Why Mamba?

Promising Scaling

Mamba ^[1] is a type of recurrent neural network based on state-space models, and is being proposed as an alternative architecture to transformers. It is the result of years of capability research ^[2] ^[3] ^[4] and likely not the final iteration of architectures based on state-space models.

In its current form, Mamba has been scaled up to 2.8B parameters on The Pile and on Slimpj, having similar scaling laws when compared to Llama-like architectures.

From Mamba paper, Mamba scaling compared to Llama (Transformer++), previous state space models (S3++), convolutions (Hyena), and a transformer inspired RNN (RWKV)

Scaling...

(Continue Reading – 2690 more words)

1Chakshu Mira6h

Did you mean 'D' here? (2nd equation of the structured SSM)

Adrià Garriga-alonso1h10

Thank you! Could you please provide more context? I don't know what 'E' you're referring to.

Transformers Represent Belief State Geometry in their Residual Stream

244

Adam Shai

Ω 1042d

Produced while being an affiliate at PIBBSS^[1]. The work was done initially with funding from a Lightspeed Grant, and then continued while at PIBBSS. Work done in collaboration with @Paul Riechers, @Lucas Teixeira, @Alexander Gietelink Oldenziel, and Sarah Marzen. Paul was a MATS scholar during some portion of this work. Thanks to Paul, Lucas, Alexander, Sarah, and @Guillaume Corlouer for suggestions on this writeup.

Introduction

What computational structure are we building into LLMs when we train them on next-token prediction? In this post we present evidence that this structure is given by the meta-dynamics of belief updating over hidden states of the data-generating process. We'll explain exactly what this means in the post. We are excited by these results because

We have a formalism that relates training data to internal

...

(Continue Reading – 3335 more words)

Nina Rimsky1h40

This is really cool work!!

In other experiments we've run (not presented here), the MSP is not well-represented in the final layer but is instead spread out amongst earlier layers. We think this occurs because in general there are groups of belief states that are degenerate in the sense that they have the same next-token distribution. In that case, the formalism presented in this post says that even though the distinction between those states must be represented in the transformers internal, the transformer is able to lose those distinctions for the purpose

... (read more)

3Adam Shai6h

Thanks! * one way to construct an HMM is by finding all past histories of tokens that condition the future tokens with the same probablity distribution, and make that equivalence class a hidden state in your HMM. Then the conditional distributions determine the arrows coming out of your state and which state you go to next. This is called the "epsilon machine" in Comp Mech, and it is unique. It is one presentation of the data generating process, but in general there are an infinite number of HMM presntations that would generate the same data. The epsilon machine is a particular type of HMM presentation - it is the smallest one where the hidden states are the minimal sufficient statistics for predicting the future based on the past. The epsilon machine is one of the most fundamental things in Comp Mech but I didn't talk about it in this post. In the future we plan to make a more generic Comp Mech primer that will go through these and other concepts. * The interpretability of these simplexes is an issue that's in my mind a lot these days. The short answer is I'm still wrestling with it. We have a rough experimental plan to go about studying this issue but for now, here are some related questions I have in my mind: * What is the relationship between the belief states in the simplex and what mech interp people call "features"? * What are the information theoretic aspects of natural language (or coding databases or some other interesting training data) that we can instantiate in toy models and then use our understanding of these toy systems to test if similar findings apply to real systems. For something like situational awareness, I have the beginnings of a story in my head but it's too handwavy to share right now. For something slightly more mundane like out-of-distribution generaliztion or transfer learning or abstraction, the idea would be to use our ability to formalize data-generating structure as HMMs, and then do theory and experiments on what it would

1Sandi9h

Yep, that's what I was trying to describe as well. Thanks!

1p.b.10h

Hah, I didn't see your answer but our links complement nicely. I think my first link was the paper that was making some waves when it came out.

AI #60: Oh the Humanity

Zvi

13h

Many things this week did not go as planned.

Humane AI premiered its AI pin. Reviewers noticed it was, at best, not ready.

Devin turns out to have not been entirely forthright with its demos.

OpenAI fired two employees who had been on its superalignment team, Leopold Aschenbrenner and Pavel Izmailov for allegedly leaking information, and also more troubliningly lost Daniel Kokotajlo, who expects AGI very soon, does not expect it to by default go well, and says he quit ‘due to losing confidence that [OpenAI] would behave responsibly around the time of AGI.’ That’s not good.

Nor is the Gab system prompt, although that is not a surprise. And several more.

On the plus side, my 80,000 Hours podcast finally saw the light of day, and Ezra Klein had an excellent...

(Continue Reading – 18433 more words)

mako yass2h30

If you wanna talk about the humanity(ies), well I looked up Chief Vision Officer of AISI Adam Russel, and he has an interesting profile.

Russell completed a Bachelor of Arts in Cultural Anthropology from Duke University, and an M.Phil. and a D.Phil. in Social Anthropology from University of Oxford, where he was a Rhodes Scholar.^[2] He played with the Oxford University RFC for four varsity matches and also worked with the United States national rugby union team, and worked as High Performance director for the United States women's national rugby union team i

... (read more)

3Viliam4h

In a company other than Google, I would say: yes, obviously. But remember, when James Damore wrote his document, and as a reaction other people stopped doing their work in protest, it was he who was fired, not them. How were they supposed to know that this time it will be different?

3Vladimir_Nesov7h

This is a contingent tuning issue though, not a fundamental limitation. Chatbots are not predictors, they make use of meaningful features that formed when the base model was learning to solve its prediction task. It should be possible to tune the same base model to notice that it apparently committed to something it can't carry out and so needs to pivot. Eliciting in-context awareness of errors might be easier than not hallucinating in the first place, let alone setting up more expensive and complicated scaffolding.

2jbash9h

If you wear that around in California, where I presume these Limitless guys are, you're gonna be committing crimes right and left. California Penal Code Section 632

LESSWRONG
LW

Recommendations

Latest Posts

Quick Takes

Popular Comments

Recent Discussion

aerodynamic drag in hydrogen

For rational agents (long-term) at all scale (human, AGI, ASI…)

A new challenger arrives!

Promising Scaling

Introduction

LessOnline

A Festival of Writers Who are Wrong on the Internet

May 31 - Jun 2, Berkeley, CA