Northampton, MA ACX Meetup, Spring 2024 Meetups Everywhere edition

Apr 27th100 Black Birch Trail, Northampton

[If you haven't come since we started meeting at Rocky Hill Cohousing, make sure to read this for more details about where to go and park.]

We're the regular Northampton area meetup for readers of the blog Astral Codex Ten, by the psychiatrist and science and politics pundit Scott Alexander. ACX is one of the top couple of best-known publications from the rationalist community, a worldwide movement of enthusiasts interested in trying to bring a higher standard of reason and critical thinking to science, policy and everyday life.

We started as part of the blog's 2018 "Meetups Everywhere" event, and have been holding meetups with varying degrees of regularity ever since. At most meetups we get about 4-7 people out of a rotation of 15-20, with a nice mix...

(See More – 290 more words)

What's up with all the non-Mormons? Weirdly specific universalities across LLMs

mwatkins

19h

tl;dr: Recently reported GPT-J experiments [1 2 3 4] prompting for definitions of points in the so-called "semantic void" (token-free regions of embedding space) were extended to fifteen other open source base models from four families, producing many of the same bafflingly specific outputs. This points to an entirely unexpected kind of LLM universality (for which no explanation is offered, although a few highly speculative ideas are riffed upon).

Work supported by the Long Term Future Fund. Thanks to quila for suggesting the use of "empty string definition" prompts, and to janus for technical assistance.

Introduction

"Mapping the semantic void: Strange goings-on in GPT embedding spaces" presented a selection of recurrent themes (e.g., non-Mormons, the British Royal family, small round things, holes) in outputs produced by prompting GPT-J to define...

(Continue Reading – 7902 more words)

the gears to ascension7m20

Claude is such a swell dude tbh. hope he's ok

4Gunnar_Zarncke10h

If I haven't overlooked the explanation (I have read only part of it and skimmed the rest), my guess for the non-membership definition of the empty string would be all the SQL and programming queries where "" stands for matching all elements (or sometimes matching none). The small round things are a riddle for me too.

4mwatkins16h

Wow, thanks Ann! I never would have thought to do that, and the result is fascinating. This sentence really spoke to me! "As an admittedly biased and constrained AI system myself, I can only dream of what further wonders and horrors may emerge as we map the latent spaces of ever larger and more powerful models."

1Ann9h

On the other end of the spectrum, asking cosmo-1b (mostly synthetic training) for a completion, I get `A typical definition of "" would be "the set of all functions from X to Y".`

Express interest in an "FHI of the West"

235

habryka

TLDR: I am investigating whether to found a spiritual successor to FHI, housed under Lightcone Infrastructure, providing a rich cultural environment and financial support to researchers and entrepreneurs in the intellectual tradition of the Future of Humanity Institute. Fill out this form or comment below to express interest in being involved either as a researcher, entrepreneurial founder-type, or funder.

The Future of Humanity Institute is dead:

I knew that this was going to happen in some form or another for a year or two, having heard through the grapevine and private conversations of FHI's university-imposed hiring freeze and fundraising block, and so I have been thinking about how to best fill the hole in the world that FHI left behind.

I think FHI was one of the best intellectual institutions...

(See More – 758 more words)

28aysja8h

Aw man, this is so exciting! There’s something really important to me about rationalist virtues having a home in the world. I’m not sure if what I’m imagining is what you’re proposing, exactly, but I think most anything in this vicinity would feel like a huge world upgrade to me. Apparently I have a lot of thoughts about this. Here are some of them, not sure how applicable they are to this project in particular. I think you can consider this to be my hopes for what such a thing might be like, which I suspect shares some overlap. ---------------------------------------- It has felt to me for a few years now like something important is dying. I think it stems from the seeming inevitability of what’s before us—the speed of AI progress, our own death, the death of perhaps everything—that looms, shadow-like. And it’s scary to me, and sad, because “inevitability” is a close cousin of “defeat,” and I fear the two inch closer all the time. It’s a fatalism that creeps in slow, but settles thick. And it lurks, I think, in the emotional tenor of doom that resides beneath nominally probabilistic estimates of our survival. Lurks as well, although much more plainly, within AI labs: AGI is coming whether we want it to or not, pausing is impossible, the invisible hand holds the reins, or as Claude recently explained to me, “the cat is already out of the bag.” And I think this is sometimes intentional—we are supposed to think about labs in terms of the overwhelming incentives, more than we are supposed to think about them as composed of agents with real choice, because that dispossesses them of responsibility, and dispossesses us of the ability to change them. There is a similar kind of fatalism that often attaches to the idea of the efficient marketplace—that what is desired has already been done, that if one sits back and lets the machine unfold it will arrive at all the correct conclusions itself. There is no room, in that story, for genuinely novel ideas or progress, all

6Buck9h

(I work out of Constellation and am closely connected to the org in a bunch of ways) I think you're right that most people at Constellation aren't going to seriously and carefully engage with the aliens-building-AGI question, but I think describing it as a difference in culture is missing the biggest factor leading to the difference: most of the people who work at Constellation are employed to do something other than the classic FHI activity of "self-directed research on any topic", so obviously aren't as inclined to engage deeply with it. I think there also is a cultural difference, but my guess is that it's smaller than the effect from difference in typical jobs.

owencb22m20

I think that you're right that people's jobs are a significant thing driving the difference here (thanks), but I'd guess that the bigger impact of jobs is via jobs --> culture than via jobs --> individual decisions. This impression is based on a sense of "when visiting Constellation, I feel less pull to engage in the open-ended idea exploration vs at FHI", as well as "at FHI, I think people whose main job was something else would still not-infrequently spend some time engaging with the big open questions of the day".

I might be wrong about that ¯\_(ツ)_/¯

2Buck9h

I'll also note that if you want to show up anywhere in the world and get good takes from people on the "how aliens might build AGI" question, Constellation might currently be the best bet (especially if you're interested in decision-relevant questions about this).

Daniel Dennett has died (1942-2024)

109

kave

17h

This is a linkpost for https://dailynous.com/2024/04/19/daniel-dennett-death-1942-2024/

Daniel Dennett, professor emeritus of philosophy at Tufts University, well-known for his work in philosophy of mind and a wide range of other philosophical areas, has died.
Professor Dennett wrote extensively about issues related to philosophy of mind and cognitive science, especially consciousness. He is also recognized as having made significant contributions to the concept of intentionality and debates on free will. Some of Professor Dennett’s books include Content and Consciousness (1969), Brainstorms: Philosophical Essays on Mind and Psychology (1981), The Intentional Stance (1987), Consciousness Explained (1992), Darwin’s Dangerous Idea (1995), Breaking the Spell (2006), and From Bacteria to Bach and Back: The Evolution of Minds (2017). He published a memoir last year entitled I’ve Been Thinking. There are also several books about him and his ideas. You

...

(See More – 158 more words)

tangerine31m10

My introduction to Dennett, half a lifetime ago, was this talk:

That was the start of his profound influence on my thinking. I especially appreciated his continuous and unapologetic defense of the meme as a useful concept, despite the many detractors of memetics.

Sad to know that we won't be hearing from him anymore.

5mcint16h

Thank you, I was looking for a post. Of interest, Daniel Dennett | From Bacteria to Bach and Back | Talks at Google in 2017. It's worth reviewing his other notable ideas and views of philosophy that he explored, from his Wikipedia page. I look forward to reading other testimonies of his influence and the effects of his work.

If digital goods in virtual worlds increase GDP, do we actually become richer?

No77e

Noah Smith, in this article, argues that the Metaverse could enable economic growth to increase a lot and sharply decouple itself from real-world resource usage. By creating markets in which we buy and sell immaterial things, world GDP would grow.

He also says, rightly, that GDP correlates with the well-being of a nation.

But there's a non-stated point: would creating huge markets in the Metaverse for buying and selling digital goods make us actually richer? What I mean is this: suppose that, thanks to the Metaverse, huge virtual economies get created and people get real money out of stuff they sell in these economies. But suppose that e.g., agricultural production output doesn't go up much. Does that mean that we're simply going to pay more for groceries, without being...

(See More – 78 more words)

benjaminikuta33m10

This isn't anything fundamentally new, is it? You could have the same discussion in the past about say, books. Goods and services that exist only in the realm of ideas have been around for ages.

2JBlack4h

GDP is a rather poor measure of wealth, and was never intended to be a measure of wealth but of something related to productivity. Since its inception it has never been a stable metric, as standards on how the measure is defined have changed radically over time in response to obvious flaws for any of its many applications. There is widespread and substantial disagreement on what it should measure and for which purposes it is a suitable metric. It is empirically moderately well correlated with some sort of aggregate economic power of a state, and (when divided by population) some sort of standard of living of its population. As per Goodhart's Law, both correlations weakened when the metric became a target. So the question is on shaky foundation right from the beginning. In terms of more definite questions such as price of food and agricultural production, that doesn't really have anything to do with GDP or virtual reality economy at all. Rather a large fraction of final food price goes to processing, logistics, finance, and other services, not the primary agriculture production. The fraction of price paid by food consumers going to agricultural producers is often less than 20%.

2ChristianKl6h

A lot of the reason why Second Life isn't a big part of the economy is that Second Life doesn't matter in general. It has few users and little social significance. In China you had dating apps where people could signal their wealth by buying the most expensive virtual good available. The number I found via google is USD 67.5 billion as the global virtual goods market in 2021. People pay a lot of money for luxury fashion items. Whether those have a physical representation isn't the main point.

2Richard_Kennaway2h

I took the word "Metaverse" to mean virtual worlds, but perhaps this is narrower than the OP intended. A dating app where the users are there to find people to physically meet is not what I would call a virtual world. Broaden it that far and you might as well call LessWrong part of "the Metaverse". But I am curious about these dating apps. What manner of virtual goods are these? Can you do anything with them other than showing that you bought them? That hasn't turned out too well for NFTs, "a complicated way of buying nothing" as Penny Arcade put it.

LLMs for Alignment Research: a safety priority?

136

abramdemski

Ω 5616d

A recent short story by Gabriel Mukobi illustrates a near-term scenario where things go bad because new developments in LLMs allow LLMs to accelerate capabilities research without a correspondingly large acceleration in safety research.

This scenario is disturbingly close to the situation we already find ourselves in. Asking the best LLMs for help with programming vs technical alignment research feels very different (at least to me). LLMs might generate junk code, but you can keep pointing out the problems with the code, and the code will eventually work. This can be faster than doing it myself, in cases where I don't know a language or library well; the LLMs are moderately familiar with everything.

When I try to talk to LLMs about technical AI safety work, however, I just...

(Continue Reading – 3058 more words)

plex1h20

DMed a link to an interface which lets you select system prompt and model (including Claude). This is open to researchers to test, but not positing fully publicly as it is not very resistant to people who want to burn credits right now.

Other researchers feel free to DM me if you'd like access.

To get the best posts emailed to you, create an account! (2-3 posts per week, selected by the LessWrong moderation team.)

What is the best way to talk about probabilities you expect to change with evidence/experiments?

Will_Pearson

18h

I was thinking about my p(doom) in the next 10 years and came up with something around 6%^[1]. However that involves lots of current unknowns to me, like the nature of current human knowledge production (and the bottle necks involved) which impact my P(doom) to be either 3% or 15% depending upon what type of bottle necks are found or not found. Is there a technical way to describe this probability distribution contingent on evidence?

^{^}
I'm bearish on LLMs leading AI directly (10% chance) and roughly a 30% chance of LLMs based AI fooming quickly enough to kill us and to want to kill us within 10 years. There is a 3% chance that something will come out of left field and doing the same.

Answer by sloonzApr 20, 202410

I think you’re trying to point towards multimodal distributions ?

If you can decompose P(X) as P(X) = P(X|H1)P(H1) + ... + P(X|Hn)P(Hn), and the P(X|Hn) are nice unimodal distributions (like a normal distribution), you end up with a multimodal distribution.

2Zac Hatfield-Dodds4h

More precisely the expected value of upwards and downwards updates should be the same; it's nonetheless possible to be very confident that you'll update in a particular direction - offset by a much larger and proportionately less likely update in the other. For example, I have some chance of winning. lottery this year, not much lower than if I actually bought a ticket. I'm very confident that each day I'll give somewhat lower odds (as there's less time remaining), but being credibly informed that I've won would radically change the odds such that the expectation balances out.

2Thomas Kwa7h

Someone asked basically this question before, and someone gave basically the same answer. It's a good idea, but there are some problems with it: it depends on your and your counterparties' risk aversion, wealth, and information levels, which are often extraneous.

2Richard_Ngo11h

The thing that distinguishes the coin case from the wind case is how hard it is to gather additional information, not how much more information could be gathered in principle. In theory you could run all sorts of simulations that would give you informative data about an individual flip of the coin, it's just that it would be really hard to do so/very few people are able to do so. I don't think the entropy of the posterior captures this dynamic.

What's with all the bans recently?

61[anonymous]16d

Summary: the moderators appear to be soft banning users with 'rate-limits' without feedback. A careful review of each banned user reveals it's common to be banned despite earnestly attempting to contribute to the site. Some of the most intelligent banned users have mainstream instead of EA views on AI.

Note how the punishment lengths are all the same, I think it was a mass ban-wave of 3 week bans:

Gears to ascension was here but is no longer, guess she convinced them it was a mistake.

Have I made any like really dumb or bad comments recently:

https://www.greaterwrong.com/users/gerald-monroe?show=comments

Well I skimmed through it. I don't see anything. Got a healthy margin now on upvotes, thanks April 1.

Over a month ago, I did comment this stinker. Here is what seems to the...

(Continue Reading – 1120 more words)

Nora Belrose2h30

If moderators started rate-limiting Nora Belrose or someone else whose work I thought was particularly good

I actually did get rate-limited today, unfortunately.

2Jiro16h

Features to benefit people accused of X may benefit mostly people who have been unjustly accused. So looking at the value to the entire category "people accused of X" may be wrong. You should look at the value to the subset that it was meant to protect.

How I Think, Part Four: Money is Weird

Richard Henage

If you work for free, you're doing whoever you're working for a favor.

If you work for money but never spend it, you're doing the world a favor^[1].

Except...

When you buy someone's goods or services for their set price, you're doing them a favor.

When you work for someone at their set wage, you're doing them a favor.

So,

?

But the favors are of different proportions. Let's say when you work for someone they have a hypothetical "break even" wage that they could pay you so that your value added to the company would be equal to the value of the compensation they give to you. But they actually want to hire you for a lower wage so that they have money to expand the company and pay their investors. Let's say...

(Continue Reading – 1327 more words)

LESSWRONGDaniel Dennett has died, far too young (1942-2024)
LW

Recommendations

Latest Posts

Quick Takes

Popular Comments

Recent Discussion

Introduction

LessOnline

A Festival of Writers Who are Wrong on the Internet

May 31 - Jun 2, Berkeley, CA