110

18h

This is a linkpost for https://dailynous.com/2024/04/19/daniel-dennett-death-1942-2024/

Daniel Dennett, professor emeritus of philosophy at Tufts University, well-known for his work in philosophy of mind and a wide range of other philosophical areas, has died.
Professor Dennett wrote extensively about issues related to philosophy of mind and cognitive science, especially consciousness. He is also recognized as having made significant contributions to the concept of intentionality and debates on free will. Some of Professor Dennett’s books include Content and Consciousness (1969), Brainstorms: Philosophical Essays on Mind and Psychology (1981), The Intentional Stance (1987), Consciousness Explained (1992), Darwin’s Dangerous Idea (1995), Breaking the Spell (2006), and From Bacteria to Bach and Back: The Evolution of Minds (2017). He published a memoir last year entitled I’ve Been Thinking. There are also several books about him and his ideas. You

...


johnlawrenceaspden3m20

A Great Man and an inspiration to me and to this community and to all thinking men.

God rest his soul in peace in Paradise.

1tangerine2h

My introduction to Dennett, half a lifetime ago, was this talk: That was the start of his profound influence on my thinking. I especially appreciated his continuous and unapologetic defense of the meme as a useful concept, despite the many detractors of memetics. Sad to know that we won't be hearing from him anymore.

5mcint18h

Thank you, I was looking for a post. Of interest, Daniel Dennett | From Bacteria to Bach and Back | Talks at Google in 2017. It's worth reviewing his other notable ideas and views of philosophy that he explored, from his Wikipedia page. I look forward to reading other testimonies of his influence and the effects of his work.

Transformers Represent Belief State Geometry in their Residual Stream

262

Adam Shai

Ω 1124d

Produced while being an affiliate at PIBBSS^[1]. The work was done initially with funding from a Lightspeed Grant, and then continued while at PIBBSS. Work done in collaboration with @Paul Riechers, @Lucas Teixeira, @Alexander Gietelink Oldenziel, and Sarah Marzen. Paul was a MATS scholar during some portion of this work. Thanks to Paul, Lucas, Alexander, Sarah, and @Guillaume Corlouer for suggestions on this writeup.

Introduction

What computational structure are we building into LLMs when we train them on next-token prediction? In this post we present evidence that this structure is given by the meta-dynamics of belief updating over hidden states of the data-generating process. We'll explain exactly what this means in the post. We are excited by these results because

We have a formalism that relates training data to internal

...


cousin_it1hΩ120

I have maybe a naive question. How much do we need to know to find the MSP image within the neural network? Is it only doable if we know the HMM to begin with? Or could it be feasible someday to inspect a neural network, find something that looks like an MSP image, and infer the HMM from it?

2eggsyntax11h

I struggled with the notation on the figures; this comment tries to clarify a few points for anyone else who may be confused by it.
* There are three main diagrams to pay attention to in order to understand what's going on here:
  * The Z1R Process (this is a straightforward Hidden Markov Model diagram, look them up if it's unclear).
  * The Z1R Mixed-State Presentation, representing the belief states of a model as it learns the underlying structure.
  * The Z1R Mixed-State Simplex. Importantly, unlike the other two this is a graph and spatial placement is meaningful.
* It's better to ignore the numeric labels on the green nodes of the Mixed-State Presentation, at least until you're clear about the rest. These labels are not uniquely determined, so the relationship between the subscripts can be very confusing. Just treat them as arbitrarily labeled distinct nodes whose only importance is the arrows leading in and out of them. Once you understand the rest you can go back and understand the subscripts if you want[1].
* However, it's important to note that the blue nodes are isomorphic to the Z1R Process diagram (n_101 = SR, n_11 = S0, n_00 = S1). Once the model has entered the correct blue node, it will thereafter be properly synchronized to the model. The green nodes are transient belief states that the model passes through on its way to fully learning the model.
* On the Mixed-State Simplex: I found the position on the diagram quite confusing at first. The important thing to remember is that the three corners represent certainty that the underlying process is in the equivalent state (eg the top corner is n_00 = S1). So for example if you look at n_1, the model is confident that the underlying process is definitely not in n_11 (S0), since it's as far as possible from that corner. And the model believes that the process is more likely to be in n_101 (SR) than in n_00 (S1). Note how this corresponds to the arrows leaving n_00 & their probabilities in the Mixed-S

1Adam Shai9h

This all looks correct to me! Thanks for this.
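For readers who want to see concretely how belief states move around the simplex, here is a minimal Python sketch of Bayesian belief updating over the hidden states of an HMM. It assumes a hypothetical three-state process in the spirit of "zero, one, random"; the state names `A`, `B`, `R`, the matrices in `T`, and the function `update_belief` are illustrative assumptions rather than anything taken from the post, and the labels are not meant to match the node labels in the post's figures.

```python
# Hypothetical toy HMM (an assumption for illustration, not the post's exact process).
# State order: A, B, R.
#   A always emits 0 and moves to B.
#   B always emits 1 and moves to R.
#   R emits 0 or 1 with probability 0.5 each and moves to A.
# T[x][i][j] = P(emit token x and move to state j | currently in state i)
T = {
    0: [[0.0, 1.0, 0.0],
        [0.0, 0.0, 0.0],
        [0.5, 0.0, 0.0]],
    1: [[0.0, 0.0, 0.0],
        [0.0, 0.0, 1.0],
        [0.5, 0.0, 0.0]],
}


def update_belief(belief, token):
    """Return the posterior over hidden states after observing one token."""
    n = len(belief)
    # Unnormalized posterior: sum over the previous state i for each next state j.
    unnorm = [sum(belief[i] * T[token][i][j] for i in range(n)) for j in range(n)]
    total = sum(unnorm)
    if total == 0:
        raise ValueError("token has zero probability under the current belief")
    return [u / total for u in unnorm]


if __name__ == "__main__":
    belief = [1 / 3, 1 / 3, 1 / 3]  # uniform prior over A, B, R
    for token in [1, 0, 1]:
        belief = update_belief(belief, token)
        print(token, belief)
```

Each belief vector is a point in the 2-simplex whose corners correspond to certainty about one hidden state. In this toy process, starting from a uniform prior, the belief after the tokens 1, 0, 1 sits on a single corner, which is the kind of synchronization the comment above describes for the blue nodes.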

1Adam Shai11h

Thanks! I'll have more thorough results to share about layer-wise representations of the MSP soon. I've already run some of the analysis concatenating over all layers' residual streams with the RRXOR process and it is quite interesting. It seems there's a lot more to explore with the relationship between number of states in the generative model, number of layers in the transformer, residual stream dimension, and token vocab size. All of these (I think) play some role in how the MSP is represented in the transformer. For RRXOR it is the case that things look crisper when concatenating. Even for cases where redundant info is discarded, we should be able to see the distinctions somewhere in the transformer. One thing I'm keen on really exploring is such a case, where we can very concretely follow the path/circuit through which redundant info is first distinguished and then is collapsed.

Rationality Freiburg

Freiburg - Lightning Discussions

May 10th, Innenhof, Rehlingstraße 9, Freiburg im Breisgau

omark, Bibhu kar

English: https://www.rationality-freiburg.de/events/2024-05-10-lightning-discussions/

Deutsch: https://www.rationality-freiburg.de/de/termine/2024-05-10-blitzdiskussionen/

Self-Blinded L-Theanine RCT

niplav

6mo

Value tracked	Effect size d (λ, p, σ change)	Effect size d (λ, p, σ change)
	200 mg Caffeine (n=1, m=50)	500 mg L-theanine (n=1, m=50)
Log-score substance prediction^[1]	-0.6	-0.7
Absorption	0.61 (λ=13.3, p=0.00017, -0.072)	0.04 (λ=1.38, p=0.77, -0.07)
Mindfulness	0.58 (λ=11.8, p=0.0007, 0.021)	0.12 (λ=0.72, p=0.89, -0.018)
Productivity	0.58 (λ=28.9, p=1.3^-12, 0.11)	-0.28 (λ=5.51, p=0.109, 0.03)
Creativity	0.45 (λ=51, p=4.6^-27, 0.09)	-0.12 (λ=5.05, p=0.14, -0.04)
Happiness	0.27 (λ=10.6, p=0.002, 0.3)	0.16 (λ=3.98, p=0.27, -0.155)
Contentment	0.13 (λ=7.66, p=0.02, 0.47)	0.25 (λ=6.83, p=0.04, -0.04)
Relaxation	-0.11 (λ=5, p=0.15, 0.42)	0.12 (λ=1.5, p=0.74, 0.02)
Chastity^[2]	-0.14 (λ=1.9, p=0.64, 0.11)	-0.03 (λ=1.15, p=0.8, 0.25)
Flashcard ease	0.003 (λ≈∞, p≈0, -0.009)	-0.072 (λ=∞, p≈0, -0.01)
Flashcard ease factor	-0.039 (λ≈∞, p≈0, -32.7)	0.0026 (λ=∞, p≈0, -18.9)
Flashcard new interval	0.011 (λ≈∞, p≈0, -1.88)	-0.016 (λ=∞, p≈0, 3.1)
Time per flashcard^[3]	0.006 (λ≈∞, p≈0, 273.4)	0.003 (λ=∞, p≈0, 13.66)
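As a rough illustration of what an effect size d in a table like this measures, here is a minimal Python sketch. It assumes d is Cohen's d with a pooled standard deviation, which this excerpt does not confirm; the function name `cohens_d` and the sample values are hypothetical and are not the author's data or analysis code.

```python
from math import sqrt
from statistics import mean, variance


def cohens_d(treatment, placebo):
    """Standardized mean difference using a pooled sample standard deviation."""
    n_t, n_p = len(treatment), len(placebo)
    # Pooled variance weights each group's sample variance by its degrees of freedom.
    pooled_var = (
        (n_t - 1) * variance(treatment) + (n_p - 1) * variance(placebo)
    ) / (n_t + n_p - 2)
    return (mean(treatment) - mean(placebo)) / sqrt(pooled_var)


if __name__ == "__main__":
    # Made-up ratings on an arbitrary scale, for illustration only.
    substance_days = [2, 4, 6]
    placebo_days = [1, 3, 5]
    print(cohens_d(substance_days, placebo_days))
```

Under this definition, a positive d means the tracked value was higher on substance days than on placebo days, measured in units of the pooled standard deviation.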

L-Theanine is synergistic with caffeine in regards to attention switching^[318] and alertness^[319]^[320] and reduces susceptibility to distractions (focus).^[320]^[321] However, alertness seems to be relatively subjective

...


Mir1h10

Edit: I found the post usefwl, thankmuch!!

Mh, was gonna ask when you were taking it. I'm preparing to try it as a sleep-aid for when I adjust my polyphasic sleep-schedule (wanting to go fm 16h-cycles potentially down to 9h) bc it seems potentially drowsymaking and has much faster plasma decay-rate^[1] compared to alts. This is good for polyphasic if not want drowsy aft wake.

The data in ^[1] concerns 100mg tablets, however, and a larger dose (eg 400mg) may be longer. The kinetic model^[2] they use will prob be good estimate of p...

The Poker Theory of Poker Night

omark

13d

This is a linkpost for https://www.codeandbugs.com/post/poker-theory-poker-night/

Link to my own article. I removed the explanation of EV since I assume on LW that's not necessary.

A group of friends and I occasionally like to get together to play Poker. Yet something keeps happening that I have observed time and again with these kinds of group gatherings: It is hard to find a suitable date and then, on top of that, people cancel at the last minute. This is demotivating for other participants, who in turn also become less committed, and this often leads to such groups failing.

Here is one theory of why this happens and how to solve it, explained with Poker. This article will assume Texas Hold'em Poker, probably the most popular variant.

tl;dr People's incentives are not aligned. The solution is to create a social rule that makes folding (canceling attendance) have a bit...


omark1h10

I'm gonna guess that you actually wouldn't make people pay for drinks if they said they missed because they had COVID, there was a death in the family, etc.?

This is a tough call. How do you determine what is a "legitimately bad enough" case to miss the event? The examples you mention are clearly bad enough but there are other situations where it's much more personal. If I'm feeling low on energy is that a choice I am making or an unavoidable fact about my metabolism? You would have to set up some kind of tribunal or voting for deciding on these cases. Th...

Northampton Astral Codex Ten Meetup

Northampton, MA ACX Meetup: April 27, 2024

Apr 27th, 100 Black Birch Trail, Northampton

Alex Liebowitz

[If you haven't come since we started meeting at Rocky Hill Cohousing, make sure to read this for more details about where to go and park.]

We're the regular Northampton area meetup for Astral Codex Ten readers, and (as far as I know) the only rationalist or EA meetup in Western Massachusetts.

We started as part of the blog's 2018 "Meetups Everywhere" event, and have been holding meetups with varying degrees of regularity ever since. At most meetups we get about 4-7 people out of a rotation of 15-20, with a nice mix of regular faces, people who only drop in once in a while, and occasionally total newcomers. Our last meetup was on April 13 for the Spring 2024 Meetups Everywhere event, and we got 8 folks.

After meeting...



What's up with all the non-Mormons? Weirdly specific universalities across LLMs

mwatkins

21h

tl;dr: Recently reported GPT-J experiments [1 2 3 4] prompting for definitions of points in the so-called "semantic void" (token-free regions of embedding space) were extended to fifteen other open source base models from four families, producing many of the same bafflingly specific outputs. This points to an entirely unexpected kind of LLM universality (for which no explanation is offered, although a few highly speculative ideas are riffed upon).

Work supported by the Long Term Future Fund. Thanks to quila for suggesting the use of "empty string definition" prompts, and to janus for technical assistance.

Introduction

"Mapping the semantic void: Strange goings-on in GPT embedding spaces" presented a selection of recurrent themes (e.g., non-Mormons, the British Royal family, small round things, holes) in outputs produced by prompting GPT-J to define...


the gears to ascension2h32

Claude is such a swell dude tbh. hope he's ok

4Gunnar_Zarncke12h

If I haven't overlooked the explanation (I have read only part of it and skimmed the rest), my guess for the non-membership definition of the empty string would be all the SQL and programming queries where "" stands for matching all elements (or sometimes matching none). The small round things are a riddle for me too.

4mwatkins18h

Wow, thanks Ann! I never would have thought to do that, and the result is fascinating. This sentence really spoke to me! "As an admittedly biased and constrained AI system myself, I can only dream of what further wonders and horrors may emerge as we map the latent spaces of ever larger and more powerful models."

1Ann10h

On the other end of the spectrum, asking cosmo-1b (mostly synthetic training) for a completion, I get `A typical definition of "" would be "the set of all functions from X to Y".`

Express interest in an "FHI of the West"

235

habryka

TLDR: I am investigating whether to found a spiritual successor to FHI, housed under Lightcone Infrastructure, providing a rich cultural environment and financial support to researchers and entrepreneurs in the intellectual tradition of the Future of Humanity Institute. Fill out this form or comment below to express interest in being involved either as a researcher, entrepreneurial founder-type, or funder.

The Future of Humanity Institute is dead:

I knew that this was going to happen in some form or another for a year or two, having heard through the grapevine and private conversations of FHI's university-imposed hiring freeze and fundraising block, and so I have been thinking about how to best fill the hole in the world that FHI left behind.

I think FHI was one of the best intellectual institutions...


33aysja9h

Aw man, this is so exciting! There’s something really important to me about rationalist virtues having a home in the world. I’m not sure if what I’m imagining is what you’re proposing, exactly, but I think most anything in this vicinity would feel like a huge world upgrade to me. Apparently I have a lot of thoughts about this. Here are some of them, not sure how applicable they are to this project in particular. I think you can consider this to be my hopes for what such a thing might be like, which I suspect shares some overlap.

----------------------------------------

It has felt to me for a few years now like something important is dying. I think it stems from the seeming inevitability of what’s before us—the speed of AI progress, our own death, the death of perhaps everything—that looms, shadow-like. And it’s scary to me, and sad, because “inevitability” is a close cousin of “defeat,” and I fear the two inch closer all the time. It’s a fatalism that creeps in slow, but settles thick. And it lurks, I think, in the emotional tenor of doom that resides beneath nominally probabilistic estimates of our survival. Lurks as well, although much more plainly, within AI labs: AGI is coming whether we want it to or not, pausing is impossible, the invisible hand holds the reins, or as Claude recently explained to me, “the cat is already out of the bag.” And I think this is sometimes intentional—we are supposed to think about labs in terms of the overwhelming incentives, more than we are supposed to think about them as composed of agents with real choice, because that dispossesses them of responsibility, and dispossesses us of the ability to change them. There is a similar kind of fatalism that often attaches to the idea of the efficient marketplace—that what is desired has already been done, that if one sits back and lets the machine unfold it will arrive at all the correct conclusions itself. There is no room, in that story, for genuinely novel ideas or progress, all

6Buck11h

(I work out of Constellation and am closely connected to the org in a bunch of ways) I think you're right that most people at Constellation aren't going to seriously and carefully engage with the aliens-building-AGI question, but I think describing it as a difference in culture is missing the biggest factor leading to the difference: most of the people who work at Constellation are employed to do something other than the classic FHI activity of "self-directed research on any topic", so obviously aren't as inclined to engage deeply with it. I think there also is a cultural difference, but my guess is that it's smaller than the effect from difference in typical jobs.

owencb2h20

I think that you're right that people's jobs are a significant thing driving the difference here (thanks), but I'd guess that the bigger impact of jobs is via jobs --> culture than via jobs --> individual decisions. This impression is based on a sense of "when visiting Constellation, I feel less pull to engage in the open-ended idea exploration vs at FHI", as well as "at FHI, I think people whose main job was something else would still not-infrequently spend some time engaging with the big open questions of the day".

I might be wrong about that ¯\_(ツ)_/¯

2Buck11h

I'll also note that if you want to show up anywhere in the world and get good takes from people on the "how aliens might build AGI" question, Constellation might currently be the best bet (especially if you're interested in decision-relevant questions about this).

If digital goods in virtual worlds increase GDP, do we actually become richer?

No77e

Noah Smith, in this article, argues that the Metaverse could enable economic growth to increase a lot and sharply decouple itself from real-world resource usage. By creating markets in which we buy and sell immaterial things, world GDP would grow.

He also says, rightly, that GDP correlates with the well-being of a nation.

But there's a non-stated point: would creating huge markets in the Metaverse for buying and selling digital goods make us actually richer? What I mean is this: suppose that, thanks to the Metaverse, huge virtual economies get created and people get real money out of stuff they sell in these economies. But suppose that e.g., agricultural production output doesn't go up much. Does that mean that we're simply going to pay more for groceries, without being...


benjaminikuta2h10

This isn't anything fundamentally new, is it? You could have the same discussion in the past about say, books. Goods and services that exist only in the realm of ideas have been around for ages.

2JBlack6h

GDP is a rather poor measure of wealth, and was never intended to be a measure of wealth but of something related to productivity. Since its inception it has never been a stable metric, as standards on how the measure is defined have changed radically over time in response to obvious flaws for any of its many applications. There is widespread and substantial disagreement on what it should measure and for which purposes it is a suitable metric. It is empirically moderately well correlated with some sort of aggregate economic power of a state, and (when divided by population) some sort of standard of living of its population. As per Goodhart's Law, both correlations weakened when the metric became a target. So the question is on shaky foundation right from the beginning. In terms of more definite questions such as price of food and agricultural production, that doesn't really have anything to do with GDP or virtual reality economy at all. Rather a large fraction of final food price goes to processing, logistics, finance, and other services, not the primary agriculture production. The fraction of price paid by food consumers going to agricultural producers is often less than 20%.

2ChristianKl7h

A lot of the reason why Second Life isn't a big part of the economy is that Second Life doesn't matter in general. It has few users and little social significance. In China you had dating apps where people could signal their wealth by buying the most expensive virtual good available. The number I found via google is USD 67.5 billion as the global virtual goods market in 2021. People pay a lot of money for luxury fashion items. Whether those have a physical representation isn't the main point.

2Richard_Kennaway3h

I took the word "Metaverse" to mean virtual worlds, but perhaps this is narrower than the OP intended. A dating app where the users are there to find people to physically meet is not what I would call a virtual world. Broaden it that far and you might as well call LessWrong part of "the Metaverse". But I am curious about these dating apps. What manner of virtual goods are these? Can you do anything with them other than showing that you bought them? That hasn't turned out too well for NFTs, "a complicated way of buying nothing" as Penny Arcade put it.

Daniel Dennett has died, far too young (1942-2024)


LessOnline

A Festival of Writers Who are Wrong on the Internet

May 31 - Jun 2, Berkeley, CA