Nisan — LessWrong

GPT = Generative Pre-Training?

Everyone thinks GPT stands for "Generative Pre-trained Transformer". (For example, Wikipedia.) Does it really? The earliest mention of "GPT" is in the GPT-2 paper, which refers to "the OpenAI GPT model" and cites the GPT-1 paper. That paper does not contain the phrase "generative pre-trained transformer". But it does contain the phrase "generative pre-training", in the title and in the body, italicized.

A high integrity/epistemics political coalition?

Nisan4d182

Good post, but do you really want to use the specific term political machine for this? "Bloc" or "institution" seem more like what you're talking about here, unless I'm misunderstanding.

Little Echo

Nisan10d20

So you form the band and try to figure out how to keep everyone working together so that no one's confidence drops below 50%. If you're not sure you can do that, consider the value of trying anyway and seeing if you can do it. If the expected values still don't work out, don't start the band.

The Missing Genre: Heroic Parenthood - You can have kids and still punch the sun

Nisan20d80

The Rothschilds musical is about the ambitious Mayer Rothschild who raises his children to be his business partners. I recommend the 1970 recording.

Part of Inventing the Renaissance by Ada Palmer tells the story of Cosimo de Medici securing lasting power for his family. It's a fun read.

Of course Barrayar is about an adventuring expectant mother. Cordelia Naismith shows up later in the Vorkosigan series, but in Mirror Dance she seemed larger than life, no longer the adventuring type.

peterbarnett's Shortform

Nisan1mo207

Another reason labs don't provide CoTs is that if users see them, the labs will be incentivized to optimize for them, and this will decrease their informativeness. A flag like you propose would have a similar effect.

Dalcy's Shortform

Nisan1mo40

In 3, there's also the Brown representability theorem: Cohomology groups are just homotopy groups, with the sphere spectrum replaced with the Eilenberg-MacLane spectrum.

GradientDissenter's Shortform

Nisan1mo20

"the geeks are generally worse, unless they make it an explicit optimization target, but there are a bunch of very competent sociopaths around, in the Venkatesh Rao sense of the word, which seem a lot more competent and empowered than even the sociopaths in other communities"

Are you combining Venkatesh Rao's loser/clueless/sociopath taxonomy with David Chapman's geek/mop/sociopath?

(ETA: I know this is not relevant to the discussion, but I confuse these sometimes.)

Mo Putera's Shortform

Nisan1mo*50

All the mathematicians quoted above can successfully write proofs that convince experts that something is true and why something is true; the quotes are about the difficulty of conveying the way the mathematician found that truth. All those mathematicians can convey the that and the why, except for Mochizuki and his circle.

The matter of Mochizuki's work on the abc conjecture is intriguing because the broader research community has neither accepted his proof nor refuted it. The way to bet now is that his proof is wrong:

Professional mathematicians have not and will not publicly declare that "Mochizuki's proof is X% likely to be correct". Why? I'd guess one reason is that it's their job to provide a definitive verdict that serves as the source of truth for probabilistic forecasts. If the experts gave subjective probabilities, it would confuse judgments of different kinds.

Nisan's Shortform

Nisan1mo20

There's now this post by GradientDissenter and this post by me.

Drake Thomas's Shortform

Nisan1mo20

Oh sorry, somehow I forgot what you wrote about Reginald Johnston before writing my comment! I haven't read anything else about Puyi, so my suspicion is just a hunch.
