Usually lower numbers go on the left and bigger numbers go on the right (1, 2, 3, …), so it seems reasonable to have it this way.
RL generalization is controlled by why the policy took an action
Is this that good a framing for these experiments? Just thinking out loud:
Distinguish two claims
These experiments seem to test (1), while the claim from your old RL posts is more like (2).
You might want to argue that the claims are actually very similar, but I suspect that someone who disagrees with the quoted statement would believe (1) despite not believing (2). To convince such people we’d have to test (2) directly, or argue that (1) and (2) are very similar.
As for whether the claims are very similar… I'm actually not sure they are (I changed my mind while writing this comment).
Re: (1), it’s clear that when you get an answer right using a particular line of reasoning, the gradients point towards using that line of reasoning more often. But re: (2), the gradients point towards whatever is the easiest way to get you to produce those output tokens more often, which could in principle be via a qualitatively different computation to the one that actually caused you to output them this time.
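To spell out what I mean by "the gradients point towards": for a vanilla REINFORCE-style update (my own notation, not anything from the paper), with prompt x, sampled output tokens y, and reward R(y), the gradient estimate is roughly

∇_θ J(θ) ≈ R(y) · ∇_θ log π_θ(y | x) = R(y) · Σ_t ∇_θ log π_θ(y_t | x, y_{<t}).

The right-hand side only mentions the sampled tokens and the reward; it says nothing about the internal computation that actually produced y, so the update moves the parameters toward whatever change most cheaply raises the log-probability of those tokens.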
So the two claims are at least somewhat different. (1) seems more strongly true than (2) (although I still believe (2) is likely true to a large extent).
But their setup adds:
1.5. Remove any examples in which the steering actually resulted in the desired behaviour.
which is why it’s surprising.
the core prompting experiments were originally done by me (the lead author of the paper) and I'm not an Anthropic employee. So the main results can't have been an Anthropic PR play (without something pretty elaborate going on).
Well, Anthropic chose to take your experiments and build on and promote them, so that could have been a PR play, right? Not saying I think it was, just doubting the local validity.
Sorry, can't remember. Something done virtually, maybe during Covid.
It's the last sentence of the first paragraph of section 1.
Not sure how to think about this overall. I can come up with examples where it seems like you should assign basically full credit for sloppy or straightforwardly wrong statements.
E.g. suppose Alice claims that BIC only make black pens. Bob says, "I literally have a packet of blue BIC pens in my desk drawer. We will go to my house, open the drawer, and you will see them." They go to Bob's house, and lo, the desk drawer is empty. Turns out the pens are on the kitchen table instead. Clearly it's fine for Bob to say, "All I really meant was that I had blue pens at my house, the point stands."
I think your mention of motte-and-baileys probably points at the right refinement: maybe it's fine to be sloppy if the version you later correct yourself to has the same implications as what you literally said. But if you correct yourself to something that's easier to defend but doesn't support your initial conclusion to the same extent, that's bad.
EDIT: another important feature of the pens example is that the statement Bob switched to is uncontroversially true. If, on finding the desk drawer empty, he instead wanted to switch to "I left them at work", then probably he should pause and admit a mistake first.
Independently of the broader point, here are some comments on the particular example from the Scientist AI paper (source: I am an author):
I’m confused by the “necessary and sufficient” in “what is the minimum necessary and sufficient policy that you think would prevent extinction?”
Who's to say there exists a policy which is both necessary and sufficient? Unless we mean something kinda weird by "policy" that can include a huge disjunction (e.g. "we do any of the 13 different things I think would work") or can be very vague (e.g. "we solve half A of the problem and also half B of the problem").
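One way to put this formally (my framing, not anything from the post): say a policy P is sufficient if P ⇒ no extinction, and necessary if no extinction ⇒ P. If two sufficient policies P₁ and P₂ exist and either one can be implemented without the other, then neither is necessary (we could survive via the other one), so nothing is both necessary and sufficient, short of treating the disjunction P₁ ∨ P₂ as a single "policy", which is exactly the kind of weird object mentioned above.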
It would make a lot more sense in my mind to ask "what is a minimal sufficient policy that you think would prevent extinction?"