BionicD0LPH1N

Email at acxmontreal@gmail.com for ACX Montreal-related issues/comments/feedback.

Come on out to the next ACX (Astral Codex Ten) Montreal Meetup! This week, we're discussing self-deception.

Readings:

The hostile telepaths problem, by Valentine.
Doublethink (Choosing to be Biased), by Eliezer Yudkowsky
[Optional] False Faces, by.. Ziz
[Optional] LLMs one-box when in a "hostile telepath" version of Newcomb's Paradox, except for the one that beat the predictor, by Kaj_Sotala

Please suggest topics or readings for future meetups on this form. Thanks to Nik for this week's suggestion.

Venue: Ye Olde Orchard Pub & Grill, 20 Prince Arthur St W.
Date & Time: Saturday, February 14th, 2026, 2PM.

RSVP by clicking "Going" at the top of this post.

Send a message on our Montreal Rationalists Discord on channel #meetup-general if you have trouble finding us or any other issues.

Please also join the mailing list and our Discord server if you haven't already. We host biweekly ACX Montreal meetups, so join us if you don't want to miss any of them!

Come on out to the next ACX (Astral Codex Ten) Montreal Meetup! This week, we're discussing The Dilbert Afterlife, by Scott Alexander.

Optional: the follow-up Highlights From The Comments On Scott Adams.

Please suggest topics or readings for future meetups on this form.

Venue: Ye Olde Orchard Pub & Grill, 20 Prince Arthur St W.
Date & Time: Saturday, January 31st, 2026, 1PM.

RSVP by clicking "Going" at the top of this post.

Send a message on our Montreal Rationalists Discord on channel #meetup-general if you have trouble finding us or any other issues.

Please also join the mailing list and our Discord server if you haven't already. We host biweekly ACX Montreal meetups, so join us if you don't want to miss any of them!

Come on out to the next ACX (Astral Codex Ten) Montreal Meetup! This week, we're discussing To be legible, evidence of misalignment probably has to be behavioral, by Ryan Greenblatt. He argues that for evidence of misalignment to convince people to take strong action, it probably needs to be behavioral, and that internals-based methods (interpretability, ELK) alone are unlikely to be sufficiently persuasive.

Optional readings:
- Safety isn't safety without a social model, by Andrew Critch
- The inaugural Redwood Research podcast, by Ryan Greenblatt and Buck Shlegeris

The podcast is great.

Please suggest topics or readings for future meetups on this form.

Venue: Ye Olde Orchard Pub & Grill, 20 Prince Arthur St W.
Date & Time: Saturday,... (read more)

Happy new year!

Come on out to the next ACX (Astral Codex Ten) Montreal Meetup! This week, we're discussing Links For December 2025 and our forecasts for 2026.

Please suggest topics or readings for future meetups on this form.

Venue: Ye Olde Orchard Pub & Grill, 20 Prince Arthur St W.
Date & Time: Saturday, January 3rd, 2025, 2PM.

RSVP by clicking "Going" at the top of this post.

Send a message on our Montreal Rationalists Discord on channel #meetup-general if you have trouble finding us or any other issues.

Please also join the mailing list and our Discord server if you haven't already. We host biweekly ACX Montreal meetups, so join us if you don't want to miss any of them!

Note the change in start time.

Come on out to the next ACX (Astral Codex Ten) Montreal Meetup! This week, we're discussing Sara Constantin's post on Ra.

Optional readings:

Materialism, by Meaningness
Sabbath hard and go home, by Benjamin Hoffman

Please suggest topics or readings for future meetups on this form. Thanks to Jodie for this week's suggestion!

Venue: Ye Olde Orchard Pub & Grill, 20 Prince Arthur St W.
Date & Time: Saturday, December 20th, 2025, 2PM.

RSVP by clicking "Going" at the top of this post.

Send a message on our Montreal Rationalists Discord on channel #meetup-general if you have trouble finding us or any other issues.

Please also join the mailing list and our Discord server if you haven't already. We host biweekly ACX Montreal meetups, so join us if you don't want to miss any of them!

Come on out to the next ACX (Astral Codex Ten) Montreal Meetup! This week, we're discussing the post Are you a jerk, or a liar? by Cate Hall. The post sketches "truth-seeking" vs "community-seeking" as modes of communication and argues that a lot of “low-integrity vs low-empathy” conflict is just those modes interacting.

Please suggest topics or readings for future meetups on this form. Thanks to Jodie for this week's suggestion!

Venue: Ye Olde Orchard Pub & Grill, 20 Prince Arthur St W.
Date & Time: Saturday, December 6th, 2025, 1PM.

RSVP by clicking "Going" at the top of this post.

Send a message on our Montreal Rationalists Discord on channel #meetup-general if you have trouble finding us or any other issues.

Please also join the mailing list and our Discord server if you haven't already. We host biweekly ACX Montreal meetups, so join us if you don't want to miss any of them!

IMPORTANT: Take a look at the Montreal Solstice Announcement.

Come on out to the next ACX (Astral Codex Ten) Montreal Meetup! This week, we're discussing the post Making Your Pain Worse can Get You What You Want, by Logan Riggs.

Optional Reading:

Pain is not the unit of Effort, by alkjash

Please suggest topics or readings for future meetups on this form.

Venue: Ye Olde Orchard Pub & Grill, 20 Prince Arthur St W.
Date & Time: Saturday, November 22nd, 2025, 1PM.

RSVP by clicking "Going" at the top of this post.

Send a message on our Montreal Rationalists Discord on channel #meetup-general if you have trouble finding us or any other issues.

Please also join the mailing list and our Discord server if you haven't already. We host biweekly ACX Montreal meetups, so join us if you don't want to miss any of them!

IMPORTANT: Take a look at the Montreal Solstice Announcement.

Come on out to the next ACX (Astral Codex Ten) Montreal Meetup! This week, we're discussing the post The Most Common Bad Argument In These Parts. A memorable post about why 'I can't think of any other way it could happen' is almost never a valid argument.

Please suggest topics or readings for future meetups on this form.

Venue: Ye Olde Orchard Pub & Grill, 20 Prince Arthur St W.
Date & Time: Saturday, November 8th, 2025, 1PM.

RSVP by clicking "Going" at the top of this post.

I won't be able to make it to the meetup this week, but Nik has kindly agreed to be the meetup organizer. Thanks Nik!

Send a message on our Montreal Rationalists Discord on channel #meetup-general if you have trouble finding us or any other issues.

Please also join the mailing list and our Discord server if you haven't already. We host biweekly ACX Montreal meetups, so join us if you don't want to miss any of them!

NOTE: Switching to winter meetup location.

Come on out to the next ACX (Astral Codex Ten) Montreal Meetup! This week, we're discussing Emotions Make Sense. The post explores specific "bad" emotions like jealousy, boredom, and confusion to figure out what adaptive purpose they serve.

Please suggest topics or readings for future meetups on this form.

Venue: Ye Olde Orchard Pub & Grill, 20 Prince Arthur St W.
Date & Time: Saturday, October 25th, 2025, 1PM.

RSVP by clicking "Going" at the top of this post.

Send a message on our Montreal Rationalists Discord on channel #meetup-general if you have trouble finding us or any other issues.

Please also join the mailing list and our Discord server if you haven't already. We host biweekly ACX Montreal meetups, so join us if you don't want to miss any of them!

Additionally, Jeremy is hosting a reading group for If Anyone Builds It, Everyone Dies. Register to the event if you want to attend!

Come on out to the next ACX (Astral Codex Ten) Montreal Meetup! This week, we're discussing Kat Woods' post, Why you should eat meat - even if you hate factory farming.

Optional readings:

Change my mind: Veganism entails trade-offs, and health is one of the axes, by Elizabeth

Please suggest topics or readings for future meetups on this form.

Venue: L'Esplanade Tranquille, 1442 Clark. Rough location here: https://plus.codes/87Q8GC5P+P2R. Note: It's likely the last meetup we'll be able to have at L'Esplanade Tranquille this year, but Saturday has pretty good weather. Just don't forget to bring a jacket.
Date & Time: Saturday, October 11th, 2025, 1PM.

RSVP by clicking "Going" at the top of this post.

Send a message on our Montreal Rationalists Discord on channel #meetup-general if you have trouble finding us or any other issues.

Please also join the mailing list and our Discord server if you haven't already. We host biweekly ACX Montreal meetups, so join us if you don't want to miss any of them!

Quick update! Due to the pub being too full/loud, we'll be moving the meetup to Ezra and Haydn's place in ~15(?) minutes. See specific address on the Montreal Rationalists Discord.

Thanks for noticing and pointing it out!

The API token was cancelled, sorry about that. The most recent version of the chatbot is now at https://chat.stampy.ai and https://chat.aisafety.info, and should not have the API token issue.

~~I'm glad to hear you're trying to catch up with the alignment ecosystem!~~

~~It is still supposed to be live and active, and it still works for me. Are you sure you have~~ ~~https://alignmentsearch.up.railway.app~~? If so, then I'm not sure what's going on, it worked for everyone who I know that tried. If you have a different link, maybe we've been linking to the website incorrectly somewhere so please share the link you do have.

Edit: just realized you weren't speaking of https://alignmentsearch.up.railway.app, I thought it was a standalone comment. I'm getting the same 404 error for the aisafety.world link.

Thanks!

Just did. :)
We've seen the channel, yes, though haven't messaged in it or anything.

Thanks for the comment!

At this point, we don't have a very clear plan, other than thinking of functionalities and adding them as fast as possible in an order that seems sensible. The functionalities we want to add include:

Automatic update of the dataset relatively often.
Stream completions.
Test embeddings using SentenceTransformers + Finetuning instead of OpenAI for cost and quality, and store them in Pinecone/Weaviate/Other (tbd); this will enable us to use the whole dataset for semantic search, and for the semantic similarity to have more 'knowledge' about technical terms used in the alignment space, which I expect to produce better results. We also want to test and add biases to favor 'good' sources to

... (read more)

LESSWRONG
LW

LESSWRONG
LW

BionicD0LPH1N

Optimally Combining Probe Monitors and Black Box Monitors

Untrusted AIs can exploit feedback in control protocols

Introducing AlignmentSearch: An AI Alignment-Informed Conversional Agent

BionicD0LPH1N

BionicD0LPH1N

Optimally Combining Probe Monitors and Black Box Monitors

Untrusted AIs can exploit feedback in control protocols

Introducing AlignmentSearch: An AI Alignment-Informed Conversional Agent