LESSWRONG
Community
LW

1337
AI
Event

1

AI Safety Thursday: Attempts and Successes of LLMs Persuading on Harmful Topics

by georgia_berg
1 min read
0

1

Thursday 2nd October at 10:00 pm to Friday 3rd October at 1:00 am GMT
30 Adelaide Street East, Toronto, ON, Canada
Register

Posted on: 12th Sep 2025

Subscribe to group

1

New Comment
Moderation Log
More from georgia_berg
View more
Curated and popular this week
0Comments
Trajectory Labs (Toronto AI Safety)

Description

​Large Language Models can persuade people at unprecedented scale—but how effectively, and are they willing to try persuading us toward harmful ideas?

​In this talk, Matthew Kowal and Jasper Timm will present findings showing that LLMs can shift beliefs toward conspiracy theories as effectively as they debunk them, and that many models are willing to attempt harmful persuasion on dangerous topics.

​Event Schedule

​6:00 to 6:30 - Food & Networking
6:30 to 7:30 - Main Presentation & Questions
7:30 to 9:00 - Breakout Discussions

​​​​If you can't make it in person, feel free to join the live stream starting at 6:30 pm, via this link.