LESSWRONG
LW

smallsilo
145Ω6720
Message
Dialogue
Subscribe

AI safety communications at FAR.AI

Previously at AISafety.info

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
Singapore - Small casual dinner meetup
smallsilo3y10

Hi! I'd be interested if there are others to form another group, or in any meetups in the future. There was also, an SSC/ACX meetup in Singapore a short while back, if anyone is interested in being added to that group

Reply
13Layered AI Defenses Have Holes: Vulnerabilities and Key Recommendations
Ω
9d
Ω
1
17Avoiding AI Deception: Lie Detectors can either Induce Honesty or Evasion
1mo
2
37Illusory Safety: Redteaming DeepSeek R1 and the Strongest Fine-Tunable Models of OpenAI, Anthropic, and Google
Ω
5mo
Ω
0
21Join AISafety.info's Distillation Hackathon (Oct 6-9th)
2y
0
18GPT-powered EA/LW weekly summary
2y
1
19Join AISafety.info's Writing & Editing Hackathon (Aug 25-28) (Prizes to be won!)
2y
3
38All AGI Safety questions welcome (especially basic ones) [July 2023]
2y
41