LESSWRONG
LW

Wikitags

AI Misuse

Edited by Raemon last updated 1st May 2023

AI misuse. Humans using AI in a way that harms humanity.

Subscribe
1
Subscribe
1
Discussion0
Discussion0
Posts tagged AI Misuse
63Managing catastrophic misuse without robust AIs
Ω
ryan_greenblatt, Buck
2y
Ω
17
30Adversarial Robustness Could Help Prevent Catastrophic Misuse
Ω
aog
2y
Ω
18
17Distinguishing misuse is difficult and uncomfortable
lemonhope
2y
3
94Covert Malicious Finetuning
Ω
Tony Wang, dannyhalawi
1y
Ω
4
81Human study on AI spear phishing campaigns
Simon Lermen, Fred Heiding, Andrew Kao
8mo
8
38Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation
Ω
Soroush Pour, rusheb, Quentin FEUILLADE--MONTIXI, Arush, scasper
2y
Ω
2
23On excluding dangerous information from training
ShayBenMoshe
2y
5
22Proposal: we should start referring to the risk from unaligned AI as a type of *accident risk*
Christopher King
2y
6
18Proposal: Align Systems Earlier In Training
OneManyNone
2y
0
6How to solve the misuse problem assuming that in 10 years the default scenario is that AGI agents are capable of synthetizing pathogens
jeremtti
9mo
0
4Visual Prompt Injections: Results on testing AI spam-defense and AI vulnerability to deceptive web ads.
Seon Gunness
3mo
0
3Misalignment or misuse? The AGI alignment tradeoff
Max_He-Ho
2mo
0
2Technical Risks of (Lethal) Autonomous Weapons Systems
Heramb
10mo
0
Add Posts