This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
LW
Login
Wikitags
AI Misuse
Edited by
Raemon
last updated
1st May 2023
AI misuse.
Humans using AI in a way that harms humanity.
Subscribe
1
Subscribe
1
Discussion
0
Discussion
0
Posts tagged
AI Misuse
Most Relevant
63
Managing catastrophic misuse without robust AIs
Ω
ryan_greenblatt
,
Buck
2y
Ω
17
30
Adversarial Robustness Could Help Prevent Catastrophic Misuse
Ω
aog
2y
Ω
18
17
Distinguishing misuse is difficult and uncomfortable
lemonhope
2y
3
94
Covert Malicious Finetuning
Ω
Tony Wang
,
dannyhalawi
1y
Ω
4
81
Human study on AI spear phishing campaigns
Simon Lermen
,
Fred Heiding
,
Andrew Kao
8mo
8
38
Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation
Ω
Soroush Pour
,
rusheb
,
Quentin FEUILLADE--MONTIXI
,
Arush
,
scasper
2y
Ω
2
23
On excluding dangerous information from training
ShayBenMoshe
2y
5
22
Proposal: we should start referring to the risk from unaligned AI as a type of *accident risk*
Christopher King
2y
6
18
Proposal: Align Systems Earlier In Training
OneManyNone
2y
0
6
How to solve the misuse problem assuming that in 10 years the default scenario is that AGI agents are capable of synthetizing pathogens
jeremtti
9mo
0
4
Visual Prompt Injections: Results on testing AI spam-defense and AI vulnerability to deceptive web ads.
Seon Gunness
3mo
0
3
Misalignment or misuse? The AGI alignment tradeoff
Max_He-Ho
2mo
0
2
Technical Risks of (Lethal) Autonomous Weapons Systems
Heramb
10mo
0