This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Wikitags
LW
Login
Subscribe
Discussion
0
1
AI Misuse
Subscribe
Discussion
0
1
Written by
Raemon
last updated
1st May 2023
AI misuse.
Humans using AI in a way that harms humanity.
Posts tagged
AI Misuse
Most Relevant
63
Managing catastrophic misuse without robust AIs
Ω
ryan_greenblatt
,
Buck
1y
Ω
17
30
Adversarial Robustness Could Help Prevent Catastrophic Misuse
Ω
aog
2y
Ω
18
17
Distinguishing misuse is difficult and uncomfortable
lemonhope
2y
3
89
Covert Malicious Finetuning
Ω
Tony Wang
,
dannyhalawi
1y
Ω
4
79
Human study on AI spear phishing campaigns
Simon Lermen
,
Fred Heiding
,
Andrew Kao
5mo
8
38
Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation
Ω
Soroush Pour
,
rusheb
,
Quentin FEUILLADE--MONTIXI
,
Arush
,
scasper
2y
Ω
2
23
On excluding dangerous information from training
ShayBenMoshe
2y
5
22
Proposal: we should start referring to the risk from unaligned AI as a type of *accident risk*
Christopher King
2y
6
18
Proposal: Align Systems Earlier In Training
OneManyNone
2y
0
6
How to solve the misuse problem assuming that in 10 years the default scenario is that AGI agents are capable of synthetizing pathogens
jeremtti
7mo
0
2
Visual Prompt Injections: Results on testing AI spam-defense and AI vulnerability to deceptive web ads.
Seon Gunness
13d
0
2
Technical Risks of (Lethal) Autonomous Weapons Systems
Heramb
8mo
0