x
This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
LW
Login
AI Misuse — LessWrong
AI Misuse
Edited by
Raemon
last updated
1st May 2023
AI misuse.
Humans using AI in a way that harms humanity.
Subscribe
Discussion
1
Subscribe
Discussion
1
Posts tagged
AI Misuse
Most Relevant
2
63
Managing catastrophic misuse without robust AIs
Ω
ryan_greenblatt
,
Buck
2y
Ω
17
2
30
Adversarial Robustness Could Help Prevent Catastrophic Misuse
Ω
aog
2y
Ω
18
2
17
Jailbreaking AI models to Phish Elderly Victims
Simon Lermen
,
Fred Heiding
16d
0
2
17
Distinguishing misuse is difficult and uncomfortable
lemonhope
3y
3
1
94
Covert Malicious Finetuning
Ω
Tony Wang
,
dannyhalawi
1y
Ω
4
1
81
Human study on AI spear phishing campaigns
Simon Lermen
,
Fred Heiding
,
Andrew Kao
11mo
8
1
38
Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation
Ω
Soroush Pour
,
rusheb
,
Quentin FEUILLADE--MONTIXI
,
Arush
,
scasper
2y
Ω
2
1
23
On excluding dangerous information from training
ShayBenMoshe
2y
5
1
22
Proposal: we should start referring to the risk from unaligned AI as a type of *accident risk*
Christopher King
3y
6
1
18
Proposal: Align Systems Earlier In Training
OneManyNone
3y
0
1
7
AI and Biological Risk: Forecasting Key Capability Thresholds
Ω
Alvin Ånestrand
2mo
Ω
4
1
6
How to solve the misuse problem assuming that in 10 years the default scenario is that AGI agents are capable of synthetizing pathogens
jeremtti
1y
0
1
4
Visual Prompt Injections: Results on testing AI spam-defense and AI vulnerability to deceptive web ads.
Seon Gunness
6mo
0
1
3
Misalignment or misuse? The AGI alignment tradeoff
Max_He-Ho
5mo
0
1
2
Technical Risks of (Lethal) Autonomous Weapons Systems
Heramb
1y
0