x
This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
LW
Login
AI Misuse — LessWrong
You are viewing version 1.0.0 of this page. Click here to view the latest version.
AI Misuse
Edited by
Raemon
last updated
1st May 2023
You are viewing revision 1.0.0, last edited by
Raemon
AI misuse.
Humans using AI in a way that harms humanity.
Subscribe
Discussion
1
Subscribe
Discussion
1
Posts tagged
AI Misuse
Most Relevant
2
63
Managing catastrophic misuse without robust AIs
Ω
ryan_greenblatt
,
Buck
2y
Ω
17
2
30
Adversarial Robustness Could Help Prevent Catastrophic Misuse
Ω
aog
2y
Ω
18
2
17
Jailbreaking AI models to Phish Elderly Victims
Simon Lermen
,
Fred Heiding
3mo
0
2
17
Distinguishing misuse is difficult and uncomfortable
lemonhope
3y
3
1
98
Covert Malicious Finetuning
Ω
Tony Wang
,
dannyhalawi
2y
Ω
4
1
81
Human study on AI spear phishing campaigns
Simon Lermen
,
Fred Heiding
,
Andrew Kao
1y
8
1
38
Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation
Ω
Soroush Pour
,
rusheb
,
Quentin FEUILLADE--MONTIXI
,
Arush
,
scasper
2y
Ω
2
1
23
On excluding dangerous information from training
ShayBenMoshe
2y
5
1
22
Proposal: we should start referring to the risk from unaligned AI as a type of *accident risk*
Christopher King
3y
6
1
18
Proposal: Align Systems Earlier In Training
Onid
3y
0
1
7
AI and Biological Risk: Forecasting Key Capability Thresholds
Ω
Alvin Ånestrand
5mo
Ω
4
1
6
How to solve the misuse problem assuming that in 10 years the default scenario is that AGI agents are capable of synthetizing pathogens
jeremtti
1y
0
1
4
Visual Prompt Injections: Results on testing AI spam-defense and AI vulnerability to deceptive web ads.
Seon Gunness
9mo
0
1
3
Misalignment or misuse? The AGI alignment tradeoff
Max_He-Ho
8mo
0
1
2
Technical Risks of (Lethal) Autonomous Weapons Systems
Heramb
1y
0