AI Misuse
Edited by Raemon, last updated 1st May 2023
AI misuse: humans using AI in a way that harms humanity.
Posts tagged AI Misuse
Karma | Title | Authors | Age | Comments
63 | Managing catastrophic misuse without robust AIs (Ω) | ryan_greenblatt, Buck | 2y | 17
30 | Adversarial Robustness Could Help Prevent Catastrophic Misuse (Ω) | aog | 2y | 18
17 | Distinguishing misuse is difficult and uncomfortable | lemonhope | 2y | 3
94 | Covert Malicious Finetuning (Ω) | Tony Wang, dannyhalawi | 1y | 4
81 | Human study on AI spear phishing campaigns | Simon Lermen, Fred Heiding, Andrew Kao | 10mo | 8
38 | Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation (Ω) | Soroush Pour, rusheb, Quentin FEUILLADE--MONTIXI, Arush, scasper | 2y | 2
23 | On excluding dangerous information from training | ShayBenMoshe | 2y | 5
22 | Proposal: we should start referring to the risk from unaligned AI as a type of *accident risk* | Christopher King | 2y | 6
18 | Proposal: Align Systems Earlier In Training | OneManyNone | 2y | 0
7 | AI and Biological Risk: Forecasting Key Capability Thresholds (Ω) | Alvin Ånestrand | 17d | 4
6 | How to solve the misuse problem assuming that in 10 years the default scenario is that AGI agents are capable of synthetizing pathogens | jeremtti | 11mo | 0
4 | Visual Prompt Injections: Results on testing AI spam-defense and AI vulnerability to deceptive web ads. | Seon Gunness | 5mo | 0
3 | Misalignment or misuse? The AGI alignment tradeoff | Max_He-Ho | 4mo | 0
2 | Technical Risks of (Lethal) Autonomous Weapons Systems | Heramb | 1y | 0