LESSWRONG
LW

3539
Wikitags

AI Misuse

Edited by Raemon last updated 1st May 2023

AI misuse. Humans using AI in a way that harms humanity.

Subscribe
Discussion
1
Subscribe
Discussion
1
Posts tagged AI Misuse
63Managing catastrophic misuse without robust AIs
Ω
ryan_greenblatt, Buck
2y
Ω
17
30Adversarial Robustness Could Help Prevent Catastrophic Misuse
Ω
aog
2y
Ω
18
17Distinguishing misuse is difficult and uncomfortable
lemonhope
2y
3
94Covert Malicious Finetuning
Ω
Tony Wang, dannyhalawi
1y
Ω
4
81Human study on AI spear phishing campaigns
Simon Lermen, Fred Heiding, Andrew Kao
10mo
8
38Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation
Ω
Soroush Pour, rusheb, Quentin FEUILLADE--MONTIXI, Arush, scasper
2y
Ω
2
23On excluding dangerous information from training
ShayBenMoshe
2y
5
22Proposal: we should start referring to the risk from unaligned AI as a type of *accident risk*
Christopher King
2y
6
18Proposal: Align Systems Earlier In Training
OneManyNone
2y
0
7AI and Biological Risk: Forecasting Key Capability Thresholds
Ω
Alvin Ånestrand
17d
Ω
4
6How to solve the misuse problem assuming that in 10 years the default scenario is that AGI agents are capable of synthetizing pathogens
jeremtti
11mo
0
4Visual Prompt Injections: Results on testing AI spam-defense and AI vulnerability to deceptive web ads.
Seon Gunness
5mo
0
3Misalignment or misuse? The AGI alignment tradeoff
Max_He-Ho
4mo
0
2Technical Risks of (Lethal) Autonomous Weapons Systems
Heramb
1y
0
Add Posts