x

LESSWRONG
LW

AI Misuse — LessWrong

You are viewing version 1.0.0 of this page. Click here to view the latest version.

AI Misuse

Edited by Raemon last updated 1st May 2023

You are viewing revision 1.0.0, last edited by Raemon

AI misuse. Humans using AI in a way that harms humanity.

Add Posts

1

1

Posts tagged AI Misuse

2

63Managing catastrophic misuse without robust AIs

ryan_greenblatt, Buck

2y

17

2

30Adversarial Robustness Could Help Prevent Catastrophic Misuse

2y

18

2

17Jailbreaking AI models to Phish Elderly Victims

Simon Lermen, Fred Heiding

3mo

0

2

17Distinguishing misuse is difficult and uncomfortable

3y

3

1

98Covert Malicious Finetuning

Tony Wang, dannyhalawi

2y

4

1

81Human study on AI spear phishing campaigns

Simon Lermen, Fred Heiding, Andrew Kao

1y

8

1

38Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation

Soroush Pour, rusheb, Quentin FEUILLADE--MONTIXI, Arush, scasper

2y

2

1

23On excluding dangerous information from training

2y

5

1

22Proposal: we should start referring to the risk from unaligned AI as a type of *accident risk*

Christopher King

3y

6

1

18Proposal: Align Systems Earlier In Training

3y

0

1

7AI and Biological Risk: Forecasting Key Capability Thresholds

Alvin Ånestrand

5mo

4

1

6How to solve the misuse problem assuming that in 10 years the default scenario is that AGI agents are capable of synthetizing pathogens

1y

0

1

4Visual Prompt Injections: Results on testing AI spam-defense and AI vulnerability to deceptive web ads.

9mo

0

1

3Misalignment or misuse? The AGI alignment tradeoff

8mo

0

1

2Technical Risks of (Lethal) Autonomous Weapons Systems

1y

0

Load More (15/15)

Add Posts