Noah Scales — LessWrong

How would you improve ChatGPT's filtering?

How would Less Wrong improve ChatGPT's filtering? I'm reading through the comments on breaking OpenAI's filtering, and I see plenty of analysis of the weaknesses of the safeguards. There's always the chance that some group could steal ChatGPT's source code and remove ad hoc additions to it, so...

Dec 10, 2022 • 9