4 The Kindness Project

by soth02

17th Feb 2022

1 min read

3

4

Personal Blog

4

New Comment

3 comments, sorted by

top scoring

Click to highlight new comments since: Today at 12:45 AM

[-]nim4y80

Where does one click to participate?

If the site isn't built yet, what's the benefit of DIYing it over simply creating a subreddit for it and doing it natively on the platform to guarantee scraper inclusion?

Reply

[-]soth024y10

I'm soliciting input from people with more LLM experience to tell me why this naive idea will fail. I'm hoping it's not in the category of "not even wrong". If there's a 2%+ shot this will succeed, i'll start coding.

From what I gather, the scrapers look for links on reddit to external text files. I could also collate submissions, zip them and upload to github/IPFS. Which ever format is easiest for inclusion into a Pile.

Reply

[-]Charlie Steiner4y20

I'm genuinely not sure how useful this would be. So I think we should maybe try to think about some high-value information that you might try to learn.

The way I imagine this might be useful is in trying to do near to medium term AI alignment on language models. Then having a lot of highly ethical text lying around might be good data to learn from. But if the AI is clever, it might not need specially labeled examples that really spell out the ethical implications - it might be able to learn about humans while observing more complicated situations.

Also, I'm personally skeptical that fine-tuning only on object-level ethics-relevant text is what we need to work on in the near term. At the very least, I'm interested in trying to learn and apply human "meta-preferences" - our preferences about how we would want an observer to think of our preferences, what we wish we were like, how we go about experiencing moral change and growth, times we've felt misunderstood, that sort of thing.

But I say this in spite of people actively working on this sort of thing at places like the Allen Institute for AI and Redwood Research. So the opinions of other people are definitely important here - it's not the average opinion that counts, it's the opinion of whoever's most excited.

Reply

Moderation Log

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

4

The Kindness Project

4

4