LESSWRONG
LW

David Pang
1020
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
A Proof Against Oracle AI
David Pang6y00

Interesting thought. I’m a bit new to this site but is there a “generally accepted” class of solutions for these type of AI problems? What if humans used multiple of these Oracle AIs in isolation (they don’t know about each other) and the humans asking the questions show no reaction to the answers. The questions are all planned in advance so the AI cannot “game” humans by “influencing” the next question using its current answer. What about once the AI achieves some level of competency we reset it after each question-answer cycle so it’s competence is frozen at some useful but not nefarious level (assuming we can figure out where that line is).

Reply
Even if your Voice Shakes
David Pang6y30

Do you have a PayPal or something other than Patreon? I’ll make a one time donation but have an (perhaps unreasonable) abhorrence to Patreon’s recurring model.

Reply
No posts to display.