Ariel Kwiatkowski's Shortform

Ariel Kwiatkowski

Ariel Kwiatkowski's Shortform

1 min read30th May 20203 comments

This is a special post for quick takes by Ariel Kwiatkowski. Only they can create top-level comments. Comments here also appear on the Quick Takes page and All Posts page.

Ariel Kwiatkowski's Shortform

3 comments, sorted by

top scoring

Click to highlight new comments since: Today at 7:49 AM

[-]Ariel Kwiatkowski4y110

Has anyone tried to work with neural networks predicting the weights of other neural networks? I'm thinking about that in the context of something like subsystem alignment, e.g. in an RL setting where an agent first learns about the environment, and then creates the subagent (by outputting the weights or some embedding of its policy) who actually obtains some reward

[-]Ariel Kwiatkowski4y40

Looking for research idea feedback:

Learning to manipulate: consider a system with a large population of agents working on a certain goal, either learned or rule-based, but at this point - fixed. This could be an environment of ants using pheromones to collect food and bring it home.

Now add another agent (or some number of them) which learns in this environment, and tries to get other agents to instead fulfil a different goal. It could be ants redirecting others to a different "home", hijacking their work.

Does this sound interesting? If it works, would it potentially be publishable as a research paper? (or at least a post on LW) Any other feedback is welcome!

[-]romeostevensit4y20

This sounds interesting to me.

Moderation Log