NotAWiz4rd
Differences in Alignment Behaviour between Single-Agent and Multi-Agent AI Systems
NotAWiz4rd (1d)

We used Claude Sonnet 4 for the agents and narration, and Claude 3.5 Sonnet for most of the evaluation.

We haven't made any specific plans yet for how to measure alignment; our first goal was to check whether there were any observable differences at all, before making those differences properly measurable.
