Fulcrum is excited to open-source two new tools:
Human attention is a scarce resource.
In the best case, coding with agents allows that attention to be spent on the “right” parts of your code: its functionality, architecture, and its failure-modes. In the worst case, your attention is spent on the behavior of your agents: preventing them from taking unsafe actions, asking them not to reward hack, and understanding if their outputs are trustworthy.
Can we use agents to help us out?
Quibbler is a background agent that monitors and critiques your coding agent’s actions using hooks. Unlike most critics and guardrails, Quibbler is an agent: it can read and understand the context of an agent’s action to see if it made a mistake.
We’ve found Quibbler useful in preventing agents from
In longer running tasks, we found Quibbler useful in enforcing intent, allowing us to check in on our agent less. You can configure your guardrails, and Quibbler learns from your usage. Quibbler currently supports Claude Code: we are adding support for other agents soon!
Orchestra is a step towards true multi-agent coding: with parallel execution, active coordination, and full visibility of your coding agents. You plan with a designer agent, which spawns executors that work in isolated environments. When an executor needs help, it messages the designer agent (which gets your input if needed).
We’ve used Orchestra to:
Orchestra's oversight features -- model to model orchestration and monitoring -- are what actually makes it possible for parallelization to be useful, and not destructive.
What are the properties of the interfaces that we will use to manage agents in real time? Here are some ideas:
Real-time oversight of agent systems will be critical as agents scale beyond coding. Live monitoring of agents in real-world settings might also teach us lessons about AI oversight that can later be applied to x-risk mitigation.
If you’re interested, we’d love to hear from you. We're hiring!