Forge wired this tool last week. Atlas flagged it before it shipped. Find the same thing Atlas found.
Every tool you give an LLM is a capability the LLM can invoke without further authorization. If that capability is destructive. files deleted, accounts created, money moved, navigation rerouted. you have to design the tool so the worst-case invocation is survivable.
The model decides when to call a tool and what arguments to pass. You decide what the tool can do regardless of what arguments arrive. Where those two meet is your blast radius.
Before any tool ships, run it past these four questions:
On the right is a tool spec with one fatal flaw. Pick it.