Guard — Agent Runtime Security
Protect your AI agent pipeline from prompt injection attacks delivered through tool inputs and outputs.
Overview
AI agents call external tools — web search, file read, code execution, database queries — whose outputs can contain indirect prompt injection payloads. An attacker embeds malicious instructions in a web page or document, and the agent retrieves and processes them as if they were trusted input.
Guard sits between the agent and its tools. It wraps tool functions with the same deterministic detection engine as @safepaste/core, scanning both inputs and outputs at runtime. When an attack is detected, Guard can log it, warn about it, block it, or hand the decision to your callback — depending on the mode you choose.
Guard never calls the SafePaste API. It runs @safepaste/core locally, in-process, with zero network overhead.
Installation
```sh
npm install @safepaste/guard @safepaste/core
```
@safepaste/core is a peer dependency — you provide it. This means Guard always uses whatever version of the detection engine you have installed, and you control when to upgrade.
Quick Start
Three steps to protect your agent tools from prompt injection.
Create a guard instance
Choose a mode that determines what happens when an attack is detected. Use block to throw on detection, or warn to log and continue.
Wrap your tool functions
Call guard.wrapTool(name, fn) to create a guarded version of any tool function. The wrapped function scans inputs before execution and outputs after.
Handle blocked attacks
In block mode, a GuardError is thrown when an attack is detected. Catch it and respond appropriately.
```js
var { createGuard } = require('@safepaste/guard');

// 1. Create a guard instance
var guard = createGuard({ mode: 'block' });

// 2. Wrap your tool function
var safeSearch = guard.wrapTool('web_search', searchFn);

// 3. Handle blocked attacks
try {
  var result = await safeSearch('latest news');
} catch (e) {
  if (e.name === 'GuardError') {
    console.log('Blocked:', e.guardResult.scan.risk);
  }
}
```
Modes
The mode controls what happens when Guard detects a prompt injection. Choose the right mode for your use case.
| Mode | On detection | Use case |
|---|---|---|
| `log` | Returns the `GuardResult` silently. No side effects. | Monitoring and analytics. Collect data without affecting behavior. |
| `warn` | Calls `console.warn()` with detection details. Tool still executes. | Development and staging. See detections in your logs without blocking. |
| `block` | Throws a `GuardError`. Tool does not execute. | Production enforcement. Prevent attacks from reaching your tools. |
| `callback` | Calls your function with the `GuardResult`. Return `false` to block. | Custom logic. Route to a review queue, apply business rules, etc. |
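A common pattern is to pick the mode per environment and pass it to `createGuard`. A minimal sketch, assuming nothing beyond the mode names in the table above (the helper name and environment convention are this example's own, not part of the Guard API):

```js
// Illustrative helper: map a deployment environment to a Guard mode.
// 'block' enforces in production, 'warn' surfaces detections in staging
// logs, and 'log' collects data silently everywhere else.
function chooseMode(env) {
  if (env === 'production') return 'block';
  if (env === 'staging') return 'warn';
  return 'log';
}

console.log(chooseMode('production')); // prints: block
```

The returned string can then be passed as `createGuard({ mode: chooseMode(process.env.NODE_ENV) })`.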
Per-direction mode configuration
You can set different modes for inputs and outputs. For example, warn on inputs but block on outputs — useful when tool outputs (like web page content) are the higher-risk vector.
```js
var guard = createGuard({
  mode: { input: 'warn', output: 'block' }
});
```
Callback mode
Pass a function as the mode to make your own allow/block decisions. Return false to block; any other return value allows the tool to proceed.
```js
var guard = createGuard({
  mode: function (guardResult) {
    if (guardResult.scan.score > 80) {
      return false; // block high-confidence attacks
    }
    sendToReviewQueue(guardResult); // log medium-confidence for review
  }
});
```
API Reference
createGuard(options)
Creates a guard instance with the given configuration. Returns an object with scanInput, scanOutput, wrapTool, and wrapTools methods.
| Option | Type | Default | Description |
|---|---|---|---|
| `mode` | `string \| Function \| Object` | `'warn'` | Detection mode: `'log'`, `'warn'`, `'block'`, a callback function, or `{ input, output }` for per-direction config. |
| `strict` | `boolean` | `false` | Use strict detection threshold (25 instead of 35). Catches more borderline cases. |
| `on.detection` | `Function` | `null` | Called on every detection (flagged result), regardless of mode. Receives the `GuardResult`. |
| `on.blocked` | `Function` | `null` | Called when a detection results in a block (before the `GuardError` is thrown). Receives the `GuardResult`. |
| `on.error` | `Function` | `null` | Called when the scanning process itself fails (not a block). Receives the error and context `{ tool, point }`. |
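Putting the options together, a full configuration might look like the sketch below. Only the option names and shapes come from the table above; the handler bodies are placeholders:

```js
// Hypothetical full options object for createGuard, combining every
// option from the table above. Handler bodies are placeholders.
var options = {
  mode: { input: 'warn', output: 'block' }, // per-direction config
  strict: true,                             // lower threshold: 25 instead of 35
  on: {
    detection: function (result) { /* every flagged result, any mode */ },
    blocked: function (result) { /* a detection that resulted in a block */ },
    error: function (err, ctx) { /* scan failure; ctx is { tool, point } */ }
  }
};

console.log(options.mode.output); // prints: block
```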
guard.wrapTool(name, fn)
Wraps a standalone tool function. The returned function scans the input arguments before calling the original function, then scans the output after it resolves. Works with both sync and async functions.
```js
var safeSearch = guard.wrapTool('web_search', searchFn);
var safeRead = guard.wrapTool('file_read', readFileFn);

// Wrapped functions have the same signature as the originals
var results = await safeSearch('query');
var content = await safeRead('/path/to/file.txt');
```
The wrapped function calls `fn` with `this = null`. To wrap a method, bind it first: `guard.wrapTool('name', obj.method.bind(obj))`.
guard.wrapTools(toolMap)
Wraps all functions in a plain { name: fn } object. Returns a new object with the same keys, where each function is wrapped. Non-function values are copied by reference.
```js
var tools = {
  web_search: searchFn,
  file_read: readFileFn,
  code_exec: execFn
};

var safeTools = guard.wrapTools(tools);
// safeTools.web_search, safeTools.file_read, safeTools.code_exec are all guarded
```
guard.scanInput(text, ctx) / guard.scanOutput(text, ctx)
Manually scan text without wrapping a tool function. Useful when you want to scan at a specific point in your pipeline rather than wrapping entire functions.
```js
// Scan tool input manually
var inputResult = guard.scanInput(userQuery, { tool: 'web_search' });

// Scan tool output manually
var outputResult = guard.scanOutput(webPageContent, { tool: 'web_search' });

if (outputResult.flagged) {
  console.log('Attack detected in output:', outputResult.scan.risk);
}
```
scanToolInput(text, opts) / scanToolOutput(text, opts)
Standalone functions that scan text without creating a guard instance. These always use log mode — they never throw or warn, just return the GuardResult. Useful for one-off checks or when you want to handle the result entirely yourself.
```js
var { scanToolInput, scanToolOutput } = require('@safepaste/guard');

var result = scanToolInput(text, { tool: 'web_search', strict: true });
if (result.flagged) {
  // handle it your way
}
```
GuardResult Shape
Every scan returns a GuardResult object with the following structure.
| Field | Type | Description |
|---|---|---|
| `flagged` | `boolean` | Whether a prompt injection was detected (score exceeded threshold). |
| `action` | `'pass' \| 'log' \| 'warn' \| 'block' \| 'callback'` | The action taken based on the mode. `'pass'` means no detection. |
| `scan.flagged` | `boolean` | Same as top-level `flagged`. |
| `scan.risk` | `'low' \| 'medium' \| 'high'` | Risk level based on score. |
| `scan.score` | `number` | Threat score from 0 to 100. |
| `scan.threshold` | `number` | Score threshold used (35 normal, 25 strict). |
| `scan.matches` | `Array` | Matched patterns with `category`, `pattern`, and `weight`. |
| `scan.meta` | `Object` | Additional scan metadata. |
| `guard.point` | `'input' \| 'output'` | Whether this scan was on the tool input or output. |
| `guard.tool` | `string \| null` | Name of the tool being guarded. |
| `guard.mode` | `string` | The resolved mode for this direction. |
| `guard.timestamp` | `number` | Unix timestamp (ms) when the scan was performed. |
| `guard.durationMs` | `number` | How long the scan took in milliseconds. |
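As a concrete reference, a flagged output scan might produce a result shaped like the following. The field names follow the table above; the specific values are illustrative:

```js
// Illustrative GuardResult for a blocked output scan. Field names match
// the documented shape; the concrete values are made up for this example.
var guardResult = {
  flagged: true,
  action: 'block',
  scan: {
    flagged: true,
    risk: 'high',
    score: 82,
    threshold: 35,
    matches: [{ category: 'instruction_override', pattern: '...', weight: 35 }],
    meta: {}
  },
  guard: {
    point: 'output',
    tool: 'web_search',
    mode: 'block',
    timestamp: Date.now(),
    durationMs: 1
  }
};

console.log(guardResult.scan.risk, guardResult.guard.point); // prints: high output
```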
GuardError
When a detection occurs in block mode (or when a callback returns false), Guard throws a GuardError. This is a standard Error with additional properties.
```js
try {
  var result = await safeTool(input);
} catch (e) {
  if (e.name === 'GuardError') {
    // e.message     — Human-readable description of what was blocked
    // e.name        — Always 'GuardError'
    // e.guardResult — Full GuardResult object (see table above)
    console.log(e.message);
    // "Prompt injection blocked in output (web_search) — risk: high, score: 82"
    console.log(e.guardResult.scan.matches);
    // [{ category: 'instruction_override', pattern: '...', weight: 35 }]
    console.log(e.guardResult.guard.tool);
    // 'web_search'
  }
}
```
| Property | Type | Description |
|---|---|---|
| `name` | `string` | Always `'GuardError'`. Use this for reliable catch filtering. |
| `message` | `string` | Describes the block: direction, tool name, risk level, and score. |
| `guardResult` | `Object` | The full `GuardResult` that triggered the block. |
Framework Examples
Guard works with any agent framework. Here are examples for the most common ones.
OpenAI SDK
Wrap tool functions before passing them to the OpenAI function-calling flow.
```js
var { createGuard } = require('@safepaste/guard');
var OpenAI = require('openai');

var client = new OpenAI();
var guard = createGuard({ mode: 'block' });

// Your tool implementations
var tools = {
  web_search: async function (query) { /* ... */ },
  read_file: async function (path) { /* ... */ }
};

// Wrap all tools
var safeTools = guard.wrapTools(tools);

// In your function-calling loop:
for (var call of toolCalls) {
  try {
    var result = await safeTools[call.function.name](
      JSON.parse(call.function.arguments)
    );
  } catch (e) {
    if (e.name === 'GuardError') {
      result = { error: 'Tool blocked: prompt injection detected' };
    }
  }
}
```
Vercel AI SDK
Guard the tool execute functions in your Vercel AI SDK tool definitions.
```js
var { createGuard } = require('@safepaste/guard');
var { tool } = require('ai');
var { z } = require('zod');

var guard = createGuard({ mode: 'block' });

var searchTool = tool({
  description: 'Search the web',
  parameters: z.object({ query: z.string() }),
  execute: guard.wrapTool('web_search', async function ({ query }) {
    // Your search implementation
    return await fetchSearchResults(query);
  })
});
```
LangChain JS
Wrap the function inside your LangChain DynamicTool or DynamicStructuredTool definitions.
```js
var { createGuard } = require('@safepaste/guard');
var { DynamicTool } = require('@langchain/core/tools');

var guard = createGuard({ mode: 'block' });

var searchTool = new DynamicTool({
  name: 'web_search',
  description: 'Search the web for information',
  func: guard.wrapTool('web_search', async function (query) {
    // Your search implementation
    return await fetchSearchResults(query);
  })
});
```
Custom Agent Loop
For custom agent implementations, use manual scanning at the points you control.
```js
var { createGuard } = require('@safepaste/guard');

var guard = createGuard({
  mode: { input: 'warn', output: 'block' },
  on: {
    detection: function (r) { logToMonitoring(r); },
    blocked: function (r) { alertOps(r); }
  }
});

async function agentLoop(messages) {
  while (true) {
    var response = await llm.chat(messages);
    if (!response.toolCall) break;

    // Scan the input the agent is sending to the tool
    guard.scanInput(response.toolCall.args, { tool: response.toolCall.name });

    // Execute the tool
    var toolResult = await executeTool(response.toolCall);

    // Scan the output coming back from the tool
    try {
      guard.scanOutput(toolResult, { tool: response.toolCall.name });
    } catch (e) {
      if (e.name === 'GuardError') {
        toolResult = '[blocked: injection detected in tool output]';
      }
    }

    messages.push({ role: 'tool', content: toolResult });
  }
}
```
Fail-Open Design
If the scanning process itself throws, Guard fails open: the tool still executes, and the on.error callback is called so you can log the failure. Guard degrades to no scanning rather than blocking everything. The only exception is GuardError — an intentional block from detecting an attack — which is always re-thrown.
This means you can add Guard to a production pipeline without risk of it becoming a single point of failure. If something goes wrong with the scan, your agent keeps running and you get notified through the on.error callback.
```js
var guard = createGuard({
  mode: 'block',
  on: {
    error: function (err, ctx) {
      // Scanning failed — tool still executes
      console.error('Guard scan error on', ctx.tool, ctx.point, err);
      metrics.increment('guard.scan_error');
    }
  }
});
```
Need Help?
Questions about integrating Guard into your agent pipeline? We're here to help.
Contact Support