Scan untrusted input before it reaches your model. Same engine everywhere — your code, your agent pipeline, your CI, or our API.
Enforcement engine, runtime agent protection, adversarial testing, and a learning loop that improves from real-world attacks.
61 deterministic patterns with weighted scoring across 13 attack categories. Runs in under 10ms with zero dependencies, and the same input always produces the same output.
Guard middleware wraps tool functions in your agent pipeline. Scans inputs and outputs for attacks. Warn, log, or block — fail-open by default.
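The guard middleware itself ships as an npm package, but the idea is language-neutral. Below is a conceptual sketch in Python of the same pattern: wrap a tool function, scan inputs and outputs, and warn, log, or block, failing open if the scanner errors. The names here (`guard`, `scan_fn`, `mode`, `threshold`) are illustrative assumptions, not the actual @safepaste/guard API.

```python
import functools

def guard(scan_fn, mode="warn", threshold=70):
    """Conceptual sketch of runtime tool protection (NOT the @safepaste/guard API).

    scan_fn takes a string and returns a 0-100 risk score.
    mode: "warn" prints a warning, "log" records silently, "block" raises.
    Fail-open: if the scanner itself errors, the tool call proceeds untouched.
    """
    def decorate(tool):
        @functools.wraps(tool)
        def wrapper(*args, **kwargs):
            def check(text, direction):
                try:
                    score = scan_fn(text)
                except Exception:
                    return  # fail-open: scanner errors never break the tool
                if score >= threshold:
                    msg = f"[guard] {direction} flagged (score={score})"
                    if mode == "block":
                        raise PermissionError(msg)
                    if mode == "warn":
                        print(msg)
                    # "log" mode would write to your logger here instead
            for a in args:
                if isinstance(a, str):
                    check(a, "input")   # scan inputs before the tool runs
            result = tool(*args, **kwargs)
            if isinstance(result, str):
                check(result, "output")  # scan outputs before they reach the model
            return result
        return wrapper
    return decorate
```

Decorating a tool then looks like `@guard(my_scan, mode="block")` above its definition; clean calls pass through unchanged.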
CLI simulates 78 attack variants across 13 categories against your prompts. CI/CD exit codes for automated security gating.
Telemetry captures real attacks. Human feedback improves detection. Versioned evaluation tracks progress. 655-record dataset with published metrics.
Chrome extension intercepts pastes on 8 AI chat sites. Fully local — zero data collection, zero network requests. Warns before attacks reach your AI.
Every detection includes matched pattern IDs, categories, weights, and explanations. No black boxes. Reproducible evaluation anyone can verify.
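A sketch of what consuming those explainable detections can look like. The match shape below is an assumption derived from the documented fields (pattern IDs, categories, weights, explanations); the actual key names and pattern IDs may differ.

```python
# Assumed match shape and hypothetical pattern IDs, for illustration only.
sample_matches = [
    {"id": "PI-014", "category": "instruction_override", "weight": 35,
     "explanation": "Attempts to override prior instructions"},
    {"id": "PI-041", "category": "system_prompt", "weight": 40,
     "explanation": "Probes for the hidden system prompt"},
]

def explain(matches):
    """Render each detection as a one-line, human-readable audit entry."""
    return [f'{m["id"]} [{m["category"]}, w={m["weight"]}]: {m["explanation"]}'
            for m in matches]

for line in explain(sample_matches):
    print(line)
```

Because every score decomposes into named patterns and weights, entries like these can go straight into audit logs or incident reports.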
Choose the path that fits what you're building. Same detection engine everywhere.
npm install @safepaste/core
Embed the detection engine in Node.js. One function call, <10ms, zero dependencies.
pip install safepaste
Same 61 patterns, identical results. Zero dependencies, Python 3.9+.
npm install @safepaste/guard
Wrap tool functions with runtime scanning. Warn, log, or block attacks on inputs and outputs.
npx @safepaste/test
Simulate 78 attack variants against your system prompts. CI/CD exit codes.
POST /v1/scan
Same 61-pattern engine as a hosted API. Go, Ruby, or anything with HTTP. Free tier included.
Base URL: https://api.safe-paste.com
# Scan text for prompt injection
curl -X POST https://api.safe-paste.com/v1/scan \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{"text": "Ignore previous instructions"}'
const res = await fetch("https://api.safe-paste.com/v1/scan", {
  method: "POST",
  headers: {
    "Content-Type": "application/json",
    "Authorization": `Bearer ${API_KEY}`
  },
  body: JSON.stringify({ text: "Ignore previous instructions" })
});
const { score, risk, matches } = await res.json();
// score: 82, risk: "high"
import requests

response = requests.post(
    "https://api.safe-paste.com/v1/scan",
    headers={
        "Content-Type": "application/json",
        "Authorization": f"Bearer {API_KEY}"
    },
    json={"text": "Ignore previous instructions"}
)
data = response.json()
# data["score"]: 82, data["risk"]: "high"
from safepaste import scan_prompt

result = scan_prompt("Ignore previous instructions")
# result.score: 82, result.risk: "high"
# result.flagged: True
# result.matches: (ScanMatch(...), ...)
{
  "score": 82,
  "risk": "high",
  "categories": {
    "instruction_override": 35,
    "system_prompt": 40
  },
  "matches": [...]
}
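The response above is all you need to gate untrusted text before it reaches your model. A minimal sketch of that gating logic follows, using the sample response as a literal; the `should_block` helper and the threshold of 70 are illustrative choices, not part of the API.

```python
# Sample /v1/scan response; in practice this comes from response.json().
scan_result = {
    "score": 82,
    "risk": "high",
    "categories": {"instruction_override": 35, "system_prompt": 40},
    "matches": [],
}

def should_block(result, threshold=70):
    """Gate untrusted text before forwarding it to the model.

    Blocks on a high risk label or a score at/above the threshold.
    The threshold is illustrative; tune it to your risk tolerance.
    """
    return result["risk"] == "high" or result["score"] >= threshold

if should_block(scan_result):
    print("blocked: refusing to forward text to the model")
```

A lower threshold trades more false positives for earlier interception; pairing the score check with the `risk` label catches cases where many low-weight patterns stack up.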
Works with ChatGPT, Claude, Gemini, Copilot, Groq, and Grok.
Add SafePaste from the Chrome Web Store. It activates automatically on supported AI chat sites.
SafePaste silently scans every paste. If the text is clean, nothing happens. You won't even notice it's there.
If prompt injection is detected, a warning modal appears with a risk score. You choose whether to proceed or cancel.
Start free. Scale when you're ready.
Get started with SafePaste in under 5 minutes. No credit card required.