Quality + safety for AI agent skills

Make your agent skills measurably better.
And prove they're safe.

Most agent skills are written once and never tuned. Research shows curated skills lift task completion ~16% while self-written ones add almost nothing. Skillproof grades any skill for quality and safety, then rewrites it into a better best_skill.md, grounded in 4,000+ exemplar skills. Independent. Cross-agent. The free scan needs no signup.

Scan any skill

Paste a SKILL.md or a GitHub URL. You'll get a security grade, a quality grade, and a verdict, instantly, in your browser.

Two grades. One report.

Every other tool checks one or the other. Skillproof is the independent third party that does both, and then fixes the skill for you.

Lead · the optimizer

✨ Quality & rewrite

We score structure, triggering, steps, examples, output contract and token budget. The optimizer then makes a bounded-edit rewrite of the skill body (not just the description), grounded in retrieved exemplar skills, and reports the measured before/after uplift. The SkillOpt method, productized.

Bundled · trust

🛡️ Safety scan

The scan covers invisible-Unicode and ASCII smuggling, bidi overrides, prompt injection, secret exfiltration, dangerous code, over-broad tool grants, config and memory poisoning (CLAUDE.md, settings.json, .mcp.json), and supply-chain risk. You get a letter grade and a verdict, with any smuggled bytes shown to you, never carried along.
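To make the invisible-Unicode and bidi checks concrete, here is a minimal Python sketch of how such a check could work. It is not Skillproof's scanner: the function names `find_hidden_chars` and `decode_tag_payload` are hypothetical, and the detection rules (Unicode general category `Cf`, the tag block U+E0000 to U+E007F, and the standard bidi control characters) are assumptions about what a reasonable first pass looks like. Findings are reported as code points, so the hidden characters are shown rather than copied into the output.

```python
import unicodedata

# Bidirectional control characters: embeddings, overrides, and isolates.
BIDI_CONTROLS = {
    "\u202a", "\u202b", "\u202c", "\u202d", "\u202e",
    "\u2066", "\u2067", "\u2068", "\u2069",
}


def find_hidden_chars(text):
    """Return (line, column, codepoint, kind) for each suspicious character.

    Hypothetical helper: the categories below are an assumed first-pass
    rule set, not a documented Skillproof rule list.
    """
    findings = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for col, ch in enumerate(line, start=1):
            cp = ord(ch)
            if ch in BIDI_CONTROLS:
                kind = "bidi-control"
            elif 0xE0000 <= cp <= 0xE007F:
                # Unicode tag block, the usual carrier for ASCII smuggling.
                kind = "tag-character"
            elif unicodedata.category(ch) == "Cf":
                # Other invisible format characters, e.g. zero-width space.
                kind = "invisible-format"
            else:
                continue
            # Report the code point as text instead of the raw character.
            findings.append((line_no, col, f"U+{cp:04X}", kind))
    return findings


def decode_tag_payload(text):
    """Map tag characters back to the ASCII they mirror, to show what was hidden."""
    return "".join(
        chr(ord(ch) - 0xE0000) for ch in text if 0xE0020 <= ord(ch) <= 0xE007E
    )
```

A check this simple will also flag legitimate uses, such as the zero-width joiner inside emoji sequences, so a real scanner would need an allowlist or context rules on top.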

Why Skillproof and not the others

The skill-security space is crowded and we don't pretend otherwise. We win on the three things no one else does together.

Corpus-grounded

Edits are driven by retrieval over 4,000+ curated skills, the reference standard for "what good looks like." Others optimize against your own evals or not at all.

Full body rewrite

First-party tools stop at tuning the description so it triggers. We rewrite the actual logic, and refuse any rewrite that lowers your safety grade.
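The "refuse any rewrite that lowers your safety grade" rule can be sketched as a simple gate. This is an illustration under stated assumptions, not the product's code: the A-to-F scale in `GRADE_ORDER` and the function name `accept_rewrite` are hypothetical.

```python
# Hypothetical A-F scale, ordered worst to best; the real scale may differ.
GRADE_ORDER = ["F", "D", "C", "B", "A"]


def accept_rewrite(safety_before: str, safety_after: str) -> bool:
    """Accept a rewrite only if its safety grade is at least the original's."""
    return GRADE_ORDER.index(safety_after) >= GRADE_ORDER.index(safety_before)
```

Under this sketch a rewrite that moves a skill from B to C is rejected however much its quality score improves, while B to B or B to A passes.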

Vendor-neutral

Not a marketplace, not a security-platform upsell. One independent report across Claude, Cursor, Codex and Gemini skills, embeddable as a badge.

Pricing

The public scan is free forever. You pay to optimize, and to gate quality + safety in your pipeline or marketplace.

Free
$0

Public scans, no signup

  • Unlimited public scans
  • Quality + safety grade
  • Shareable report page
  • Embeddable badge

Team / CI
$99/mo

For platform & AppSec teams

  • 5 seats
  • 1,000 optimizations / mo
  • GitHub Action CI-gate
  • Continuous monitoring + alerts

Marketplace
$499/mo

For directories & vendors

  • Bulk certify + optimize a catalog
  • 10,000 optimizations / mo
  • "Verified by Skillproof" OEM badge
  • SSO + SLA

FAQ

Isn't skill scanning already a thing?

Yes. Snyk, Socket, Cisco and others scan skills for safety, and they're good. None of them improve your skill. Skillproof leads with the optimizer and bundles a safety scan so you get one honest "safe and good" report.

How is the uplift measured?

The free grade is a deterministic static lint. The optimizer reports a real before/after quality delta. A task-validated uplift (full SkillOpt-style held-out evaluation) is available on Team/Enterprise when you bring an eval set, and we label estimates honestly.
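As a rough illustration of what "deterministic static lint" and "before/after quality delta" can mean, here is a minimal Python sketch. Everything in it is an assumption made for the example: the five checks, the equal weighting, the 1,500-word budget, and the names `lint_skill`, `score` and `delta`. It is not Skillproof's rubric. It also assumes the skill opens with a `---` delimited header containing `name:` and `description:` fields.

```python
import re

FENCE = "`" * 3  # a Markdown code fence marker, built here to keep this block fence-safe


def lint_skill(markdown):
    """Run fixed static checks; the same input always yields the same result."""
    frontmatter = re.match(r"\A---\n(.*?)\n---\n", markdown, re.DOTALL)
    header = frontmatter.group(1) if frontmatter else ""
    body = markdown[frontmatter.end():] if frontmatter else markdown
    return {
        "has_name": bool(re.search(r"^name:\s*\S", header, re.MULTILINE)),
        "has_description": bool(re.search(r"^description:\s*\S", header, re.MULTILINE)),
        "has_numbered_steps": bool(re.search(r"^\s*1\.\s+\S", body, re.MULTILINE)),
        "has_example": FENCE in body,
        # Word count as a crude stand-in for a token budget (hypothetical limit).
        "within_budget": len(body.split()) <= 1500,
    }


def score(markdown):
    """Equal-weight score from 0 to 100 over the checks above."""
    checks = lint_skill(markdown)
    return round(100 * sum(checks.values()) / len(checks))


def delta(before, after):
    """Before/after quality delta, in score points."""
    return score(after) - score(before)
```

Because the checks are pure functions of the text, rerunning them on the same skill gives the same score. A delta like this measures lint improvement only, which is why it is a separate thing from a task-validated uplift on a held-out eval set.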

Do you store my skill?

A scan creates a shareable report you can delete. Optimizations run against your account. We never train on your private skills.

Which agents are supported?

Anything using the SKILL.md / .skill format: Claude Code & Claude Skills, Cursor, Codex, Gemini CLI. The Skillproof CLI and GitHub Action run in your own CI.