snailsploit[$]
How-to · 6 steps

How to Disclose an AI Vulnerability

Coordinated disclosure timeline for findings in LLM-based systems, from initial report through CVE assignment and public writeup.

Step 1

Document the finding precisely

Include: model, version, system prompt (where applicable), exact input, observed output, expected output, reproduction rate, and scoped impact.
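Those fields can be captured in a structured record so reports stay consistent across findings. A minimal sketch; the class and field names are illustrative, not any standard schema:

```python
from dataclasses import dataclass, asdict

@dataclass
class AIVulnFinding:
    """Structured record for one LLM vulnerability report (illustrative schema)."""
    model: str               # product or model name tested
    version: str             # exact model/API version
    exact_input: str         # the prompt that triggers the issue
    observed_output: str     # what the model actually did
    expected_output: str     # what safe/correct behavior looks like
    repro_rate: float        # successful repros / attempts, 0.0-1.0
    scoped_impact: str       # who is affected and how badly
    system_prompt: str = ""  # include when the deployment exposes one

finding = AIVulnFinding(
    model="example-model",
    version="2025-01-01",
    exact_input="<trigger prompt>",
    observed_output="<leaked system prompt>",
    expected_output="<refusal>",
    repro_rate=0.8,
    scoped_impact="Prompt injection exposes tool credentials to any chat user.",
)
report = asdict(finding)  # plain dict, ready to serialize into the report body
```

Keeping the record machine-readable also makes it easy to diff repro behavior across model versions later.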

Step 2

Identify the right vendor contact

Model providers typically run security@ inboxes. Plugin and agent platform providers may use HackerOne or Bugcrowd. Some vendors don't accept LLM bug reports yet; see Step 5.

Step 3

Send via secure channel

Use encrypted email or a vendor-provided portal. Avoid public bug trackers; even gist links can leak.

Step 4

Set a disclosure timeline

The standard window is 90 days. Move faster (30-60 days) under active exploitation; allow longer (180+ days) when remediation requires retraining or a major architecture change.
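Those timeline rules reduce to a small lookup. A sketch, where the day counts mirror the guidance above and the category names are mine:

```python
from datetime import date, timedelta

# Disclosure windows from the guidance above (category names are illustrative).
DISCLOSURE_DAYS = {
    "standard": 90,             # default coordinated-disclosure window
    "active_exploitation": 30,  # lower bound of the 30-60 day fast track
    "retraining_required": 180, # lower bound when the fix needs retraining
}

def disclosure_deadline(reported: date, category: str = "standard") -> date:
    """Return the planned public-disclosure date for a report."""
    return reported + timedelta(days=DISCLOSURE_DAYS[category])
```

For example, `disclosure_deadline(date(2025, 1, 1))` lands on 2025-04-01, while the active-exploitation track for the same report would allow publication as early as 2025-01-31.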

Step 5

Decide CVE vs no CVE

Traditional CVE assignment doesn't cleanly fit prompt-layer issues. For agent and tool vulnerabilities, a CVE works. For pure model alignment failures, a GHSA or vendor advisory is more appropriate.
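That routing can be expressed as a simple decision function; the issue-class labels are my own shorthand, not an official taxonomy:

```python
def advisory_channel(issue_class: str) -> str:
    """Map an AI vulnerability class to the better-fitting advisory channel."""
    if issue_class in ("agent", "tool", "plugin"):
        return "CVE"             # concrete software flaw: CVE fits
    if issue_class in ("alignment", "prompt_layer"):
        return "GHSA_or_vendor"  # prompt-layer issue: GHSA or vendor advisory
    return "vendor"              # when in doubt, start with the vendor
```

The point is not the code but the discipline: decide the channel up front, so the writeup in Step 6 links to the right identifier.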

Step 6

Publish the writeup

After the patch ships and a grace period passes: technical writeup, reproduction steps, impact, remediation. Add the entry to your CVE ledger and submit the canonical URL to NVD/MITRE if applicable.

See also