Parse vectors + text layer
PDF.js inspects the operator list to identify redaction-like shapes and extract text from standard and aggressive passes.
PDF X-ray
Catch redactions that look safe—but still leak text. Many “redacted” PDFs still contain the original text — it’s just covered by colored boxes. PDF X-ray reveals it in seconds. Run PDF X-ray directly in your browser to highlight “redactions” that still expose selectable or searchable text—and compare the visual PDF with the actual text layer side-by-side.
Live scan status
Updates as your PDF is analyzed.
Status
Ready for a PDF.
Pages
-
Findings
-
The problem
A black box on the page can still leave behind selectable text, searchable layers, exposed metadata, or inconsistent “clean” copies.
Redacted PDF still searchable? That usually means text under black box PDF or a PDF redaction overlay.
PDF X-ray is a redaction audit tool to verify PDF redaction before you share or publish.
How it works
In-browser parsing + Python rules spot leaked text before it ships.
PDF.js inspects the operator list to identify redaction-like shapes and extract text from standard and aggressive passes.
Pyodide runs overlap checks (WASM-accelerated), then filters noise like repeated chars, “redacted” markers, and date-only hits to separate real leaks from noise.
Findings draw SVG overlays, and a text-only PDF is rendered side-by-side for visual vs. extractable comparison.
Live demo
Upload a PDF and review leaks instantly. Nothing leaves your machine by default.
Findings
Rectangles detected
-
Text items detected
-
If rectangles = 0, the PDF likely uses rasterized or non-vector redactions.
Preview
Upload a PDF to see detections and overlays.
Key benefits
Identify visual-only redactions where text still exists.
Export a report of findings for Legal, Security, or Comms.
Browser-first checks reduce data movement and friction.
Messy scans, mixed layers, odd fonts, and legacy exports.
Product demo
Add screenshots or GIFs
Place 2–3 visuals here for the demo.
What we detect
Note: PDF X-ray helps you find likely leaks fast. Final validation depends on your redaction process and toolchain.
Use cases
From legal to incident response, PDF X-ray keeps redaction quality high when timelines are short.
Why browser-based?
Security & privacy
FAQ
No. You can scan only what you can open or decrypt legitimately.
If there’s no text layer, leaks are less likely—but we can still flag metadata and structural oddities.
Re-redact using a true redaction tool that removes hidden text from redacted PDF exports, then re-export and re-scan.
If you ship a fully local build (PWA or desktop wrapper), yes.
It’s a verification tool: “Did this redact correctly?” not “Let me redact.”