# Can Redacted Information Be Recovered From a PDF?

> It depends on how the redaction was done. Visual overlays leave text fully recoverable. Permanent content-stream deletion leaves nothing to recover.

- **Author:** Neetusha
- **Published:** 2026-06-22
- **URL:** https://www.redactifyai.com/answers/can-redacted-information-be-recovered-from-pdf/

---

It depends entirely on how the redaction was done. If a black box, shape, or annotation was placed on top of text without modifying the underlying file structure, the text is still in the PDF and can be extracted with no special tools. If a proper redaction tool was used to delete the content from the PDF content stream and the file was saved correctly, there is nothing left in the file to recover. The distinction is not visible to the naked eye: both methods produce a black rectangle on screen, but only one of them actually removes the data.

## How overlay-only redactions can be reversed

When a PDF editor draws a black rectangle over text without touching the content stream, several straightforward methods expose the hidden text:

- **Copy and paste**: Select the area and press Ctrl+C. Many PDF readers copy text from the content stream rather than from what is rendered on screen, so the covered text ends up on the clipboard.
- **PDF text extraction**: Tools such as `pdftotext` (part of the [poppler utilities](https://poppler.freedesktop.org/)) or Python libraries such as pypdf (the successor to PyPDF2) read directly from the content stream and ignore visual overlays entirely.
- **Annotation deletion**: Open the file in any PDF editor that supports annotation management. The black shape is listed as an annotation object. Delete it and the original text is visible again.
- **Different PDF viewers**: Some viewers render the page without displaying all annotation types. Text hidden by an annotation in one viewer may appear uncovered in another.
- **Metadata and document structure inspection**: Tools such as `pdf-parser.py` from [Didier Stevens](https://blog.didierstevens.com/programs/pdf-tools/) can enumerate every object in the PDF, including text strings that are visually covered.
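
To illustrate why extraction ignores the overlay, here is a minimal sketch using only the Python standard library. It assumes a toy, hand-written content stream (`PAGE_STREAM` is hypothetical) in which a black rectangle is drawn after the text, and it uses a simplified regular expression instead of a real PDF parser. Real files need a library such as pypdf or a tool such as `pdftotext`.

```python
import re
import zlib

# Toy page content stream (hypothetical): one line of text, then a black
# rectangle painted over the same area of the page.
PAGE_STREAM = (
    b"BT /F1 12 Tf 72 700 Td (SSN: 123-45-6789) Tj ET\n"
    b"0 0 0 rg 70 695 160 16 re f\n"
)

# PDF writers commonly store streams Flate-compressed, so the text is not
# readable in a hex editor but returns after a single decompression step.
stored = zlib.compress(PAGE_STREAM)

# Matches literal strings shown with the Tj operator, e.g. "(text) Tj".
# Simplification: ignores TJ arrays, hex strings, and escaped parentheses.
TJ_PATTERN = re.compile(rb"\(([^()\\]*)\)\s*Tj")


def extract_text(compressed_stream: bytes) -> list[str]:
    """Return every literal string drawn with Tj in a Flate-compressed stream."""
    raw = zlib.decompress(compressed_stream)
    return [m.decode("latin-1") for m in TJ_PATTERN.findall(raw)]


print(extract_text(stored))  # ['SSN: 123-45-6789']
```

The rectangle changes what is painted on the page, but the string operand of `Tj` is still in the stream, which is all a text extractor looks at.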

The Department of Justice and several federal courts have issued guidance specifically warning that annotating over text does not constitute proper redaction. The [Administrative Office of the U.S. Courts guidance on redaction](https://www.uscourts.gov/rules-policies/judiciary-policies/privacy-policy/guide-judiciary-policy-privacy-electronic-public-access) underscores that the underlying electronic content must be removed, not merely covered.

## What permanent redaction does to a PDF

A real redaction tool modifies the PDF at the object level. It locates the text rendering instructions in the content stream for the targeted region, deletes those instructions, removes any associated glyph data or embedded font subsets that relate only to the deleted characters, and writes a flat filled rectangle as part of the page content rather than as a removable annotation. The result is a file in which the deleted characters no longer exist. Running `pdftotext` on a permanently redacted file returns whitespace or nothing for the redacted region, not the original text.
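
The core idea can be sketched on a toy content stream. This is an illustration under strong assumptions, not how any particular product works: `redact_stream` is a hypothetical helper, the stream is uncompressed and hand-written, and each text object holds one literal string. Production tools also have to handle font encodings, glyph positioning, split strings, and font subsets.

```python
import re

# Toy content stream (hypothetical): two text lines, one of them sensitive.
PAGE_STREAM = (
    b"BT /F1 12 Tf 72 720 Td (Name: Jane Doe) Tj ET\n"
    b"BT /F1 12 Tf 72 700 Td (SSN: 123-45-6789) Tj ET\n"
)

# One whole BT ... ET text object that draws a single literal string with Tj.
TEXT_OBJECT = re.compile(rb"BT\b[^\n]*?\(([^()\\]*)\)\s*Tj\s*ET\n?")


def redact_stream(stream: bytes, secret: bytes, box: tuple[int, int, int, int]) -> bytes:
    """Delete text objects containing `secret`, then paint a black box in the page content."""

    def drop_if_secret(match: re.Match) -> bytes:
        # Remove the entire text object when its string contains the secret.
        return b"" if secret in match.group(1) else match.group(0)

    cleaned = TEXT_OBJECT.sub(drop_if_secret, stream)
    x, y, w, h = box
    # The fill is written into the content stream itself, not added as an annotation.
    fill = b"0 0 0 rg %d %d %d %d re f\n" % (x, y, w, h)
    return cleaned + fill


redacted = redact_stream(PAGE_STREAM, b"123-45-6789", (70, 695, 160, 16))
print(redacted.decode("latin-1"))
```

After the call, the sensitive string is absent from the bytes that would be written to disk, so no extractor can return it. The unrelated line is left intact.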

Properly redacted PDFs also have their annotation layer flattened so there is no separate layer to strip. For scanned documents, where the page content is an image rather than text objects, permanent redaction burns the black fill directly into the image pixels so that even optical character recognition tools cannot read the concealed area.
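
For the scanned-page case, the following sketch shows what burning the fill into the pixels means. It assumes a toy grayscale image held as nested lists, and `burn_redaction` is a hypothetical helper. A real workflow would decode the embedded page image, overwrite the region, and re-encode it.

```python
def burn_redaction(pixels, left, top, right, bottom):
    """Return a copy of the image with the box [left, right) x [top, bottom) set to black."""
    redacted = [row[:] for row in pixels]  # copy so the input is not modified
    for y in range(top, bottom):
        for x in range(left, right):
            redacted[y][x] = 0  # overwrite the pixel; the old value is discarded
    return redacted


# Toy 4x3 scan: 255 is white paper, lower values are ink.
scan = [
    [255, 255, 255, 255],
    [255, 90, 120, 255],
    [255, 255, 255, 255],
]
clean = burn_redaction(scan, left=1, top=1, right=3, bottom=2)
print(clean[1])  # [255, 0, 0, 255]
```

Because the original pixel values are replaced rather than covered, the redacted image contains no trace of the ink for OCR to read.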

## How to check whether a redaction is permanent

Four tests reveal whether a PDF's redactions removed data or merely hid it:

1. Select the redacted area in Adobe Acrobat Reader or any other PDF viewer and press Ctrl+C. Paste into a plain-text editor. If text appears, the redaction is not permanent.
2. Search the document for a specific term that should be redacted (Ctrl+F). If it is found, the text is still present in the content stream.
3. Open the file in a different PDF viewer. A black shape annotation that conceals text in one viewer may not render in another.
4. Run `pdftotext filename.pdf -` in a terminal and search the output for the redacted content.

If any test reveals the covered content, the document must be re-redacted using a tool that performs content-stream deletion before it can be safely shared.
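
The `pdftotext` test can be scripted when many files need checking. The sketch below assumes `pdftotext` is installed and on the `PATH`. The helper names and the sample text are hypothetical, and the example call uses a stand-in string rather than a real file.

```python
import subprocess


def extract_with_pdftotext(pdf_path: str) -> str:
    """Run `pdftotext <file> -` and return the text it writes to stdout."""
    result = subprocess.run(
        ["pdftotext", pdf_path, "-"],
        capture_output=True,
        text=True,
        check=True,  # raise if pdftotext exits with an error
    )
    return result.stdout


def find_leaks(extracted_text: str, redacted_terms: list[str]) -> list[str]:
    """Return the terms that still appear in the extracted text (case-insensitive)."""
    haystack = extracted_text.casefold()
    return [term for term in redacted_terms if term.casefold() in haystack]


# Stand-in for pdftotext output (hypothetical text, not from a real file).
sample_output = "Claimant: Jane Doe\nSSN: 123-45-6789\n"
print(find_leaks(sample_output, ["123-45-6789", "Acme Corp"]))  # ['123-45-6789']
```

In practice the call would be `find_leaks(extract_with_pdftotext("filename.pdf"), terms)`. An empty result is encouraging but not proof: extracted text can break a term across lines, and image-only pages yield no text at all, so the manual tests above still apply.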

## Choosing a tool that applies permanent redaction

Tools that perform permanent PDF redaction include Adobe Acrobat Pro (using the dedicated Redact workflow under Tools, not the annotation tools) and purpose-built redaction software such as [RedactifyAI](https://redactifyai.com). RedactifyAI uses AI to detect sensitive information such as names, Social Security numbers, dates of birth, and financial account numbers, then permanently removes it from the content stream in a single automated pass. The shape-drawing and markup tools in free PDF readers, Preview on macOS, and most basic PDF editors do not perform permanent redaction, even though they let you draw shapes over text.