decoded365
← All changes
NewMicrosoft Purview · Data Loss Prevention

Microsoft Purview compliance portal: Data Loss Prevention for endpoints - Optical character recognition (OCR) support for embedded images in endpoint

OCR for endpoint DLP will expand to detect sensitive content in images embedded within Office files, container files, and hybrid PDFs, increasing detection coverage beyond standalone images.

Key dates

  • 2026-05rollout (Rolling out; rollout paused as of May 20, 2026, resuming soon)

Microsoft's description

This release will extend OCR support from standalone images (JPEG, JPG, PNG, BMP, TIFF, and PDF) to images embedded inside the following files and file types: Office files (XLSX, DOCX, PPTX), container files (zip, rar, 7z, and more), and PDF files. Image-only PDF files are already supported, and this this release will support hybrid PDF files containing images and searchable text. Updated May 20, 2026: We have paused rollout and will resume soon. Thank you for your patience.

View on Microsoft roadmap →