A PP-DocLayout browser-side document structure detection tool for titles, paragraphs, tables, and figures

Upload a report, paper, table screenshot, or scanned page
Wait for the local layout model to detect titles, paragraphs, tables, and figures
Review structure boxes and labels on the page
Use the result to decide follow-up OCR, cropping, or table extraction steps
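
The last step of the workflow above can be sketched in code. This is a minimal TypeScript sketch, assuming the detection results are available as a list of regions with a label and a pixel box. The `Region` shape, the label names, and the `planFollowUp` helper are hypothetical and do not describe the tool's actual output format.

```typescript
// Hypothetical region shape; the real tool's output may use other names and labels.
type Label = "title" | "paragraph" | "table" | "figure";
type FollowUp = "ocr" | "table-extraction" | "crop";

interface Region {
  label: Label;
  // Assumed box convention: [left, top, width, height] in pixels.
  box: [number, number, number, number];
}

// Assumed mapping from region label to the follow-up step it usually needs.
const FOLLOW_UP: Record<Label, FollowUp> = {
  title: "ocr",
  paragraph: "ocr",
  table: "table-extraction",
  figure: "crop",
};

// Group regions by follow-up step. Regions are sorted top to bottom, then left
// to right, which approximates reading order on single-column pages only.
function planFollowUp(regions: Region[]): Record<FollowUp, Region[]> {
  const plan: Record<FollowUp, Region[]> = { ocr: [], "table-extraction": [], crop: [] };
  const sorted = [...regions].sort((a, b) => a.box[1] - b.box[1] || a.box[0] - b.box[0]);
  for (const region of sorted) {
    plan[FOLLOW_UP[region.label]].push(region);
  }
  return plan;
}
```

Multi-column pages would need a more careful ordering than the simple sort used here.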

Detect report layout
Input: A screenshot of one PDF page
Output: Title, body text, image, and table regions marked on the page

Locate table regions
Input: A scanned page that contains a table
Output: Table box positions for later cropping or recognition
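
Turning a table box into a crop can be sketched as follows. This is a minimal TypeScript sketch, assuming boxes are given in pixels with the origin at the top-left of the page image. The `PixelBox` shape, the `paddedCrop` helper, and the default padding of 8 pixels are illustrative assumptions, not part of the tool.

```typescript
// Assumed box convention: pixel coordinates, origin at the top-left of the page image.
interface PixelBox {
  x: number; // left edge
  y: number; // top edge
  width: number;
  height: number;
}

// Expand a detected box by `padding` pixels on every side and clamp it to the page,
// so that table borders lying on the box edge are less likely to be cut off.
function paddedCrop(box: PixelBox, pageWidth: number, pageHeight: number, padding = 8): PixelBox {
  const left = Math.max(0, Math.floor(box.x - padding));
  const top = Math.max(0, Math.floor(box.y - padding));
  const right = Math.min(pageWidth, Math.ceil(box.x + box.width + padding));
  const bottom = Math.min(pageHeight, Math.ceil(box.y + box.height + padding));
  return {
    x: left,
    y: top,
    width: Math.max(0, right - left),
    height: Math.max(0, bottom - top),
  };
}
```

In a browser, the resulting rectangle could be passed as the source rectangle of the nine-argument form of `CanvasRenderingContext2D.drawImage` to copy the table into its own canvas.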

How is layout detection different from OCR?

Layout detection locates page regions such as titles, paragraphs, tables, and figures. OCR reads the text inside them. A common flow is to run layout detection first, then OCR on the regions of interest.

Is my document image uploaded?

No. Images are processed in the browser with local inference where available.

Can boxes be used as final results?

Treat them as assistive results rather than final output. Complex layouts, skewed scans, and low-resolution images may need manual review.
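
One way to act on the manual-review advice above is to separate confident detections from doubtful ones. This is a minimal TypeScript sketch, assuming each region carries a confidence score between 0 and 1. The `ScoredRegion` shape, the `splitForReview` helper, and the 0.6 threshold are hypothetical; the tool may not expose scores in this form.

```typescript
// Hypothetical region shape with a confidence score (assumed range: 0 to 1).
interface ScoredRegion {
  label: string;
  score: number;
}

interface ReviewSplit {
  accepted: ScoredRegion[];
  needsReview: ScoredRegion[];
}

// Send regions below `minScore` to manual review and accept the rest.
// The default threshold is an arbitrary starting point, not a recommended value.
function splitForReview(regions: ScoredRegion[], minScore = 0.6): ReviewSplit {
  const result: ReviewSplit = { accepted: [], needsReview: [] };
  for (const region of regions) {
    if (region.score >= minScore) {
      result.accepted.push(region);
    } else {
      result.needsReview.push(region);
    }
  }
  return result;
}
```

A high score does not guarantee a correct box, so skewed or low-resolution pages may still deserve a visual check.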

Image to Base64
Convert image files to Base64 strings
Image Upscaling and Restoration
A free-forever, unlimited Real-ESRGAN 4x image upscaling and restoration tool with general upscaling, photo enhancement, compression repair, line-art enhancement, and pre-upscale sharpening. Images are processed locally and never uploaded.
Browser OCR
A local PP-OCRv5 mobile text detection and recognition tool for screenshots, receipts, IDs, and form images
Monocular Depth Estimation
A Depth Anything V2 Small ONNX/WebNN browser tool for generating relative depth maps
SVG Preview
Preview and edit SVG code online