Document Layout and Table Detection
A browser-side document structure detection tool, based on PP-DocLayout, that locates titles, paragraphs, tables, and figures
How to Use
- Upload a report, paper, table screenshot, or scanned page
- Wait for the local layout model to detect titles, paragraphs, tables, and figures
- Review structure boxes and labels on the page
- Use the result to decide follow-up OCR, cropping, or table extraction steps
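The last step above can be sketched as a small routing function. This is only an illustration: the `LayoutRegion` shape, the label names, and the label-to-step mapping are assumptions made for the example, not the tool's documented output format.

```typescript
// Hypothetical shape of one detected region (not the tool's documented output).
type RegionLabel = "title" | "paragraph" | "table" | "figure";

interface LayoutRegion {
  label: RegionLabel;
  // Pixel coordinates of the box: [left, top, right, bottom].
  box: [number, number, number, number];
}

type FollowUp = "ocr" | "table-extraction" | "crop";

// Assumed mapping from region label to follow-up step.
const FOLLOW_UP_BY_LABEL: Record<RegionLabel, FollowUp> = {
  title: "ocr",
  paragraph: "ocr",
  table: "table-extraction",
  figure: "crop",
};

// Group detected regions by the follow-up step each one should receive.
function planFollowUp(regions: LayoutRegion[]): Record<FollowUp, LayoutRegion[]> {
  const plan: Record<FollowUp, LayoutRegion[]> = {
    ocr: [],
    "table-extraction": [],
    crop: [],
  };
  for (const region of regions) {
    plan[FOLLOW_UP_BY_LABEL[region.label]].push(region);
  }
  return plan;
}
```

Keeping the mapping in one lookup table means a different choice, such as sending figures to OCR to read captions, is a one-line change.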
Examples
- Detect report layout
Input: Upload one PDF page screenshot
Output: Marks title, body text, image, and table regions
- Locate table regions
Input: Upload a scanned page with a table
Output: Returns table box positions for later cropping or recognition
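For the cropping step, a detected box usually needs a small margin and must stay inside the image. The sketch below assumes boxes arrive as pixel coordinates with `left`, `top`, `right`, and `bottom` fields. That format, the `toCropRect` name, and the padding value are illustrative assumptions, not something the tool specifies.

```typescript
// Hypothetical box format: pixel coordinates of the detected region.
interface Box {
  left: number;
  top: number;
  right: number;
  bottom: number;
}

interface CropRect {
  x: number;
  y: number;
  width: number;
  height: number;
}

// Expand a box by `padding` pixels on every side, round outward to whole
// pixels, and clamp the result to the image bounds.
function toCropRect(
  box: Box,
  padding: number,
  imageWidth: number,
  imageHeight: number
): CropRect {
  const left = Math.max(0, Math.floor(box.left - padding));
  const top = Math.max(0, Math.floor(box.top - padding));
  const right = Math.min(imageWidth, Math.ceil(box.right + padding));
  const bottom = Math.min(imageHeight, Math.ceil(box.bottom + padding));
  return {
    x: left,
    y: top,
    // A box lying entirely outside the image collapses to zero size.
    width: Math.max(0, right - left),
    height: Math.max(0, bottom - top),
  };
}
```

In a browser, the resulting rectangle could be passed as the source rectangle to the nine-argument form of `CanvasRenderingContext2D.drawImage` to copy the table region onto its own canvas.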
FAQ
- How is layout detection different from OCR?
Layout detection finds page regions. OCR reads text. A common flow is layout first, then OCR on target regions.
- Is my document image uploaded?
No. Images are processed in the browser with local inference where available.
- Can boxes be used as final results?
Treat them as assistive results, not final output. Complex layouts, skewed scans, and low-resolution images may need manual review.
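One way to decide which boxes need manual review is to split them by detector confidence. The sketch below assumes each box carries a `score` between 0 and 1; whether this tool exposes such a score is not stated here. The `triageByScore` name and the threshold are illustrative only.

```typescript
// Result of splitting regions into trusted ones and ones to check by hand.
interface Triage<T> {
  accepted: T[];
  needsReview: T[];
}

// Split regions by an assumed per-box confidence score.
// Regions scoring below `minScore` are set aside for manual review.
function triageByScore<T extends { score: number }>(
  regions: T[],
  minScore: number
): Triage<T> {
  const result: Triage<T> = { accepted: [], needsReview: [] };
  for (const region of regions) {
    if (region.score >= minScore) {
      result.accepted.push(region);
    } else {
      result.needsReview.push(region);
    }
  }
  return result;
}
```

A sensible threshold depends on the document type. Skewed or low-resolution scans may call for a stricter value, so that more boxes are sent to review.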
Related tools
- Image to Base64
Convert image files to Base64 strings
- Image Upscaling and Restoration
A free-forever, unlimited Real-ESRGAN 4x image upscaling and restoration tool with general upscaling, photo enhancement, compression repair, line-art enhancement, and pre-upscale sharpening. Images are processed locally and never uploaded.
- Browser OCR
A local PP-OCRv5 mobile text detection and recognition tool for screenshots, receipts, IDs, and form images
- Monocular Depth Estimation
A Depth Anything V2 Small ONNX/WebNN browser tool for generating relative depth maps
- SVG Preview
Preview and edit SVG code online