We have added a new AI tool called Data Extraction that reads scanned or OCR-processed PDFs and outputs structured, clean Markdown text.

What it does

Upload a scanned PDF and Legal Desk AI will:

Extract all readable text from the document.
Structure it with headings, numbered clauses, and signature blocks preserved.
Convert dense tables into readable numbered sections so multi-line cell content is not lost.
Mark unreadable or damaged sections as [ILLEGIBLE] rather than skipping them silently.
Output the result in whatever language the original document is written in (no translation involved).

When to use it

Police documents, FIRs, and charge sheets that arrive as scans.
Old court orders and decrees that have not been digitised.
Agreements and notices received as low-quality scans.
Identity documents and affidavits where the original is handwritten or typed on a typewriter.

You can optionally provide a document hint (for example: "court order", "agreement", "identity document") to help the AI structure the output more accurately.

How to access it

Go to AI Tools → Data Extraction in your workspace. Upload a single PDF, add an optional document hint, and submit. The extraction runs in the background and appears in your list when ready.

You can copy the extracted text or export it as a DOCX file directly from the result.