📄 Document Analysis Pipeline

Upload a PDF document to extract structured data including questions, options, answers, passages, and embedded images.

Pipeline Steps:

Upload PDF Document

LayoutLMv3 Model Path (optional)

Structured JSON Output

Download Full JSON

The pipeline generates JSON with the following fields:

Questions: Extracted question text
Options: Multiple-choice options (A, B, C, D, etc.)
Answers: Correct answer(s)
Passages: Associated reading passages
Images: Base64-encoded figures and equations (embedded with keys like figure1, equation2)
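
As a rough illustration of how a downstream script might read this output, the sketch below decodes the Base64 image entries from one record. The exact schema is not specified here, so the key names (question, options, answer, passage, images), the nesting, and the decode_images helper are all assumptions made for the example; only the figure1-style image key comes from the description above.

```python
import base64
import json

# Hypothetical output for a single question. The key names and nesting are
# assumptions for illustration; the real pipeline's schema may differ.
SAMPLE_JSON = json.dumps({
    "question": "What does figure1 show?",
    "options": {"A": "A triangle", "B": "A circle"},
    "answer": "A",
    "passage": "",
    "images": {
        # Placeholder bytes standing in for real image data.
        "figure1": base64.b64encode(b"fake image bytes").decode("ascii"),
    },
})


def decode_images(record: dict) -> dict:
    """Map each image key (e.g. figure1) to its decoded raw bytes."""
    images = record.get("images", {})
    return {key: base64.b64decode(data) for key, data in images.items()}


record = json.loads(SAMPLE_JSON)
decoded = decode_images(record)
```

The decoded bytes can then be written to disk or handed to an image library. A record without an images entry simply yields an empty mapping.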