Skip to main content

Overview

Parse is Contextual AI’s structured data extraction model. It excels at converting unstructured text (PDF, DOCX) into markdown format by identifying and extracting key information.

Key Features

  • Extracts unstructured data into markdown text
  • Hierarchical representation of sections including title, headers, etc.
  • Table extraction
  • Multiple output formats (markdown-per-page, markdown-document, and blocks-per-page)

Getting Started

See the Parse How-to guide for a detailed walkthrough on how to use the Parse API to extract structured data from documents.

Additional Resources