Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
The file to be parsed. The file type must be PDF, DOC / DOCX, PPT / PPTX.
The settings to use for parsing. basic is for simple, text-only documents. standard is for complex documents with images, complex hierarchy, and/or no natively encoded textual data (e.g. for scanned documents).
basic, standard Adds a table of contents to the output with the structure of the entire parsed document. This feature is in beta. Controls parsing heading levels (e.g. H1, H2, H3) at higher quality. Not permitted in basic parsing_mode, or if page_range is not continuous and/or does not start from page zero.
true
Controls whether tables are split into multiple tables by row with the headers propagated. Use for improving LLM comprehension of very large tables. Not permitted in basic parsing_mode.
false
Threshold number of table cells beyond which large tables are split if enable_split_tables is True. Must be null if enable_split_tables is False.
null
Controls how thorough figure captions are. concise is short and minimizes chances of hallucinations. detailed is more thorough and can include commentary; this mode is in beta. Not permitted in basic parsing_mode.
concise, detailed Optional string representing page range to be parsed. Format: comma-separated indexes (0-based, e.g. 0,1,2,5,6), or ranges inclusive of both ends (e.g. 0-2,5,6)
null
Response
Successful Response
/parse response object.
Unique ID of the parse job