This module defines the document structure used throughout the package.
Document
Bases: TypedDict
A TypedDict that represents a page in a PDF file.
Attributes:
- page_content (str) – The text content of the page.
- metadata (NotRequired[dict[str, Any]]) – Additional metadata about the page.
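
The following is a minimal sketch of how a TypedDict with these attributes could be declared and used. It is reconstructed from the attribute list above, not copied from the package source. The `page_number` metadata key is hypothetical, and the fallback import assumes `typing_extensions` is installed on Python versions before 3.11.

```python
try:
    from typing import Any, NotRequired, TypedDict  # Python 3.11+
except ImportError:
    # Assumption: typing_extensions is available on older interpreters.
    from typing import Any
    from typing_extensions import NotRequired, TypedDict


class Document(TypedDict):
    """A page in a PDF file (sketch based on the documented attributes)."""

    page_content: str
    metadata: NotRequired[dict[str, Any]]


# metadata is NotRequired, so a Document may omit it entirely.
bare_page: Document = {"page_content": "Hello, world."}

# "page_number" is a hypothetical key; the docs do not specify metadata contents.
annotated_page: Document = {
    "page_content": "Introduction",
    "metadata": {"page_number": 1},
}

# At runtime a TypedDict instance is a plain dict, so read the optional
# key with .get() instead of indexing.
bare_metadata = bare_page.get("metadata", {})
page_number = annotated_page["metadata"]["page_number"]
```

Because `metadata` is optional, code that consumes a `Document` should not assume the key is present; a static type checker will flag direct indexing of a `NotRequired` key.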