pdf2docx.page.Pages module
Collection of Page instances.
- class pdf2docx.page.Pages.Pages(instances: Optional[list] = None, parent=None)
Bases: BaseCollection
A collection of Page.
- parse(fitz_doc, **settings)
Analyze document structure, e.g. page section, header, footer.
- Args:
fitz_doc (fitz.Document): PyMuPDF Document instance.
settings (dict): Parsing parameters.
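The call shape documented above, a document object followed by keyword settings, can be illustrated with a minimal, self-contained sketch. It does not use pdf2docx: `SketchPage`, `SketchPages`, `example_option`, and `example_tolerance` are hypothetical names invented for illustration, and the loop inside `parse` is an assumption, not the library's actual logic. The only point shown is that keyword arguments passed to a `parse(fitz_doc, **settings)` style method arrive as a plain dict.

```python
class SketchPage:
    """Hypothetical stand-in for one page; NOT pdf2docx's Page class."""

    def __init__(self, index):
        self.index = index
        self.settings = {}


class SketchPages:
    """Hypothetical stand-in that mirrors only the documented signatures."""

    def __init__(self, instances=None, parent=None):
        # Copy the list so the caller's list is not shared with the collection.
        self._instances = list(instances) if instances else []
        self.parent = parent

    def __iter__(self):
        return iter(self._instances)

    def __len__(self):
        return len(self._instances)

    def parse(self, fitz_doc, **settings):
        # `settings` is an ordinary dict built from the keyword arguments.
        # Assumption for this sketch: every page receives a copy of it.
        for page in self._instances:
            page.settings = dict(settings)


# Usage: `fitz_doc=None` stands in for a real PyMuPDF Document instance.
pages = SketchPages([SketchPage(i) for i in range(3)])
pages.parse(None, example_option=True, example_tolerance=0.5)
```

With the real library, the first argument would be a `fitz.Document` and the keyword names would need to match the parsing parameters pdf2docx actually recognizes, which this page does not list.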