pdf2docx.page.Pages module#

Collection of Page instances.

class pdf2docx.page.Pages.Pages(instances: Optional[list] = None, parent=None)#

Bases: BaseCollection

A collection of Page.

parse(fitz_doc, **settings)#

Analyze document structure, e.g. page section, header, footer.

Args:

fitz_doc (fitz.Document): PyMuPDF Document instance. settings (dict): Parsing parameters.