pdf2docx.layout.Column module#

Column of Section.

In most cases, one section per page. But in case multi-columns page, sections are used to distinguish these different layouts.

Note

Currently, support at most two columns.

{
    'bbox': (x0, y0, x1, y1),
    'blocks': [{
        ... # block instances
    }, ...],
    'shapes': [{
        ... # shape instances
    }, ...]
}
class pdf2docx.layout.Column.Column(bbox=None)#

Bases: Layout

Column of Section.

add_elements(elements: Collection)#

Add candidate elements, i.e. lines or shapes, to current column.

bbox: fitz.Rect#
make_docx(doc)#

Create Section Column in docx.

Args:

doc (Document): python-docx document object

property working_bbox#

Working bbox of current Layout.