Welcome to pdf2docx’s documentation!

pdf2docx is a Python library that extracts data from PDF files with PyMuPDF, parses the page layout with rules, and generates a docx file with python-docx.
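As a rough illustration of that extract, parse, and generate pipeline, here is a minimal sketch of a conversion call. It assumes the `Converter` class with its `convert()` and `close()` methods, which may differ between pdf2docx versions. The helper names `default_docx_path` and `convert_pdf` and the file name `report.pdf` are hypothetical and exist only for this example.

```python
from pathlib import Path


def default_docx_path(pdf_path):
    """Hypothetical helper: place the .docx next to the source PDF."""
    return Path(pdf_path).with_suffix(".docx")


def convert_pdf(pdf_path, docx_path=None, start=0, end=None):
    """Convert pages [start, end) of a PDF to docx; end=None means the last page.

    Assumes pdf2docx exposes Converter(pdf_file) with
    convert(docx_file, start=..., end=...) and close().
    """
    # Imported lazily so the path helper works without pdf2docx installed.
    from pdf2docx import Converter

    target = Path(docx_path) if docx_path else default_docx_path(pdf_path)
    cv = Converter(str(pdf_path))
    try:
        # Extraction, layout parsing, and docx generation all happen in this call.
        cv.convert(str(target), start=start, end=end)
    finally:
        # Release the underlying PDF document even if conversion fails.
        cv.close()
    return target


# Example call (requires pdf2docx and an existing file):
# convert_pdf("report.pdf")  # writes report.docx next to the PDF
```

Wrapping the call in `try`/`finally` is a design choice of this sketch, so that the PDF handle is closed even when a page fails to convert.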

https://s1.ax1x.com/2020/08/04/aDryx1.png

API DOCUMENTATION
