Convert HTML Files

Docspell Documentation

Context and Problem Statement

How can HTML documents be converted into a PDF file that looks as much as possible like the original?

It would be nice to have a Java-only solution. But if an external tool produces a better result, then an external tool is fine, too.

Since Docspell is free software, the tools must also be free.

Considered Options

Native (Firefox) view:

Note: the example HTML is from here.

I downloaded the HTML file to disk together with its resources (using Save as... in the browser).

Pandoc

The version using the context pdf-engine is not shown, since it looked very similar to the LaTeX variant.
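For reference, a Pandoc run with a selectable PDF engine could be scripted roughly like this. It is only a sketch: the function names are made up for illustration, and the default engine `xelatex` is an assumption, since the exact LaTeX engine used for the comparison is not recorded here. Only the `-f`, `-o` and `--pdf-engine` options of pandoc are used.

```python
import shutil
import subprocess
from pathlib import Path


def pandoc_command(html: Path, pdf: Path, pdf_engine: str = "xelatex") -> list[str]:
    """Build the pandoc argument list for an HTML to PDF conversion.

    The default engine is an assumption; pass "context" to try the
    ConTeXt variant instead.
    """
    return [
        "pandoc",
        "-f", "html",                  # read the input as HTML
        f"--pdf-engine={pdf_engine}",  # program pandoc uses to produce the PDF
        str(html),
        "-o", str(pdf),
    ]


def convert_with_pandoc(html: Path, pdf: Path, pdf_engine: str = "xelatex") -> bool:
    """Run pandoc if it is installed; return True when a PDF was written."""
    if shutil.which("pandoc") is None:
        return False  # pandoc is not on the PATH
    result = subprocess.run(pandoc_command(html, pdf, pdf_engine), capture_output=True)
    return result.returncode == 0 and pdf.exists()
```

Calling `convert_with_pandoc(Path("page.html"), Path("page.pdf"), pdf_engine="context")` would then produce the ConTeXt variant, provided pandoc and the chosen engine are installed.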

wkhtmltopdf

Unoconv

Decision Outcome

wkhtmltopdf.

It produced the best results of the options considered.
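Calling wkhtmltopdf as an external tool could look roughly like the following. This is a minimal sketch and not how Docspell itself invokes the tool: the function names, the UTF-8 default and the 60 second timeout are assumptions chosen for illustration. The only wkhtmltopdf option used is `--encoding`.

```python
import shutil
import subprocess
from pathlib import Path


def wkhtmltopdf_command(html: Path, pdf: Path, encoding: str = "utf-8") -> list[str]:
    """Build the wkhtmltopdf argument list: options first, then input, then output."""
    return ["wkhtmltopdf", "--encoding", encoding, str(html), str(pdf)]


def convert_with_wkhtmltopdf(html: Path, pdf: Path, timeout_seconds: int = 60) -> bool:
    """Run wkhtmltopdf if it is installed; return True when a PDF was written."""
    if shutil.which("wkhtmltopdf") is None:
        return False  # tool is not on the PATH
    try:
        result = subprocess.run(
            wkhtmltopdf_command(html, pdf),
            capture_output=True,
            timeout=timeout_seconds,  # assumed limit so a hanging render cannot block forever
        )
    except subprocess.TimeoutExpired:
        return False
    return result.returncode == 0 and pdf.exists()
```

Keeping the command construction separate from the process call makes it possible to check the arguments without the tool being installed.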