.TH pdf2htmlEX 1 "Aug 31, 2012" "pdf2htmlEX 0.1"
.SH NAME
.PP
.nf
pdf2htmlEX \- Converts PDF to HTML without losing text and format.
.fi
.SH USAGE
.PP
.nf
pdf2htmlEX [options] []
.fi
.SH DESCRIPTION
.PP
pdf2htmlEX is a utility that converts PDF files to HTML files.
pdf2htmlEX tries its best to render the PDF precisely and maintain proper styling, while retaining text and optimizing for the Web.
Fonts are extracted from the PDF and then embedded into the HTML (Type 3 fonts are not supported).
Text in the converted HTML file is usually selectable and copyable.
Other objects are rendered as images and also embedded.
.SH OPTIONS
.TP
.B --help
Show all options
.TP
.B -v, --version
Show copyright and version
.TP
.B -o, --owner-password
Specify owner password
.TP
.B -u, --user-password
Specify user password
.TP
.B --dest-dir (Default: ".")
Specify destination folder
.TP
.B --tmp-dir (Default: "/tmp/pdf2htmlEX")
Specify a folder for intermediate files
.TP
.B -f, --first-page (Default: 1)
Specify the first page to process
.TP
.B -l, --last-page (Default: last page)
Specify the last page to process
.TP
.B --zoom (Default: 1.0)
Specify the zoom ratio of the HTML file
.TP
.B --hdpi, --vdpi (Default: 144)
Specify the horizontal and vertical DPI for images
.TP
.B --process-nontext <0|1> (Default: 1)
Whether to process non-text objects (as images)
.TP
.B --single-html <0|1> (Default: 1)
Whether to embed everything into one HTML file.
If switched off, several files are generated alongside the HTML file, including files for fonts, CSS, and images.
.TP
.B --embed-base-font <0|1> (Default: 1)
Whether to embed the base 14 fonts.
The PDF standards define several base fonts, which are supposed to be provided by the PDF reader.
If this switch is on, a matching local font will be used and embedded; otherwise only font names are exported such that the font matched