mirror of
https://github.com/pdf2htmlEX/pdf2htmlEX.git
synced 2024-07-08 19:00:33 +00:00
.TH pdf2htmlEX 1 "Aug 31, 2012" "pdf2htmlEX 0.1"
.SH NAME
.PP
.nf
pdf2htmlEX \- Converts PDF to HTML without losing text and format.
.fi
.SH USAGE
.PP
.nf
pdf2htmlEX [options] <input\-filename> [<output\-filename>]
.fi
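.PP
As a rough illustration of the synopsis above, assuming a hypothetical input file named
.I example.pdf
in the current directory and a hypothetical output name
.IR example.html ,
invocations might look like the following. The second line only combines options documented in the OPTIONS section of this page.
.PP
.nf
pdf2htmlEX example.pdf example.html
pdf2htmlEX \-\-zoom 1.5 \-f 1 \-l 5 example.pdf example.html
.fi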
.SH DESCRIPTION
.PP
pdf2htmlEX is a utility that converts PDF files to HTML files.
.PP
pdf2htmlEX tries its best to render the PDF precisely and maintain proper styling, while retaining text and optimizing for the Web.
.PP
Fonts are extracted from the PDF and then embedded into the HTML (Type 3 fonts are not supported). Text in the converted HTML file is usually selectable and copyable.
.PP
Other objects are rendered as images and also embedded.
.SH OPTIONS
.TP
.B --help
Show all options
.TP
.B -v, --version
Show copyright and version
.TP
.B -o, --owner-password <password>
Specify owner password
.TP
.B -u, --user-password <password>
Specify user password
.TP
.B --dest-dir <dir> (Default: ".")
Specify destination folder
.TP
.B --tmp-dir <dir> (Default: "/tmp/pdf2htmlEX")
Specify a folder for intermediate files
.TP
.B -f, --first-page <num> (Default: 1)
Specify the first page to process
.TP
.B -l, --last-page <num> (Default: last page)
Specify the last page to process
.TP
.B --zoom <ratio> (Default: 1.0)
Specify the zoom ratio of the HTML file
.TP
.B --hdpi <dpi>, --vdpi <dpi> (Default: 144)
Specify the horizontal and vertical DPI for images
.TP
.B --process-nontext <0|1> (Default: 1)
Whether to process non-text objects (as images)
.TP
.B --single-html <0|1> (Default: 1)
Whether to embed everything into one HTML file.
If switched off, several files are generated alongside the HTML file, including files for fonts, CSS, and images.
.TP
.B --embed-base-font <0|1> (Default: 1)
Whether to embed base 14 fonts.
There are several base fonts defined in the PDF standards, which are supposed to be provided by the PDF reader.
If this switch is on, locally matched fonts will be used and embedded; otherwise only font names are exported such that the font matched