.TH pdf2htmlEX 1 "Aug 31, 2012" "pdf2htmlEX 0.1"
.SH NAME
.PP
.nf
pdf2htmlEX \- Converts PDF to HTML without losing text and format.
.fi
.SH USAGE
.PP
.nf
pdf2htmlEX [options] <input\-filename> [<output\-filename>]
.fi
.SH DESCRIPTION
.PP
pdf2htmlEX is a utility that converts PDF files to HTML files.
pdf2htmlEX tries its best to render the PDF precisely and maintain proper styling, while retaining text and optimizing for the Web.
Fonts are extracted from the PDF and then embedded into the HTML (Type 3 fonts are not supported). Text in the converted HTML file is usually selectable and copyable.
Other objects are rendered as images and also embedded.
.SH OPTIONS
.TP
.B --help
Show all options
.TP
.B -v, --version
Show copyright and version
.TP
.B -o, --owner-password <password>
Specify owner password
.TP
.B -u, --user-password <password>
Specify user password
.TP
.B --dest-dir <dir> (Default: ".")
Specify destination folder
.TP
.B --tmp-dir <dir> (Default: "/tmp/pdf2htmlEX")
Specify a folder for intermediate files
.TP
.B -f, --first-page <num> (Default: 1)
Specify the first page to process
.TP
.B -l, --last-page <num> (Default: last page)
Specify the last page to process
.TP
.B --zoom <ratio> (Default: 1.0)
Specify the zoom ratio of the HTML file
.TP
.B --hdpi <dpi>, --vdpi <dpi> (Default: 144)
Specify the horizontal and vertical DPI for images
.TP
.B --process-nontext <0|1> (Default: 1)
Whether to process non-text objects (as images)
.TP
.B --single-html <0|1> (Default: 1)
Whether to embed everything into one HTML file.
If switched off, several files are generated alongside the HTML file, including files for fonts, CSS, and images.
.TP
.B --embed-base-font <0|1> (Default: 1)
Whether to embed base 14 fonts.
The PDF standard defines 14 base fonts, which the PDF reader is expected to provide.
If this switch is on, locally matched fonts are used and embedded; otherwise only font names are exported such that the font matched