pdf2htmlEX/README.md

85 lines
2.0 KiB
Markdown
Raw Normal View History

2012-08-28 09:54:27 +00:00
# pdf2html**EX**
2012-08-04 18:03:53 +00:00
2012-08-04 18:25:47 +00:00
[**View Demo**](http://coolwanglu.github.com/pdf2htmlEX/demo/demo.html)
2012-08-18 08:05:36 +00:00
2012-08-18 07:55:21 +00:00
[**Another Demo (CJK)**](http://coolwanglu.github.com/pdf2htmlEX/demo/chn.html)
2012-08-04 18:22:50 +00:00
2012-08-18 07:46:07 +00:00
**WINDOWS XP USERS: Please make sure ClearType is turned on**
2012-08-17 13:45:16 +00:00
(Control Panel -> Display -> Appearance -> Effects -> "Use the following method to smooth edges of screen fonts" -> ClearType)
2012-08-28 09:56:37 +00:00
## Introduction
2012-08-28 09:54:27 +00:00
2012-08-07 12:16:05 +00:00
pdf2htmlEX renders PDF files in HTML, utilizing modern Web technologies, aims to provide an accuracy rendering, while keeping optimized for Web display.
2012-08-04 18:03:53 +00:00
2012-08-28 09:54:27 +00:00
It is optimized for modern web browsers such as Mozilla Firefox & Google Chrome.
2012-08-07 12:16:05 +00:00
2012-08-28 09:54:27 +00:00
This program is designed for scientific papers with complicate formulas and figures, so a precise rendering is also the #1 concern. But of course general PDF files are also supported.
2012-08-28 09:56:37 +00:00
## Features
2012-08-04 18:03:53 +00:00
2012-08-14 18:54:39 +00:00
* Single HTML file output
2012-08-07 12:16:05 +00:00
* Precise rendering
2012-08-05 15:54:15 +00:00
* Text Selection
2012-08-28 09:54:27 +00:00
* Font embedding & reencoding for Web
2012-08-06 17:22:25 +00:00
* Proper styling (Color, Transformation...)
2012-08-04 18:03:53 +00:00
* Optimization for Web
2012-08-28 09:56:37 +00:00
### Not supported yet
2012-08-28 09:54:27 +00:00
* Type 3 fonts
2012-08-06 06:32:12 +00:00
* Non-text object (Don't worry, they will be rendered as images)
2012-08-04 18:03:53 +00:00
2012-08-28 09:56:37 +00:00
## Get started
2012-08-28 09:54:27 +00:00
2012-08-28 09:56:37 +00:00
### Ubuntu PPA
2012-08-28 09:54:27 +00:00
There is a Ubuntu PPA set up at [here](https://launchpad.net/~coolwanglu/+archive/pdf2htmlex).
2012-08-29 23:42:53 +00:00
Make sure you install fontforge in the PPA or [the git version](https://github.com/fontforge/fontforge).
2012-08-28 09:54:27 +00:00
2012-08-29 23:42:53 +00:00
### Build from source
2012-08-28 09:54:27 +00:00
2012-08-28 09:56:37 +00:00
#### Dependency
2012-08-28 10:27:45 +00:00
2012-08-28 09:54:27 +00:00
* CMake
* compilers support C++11
2012-08-04 18:03:53 +00:00
* libpoppler with xpdf header >= 0.20.2
2012-08-15 03:15:33 +00:00
* boost c++ library (format, program options, gil, filesystem, serialization, system(which is actually required by filesystem))
2012-08-13 05:28:05 +00:00
* fontforge **Please use [the lastest version](https://github.com/fontforge/fontforge)**
2012-08-04 18:03:53 +00:00
2012-08-28 09:56:37 +00:00
#### Compiling
2012-08-28 10:27:45 +00:00
2012-08-14 18:28:19 +00:00
cmake . && make && sudo make install
2012-08-04 18:03:53 +00:00
2012-08-28 09:54:27 +00:00
## Usage
2012-08-28 10:27:45 +00:00
2012-08-28 09:54:27 +00:00
pdf2htmlEX /path/to/foobar.pdf
pdf2htmlEX --help
2012-08-28 09:56:37 +00:00
## LICENSE
2012-08-04 18:03:53 +00:00
2012-08-31 13:19:46 +00:00
GPLv2 & GPLv3 Dual licensed
2012-08-04 18:03:53 +00:00
2012-08-28 09:56:37 +00:00
## Credits
2012-08-28 09:54:27 +00:00
The following projects have been consulted for pdf2htmlEX:
2012-08-04 18:03:53 +00:00
* pdftops & pdftohtml from poppler
2012-08-11 11:55:06 +00:00
* MuPDF
2012-08-04 18:03:53 +00:00
* PDF.js
* Crocodoc
* Google Doc
2012-08-28 09:56:37 +00:00
## Contact
2012-08-28 09:54:27 +00:00
2012-08-04 18:27:18 +00:00
* Lu Wang <coolwanglu@gmail.com>
2012-08-28 09:54:27 +00:00
2012-08-28 09:56:37 +00:00
### Special Thanks
2012-08-28 09:54:27 +00:00
2012-08-04 18:27:18 +00:00
* Hongliang Tian <tatetian@gmail.com>
2012-08-04 18:03:53 +00:00