1
0
mirror of https://github.com/pdf2htmlEX/pdf2htmlEX.git synced 2024-07-01 07:59:00 +00:00
Go to file
2012-08-21 20:37:25 +02:00
share better line merging 2012-08-19 13:05:14 +02:00
src clean code 2012-08-21 20:37:25 +02:00
CMakeLists.txt .. 2012-08-20 23:59:02 +02:00
LICENSE initial import 2012-08-05 02:03:53 +08:00
README.md update REAMD 2012-08-18 16:05:36 +08:00

pdf2htmlEX

View Demo

Another Demo (CJK)

WINDOWS XP USERS: Please make sure ClearType is turned on

(Control Panel -> Display -> Appearance -> Effects -> "Use the following method to smooth edges of screen fonts" -> ClearType)

Introduction

pdf2htmlEX renders PDF files in HTML, utilizing modern Web technologies, aims to provide an accuracy rendering, while keeping optimized for Web display.

It is optimized for recent versions of modern web browsers such as Mozilla Firefox & Google Chrome.

This program is designed for scientific papers with complicate formulas and figures, so a precise rendering is also the #1 concern.

Features

  • Single HTML file output
  • Precise rendering
  • Text Selection
  • Font embedding
  • Proper styling (Color, Transformation...)
  • Optimization for Web

Not supported yet

  • Several Font types & encodings
  • Non-text object (Don't worry, they will be rendered as images)
  • Blend Mode
  • ...

Dependency

  • Recent version of GCC (no guarantee on other compilers)
  • libpoppler with xpdf header >= 0.20.2
  • boost c++ library (format, program options, gil, filesystem, serialization, system(which is actually required by filesystem))
  • fontforge Please use the lastest version

HOW TO COMPILE

cmake . && make && sudo make install

HOW TO USE

pdf2htmlEX /path/to/sample.pdf

LICENSE

GPLv3

We would like to acknowledge the following projects that have been consulted while writing this program:

  • pdftops & pdftohtml from poppler
  • MuPDF
  • PDF.js
  • Crocodoc
  • Google Doc

AUTHORS