1
0
mirror of https://github.com/pdf2htmlEX/pdf2htmlEX.git synced 2024-12-21 20:50:07 +00:00
Go to file
2012-08-05 02:25:47 +08:00
bin initial import 2012-08-05 02:03:53 +08:00
demo initial import 2012-08-05 02:03:53 +08:00
src initial import 2012-08-05 02:03:53 +08:00
CMakeLists.txt initial import 2012-08-05 02:03:53 +08:00
LICENSE initial import 2012-08-05 02:03:53 +08:00
README.md update README 2012-08-05 02:25:47 +08:00

pdf2htmlEX

View Demo

Introduction

Traditional pdf -> html conversion tools are more likely pdf -> text tools.

For those who are not satisfied with them, this might be the right one for you.

pdf2htmlEX utilizes latest technologies of html/css, aims to provide an accuracy rendering, while keeping optimized for Web display.

pdf2htmlEX is optimized for recent versions of moderm web browsers such as Mozilla Firefox & Google Chrome.

Features

  • Font embedding
  • Proper styling
  • Optimization for Web
  • Transformation (Experimental)

Not supported yet

  • Non-text object
  • Color
  • CJK

Dependency

  • libpoppler with xpdf header >= 0.20.2
  • fontforge (we will compile with libfontforge later)

HOW TO COMPILE

cmake . && make

HOW TO USE

bin/pdf2htmlEX /path/to/sample.pdf

LICENSE

GPLv3

We would like to acknowledge the following projects that have been consulted while writing this program:

  • pdftops & pdftohtml from poppler
  • PDF.js
  • Crocodoc
  • Google Doc

AUTHORS

Lu Wang coolwanglu@gmail.com Hongliang Tian tatetian@gmail.com