1
0
mirror of https://github.com/pdf2htmlEX/pdf2htmlEX.git synced 2024-07-03 00:35:40 +00:00
Go to file
2012-08-06 22:46:50 +08:00
bin don't mapp to unicode 2012-08-06 22:46:50 +08:00
src don't mapp to unicode 2012-08-06 22:46:50 +08:00
CMakeLists.txt add background image 2012-08-06 14:21:37 +08:00
LICENSE initial import 2012-08-05 02:03:53 +08:00
README.md update readme 2012-08-06 20:03:39 +08:00

pdf2htmlEX

View Demo

Introduction

Traditional pdf -> html conversion tools are more likely pdf -> text tools.

For those who are not satisfied with them, this might be the right one for you.

pdf2htmlEX utilizes latest technologies of html/css, aims to provide an accuracy rendering, while keeping optimized for Web display.

pdf2htmlEX is optimized for recent versions of moderm web browsers such as Mozilla Firefox & Google Chrome.

Features

  • Text Selection
  • Font embedding
  • Proper styling
  • Optimization for Web
  • Transformation

Not supported yet

  • Non-text object (Don't worry, they will be rendered as images)
  • Color
  • CJK

Dependency

  • libpoppler with xpdf header >= 0.20.2
  • boost c++ library (format, program options)
  • fontforge (we will compile with libfontforge later)

HOW TO COMPILE

cmake . && make

HOW TO USE

bin/pdf2htmlEX /path/to/sample.pdf

LICENSE

GPLv3

We would like to acknowledge the following projects that have been consulted while writing this program:

  • pdftops & pdftohtml from poppler
  • PDF.js
  • Crocodoc
  • Google Doc

AUTHORS