mirror of https://github.com/pdf2htmlEX/pdf2htmlEX.git synced 2024-07-05 17:48:38 +00:00
pdf2htmlEX/README.md
2012-08-07 10:04:39 +08:00

pdf2htmlEX

View Demo

Introduction

Traditional PDF-to-HTML conversion tools behave more like PDF-to-text tools.

If you are not satisfied with them, pdf2htmlEX might be the right tool for you.

pdf2htmlEX uses the latest HTML/CSS technologies and aims to provide accurate rendering while staying optimized for Web display.

pdf2htmlEX is optimized for recent versions of modern web browsers such as Mozilla Firefox and Google Chrome.

Features

  • Text Selection
  • Font embedding
  • Proper styling (Color, Transformation...)
  • Optimization for Web

Not supported yet

  • Non-text objects (don't worry, they will be rendered as images)
  • Blend Mode
  • CJK
  • ...

Dependencies

  • libpoppler >= 0.20.2, with xpdf headers
  • Boost C++ libraries (format, program_options)
  • fontforge (we will compile with libfontforge later)
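
Before compiling, it may help to confirm that the tools listed above are installed. The following is a minimal sketch, not part of pdf2htmlEX: the helper name `missing_tools` is hypothetical, and it assumes that poppler registers itself with pkg-config under the module name `poppler`, which may differ on your system. It does not check for the xpdf headers or for Boost.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with pdf2htmlEX):
# print each named command that is not found on PATH, one per line.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# The tool names below are assumptions about a typical setup.
missing=$(missing_tools cmake make fontforge pkg-config)
if [ -n "$missing" ]; then
    echo "Missing tools:" $missing
fi

# If pkg-config is available, ask it whether poppler is recent enough.
# Assumption: the pkg-config module is called "poppler".
if command -v pkg-config >/dev/null 2>&1; then
    pkg-config --atleast-version=0.20.2 poppler ||
        echo "poppler >= 0.20.2 was not found via pkg-config"
fi
```

If the script prints nothing, the listed commands were found and pkg-config reported a suitable poppler version.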

HOW TO COMPILE

cmake . && make

HOW TO USE

bin/pdf2htmlEX /path/to/sample.pdf
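
To convert several files, the single-file command above can be wrapped in a loop. This is a sketch under assumptions: `convert_all` is a hypothetical helper, not something pdf2htmlEX provides, and it passes only the PDF path, exactly as in the command above. This README does not say where the output is written, so check that before running it on many files.

```shell
#!/bin/sh
# Hypothetical helper: convert_all DIR [CONVERTER]
# Runs CONVERTER (default: bin/pdf2htmlEX) once for every *.pdf file
# found directly inside DIR.
convert_all() {
    dir=$1
    converter=${2:-bin/pdf2htmlEX}
    for pdf in "$dir"/*.pdf; do
        # When nothing matches, the glob stays literal; skip that case.
        [ -e "$pdf" ] || continue
        # Report a failure but keep going with the remaining files.
        "$converter" "$pdf" || echo "failed: $pdf" >&2
    done
}

# Example call (the directory is a placeholder):
# convert_all /path/to/pdfs
```

The second argument makes it possible to dry-run the loop, for example `convert_all /path/to/pdfs echo` only prints the files that would be converted.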

LICENSE

GPLv3

We would like to acknowledge the following projects that have been consulted while writing this program:

  • pdftops & pdftohtml from poppler
  • PDF.js
  • Crocodoc
  • Google Docs

AUTHORS