1
0
mirror of https://github.com/pdf2htmlEX/pdf2htmlEX.git synced 2024-07-04 17:18:40 +00:00
Go to file
2012-08-08 16:02:30 +08:00
bin generate script for fontforge to extract the font files 2012-08-08 16:01:09 +08:00
src generate script for fontforge to extract the font files 2012-08-08 16:01:09 +08:00
CMakeLists.txt moving to drawString 2012-08-07 09:59:10 +08:00
LICENSE initial import 2012-08-05 02:03:53 +08:00
README.md Merge branch 'devv' 2012-08-08 16:02:30 +08:00

pdf2htmlEX

View Demo

Introduction

pdf2htmlEX renders PDF files in HTML, utilizing modern Web technologies, aims to provide an accuracy rendering, while keeping optimized for Web display.

It is optimized for recent versions of modern web browsers such as Mozilla Firefox & Google Chrome.

This program is designed for scientific papers with complicate formulas and figures, so a precise rendering is also the #1 concern.

Features

  • Precise rendering
  • Text Selection
  • Font embedding (Need my modified fontforge to work)
  • Proper styling (Color, Transformation...)
  • Optimization for Web

Not supported yet

  • Non-text object (Don't worry, they will be rendered as images)
  • Blend Mode
  • CJK
  • ...

Dependency

  • libpoppler with xpdf header >= 0.20.2
  • boost c++ library (format, program options)
  • fontforge (we will compile with libfontforge later)

HOW TO COMPILE

cmake . && make

HOW TO USE

bin/pdf2htmlEX /path/to/sample.pdf

LICENSE

GPLv3

We would like to acknowledge the following projects that have been consulted while writing this program:

  • pdftops & pdftohtml from poppler
  • PDF.js
  • Crocodoc
  • Google Doc

AUTHORS