Introducing pdf2htmlEX: converts PDF to HTML without losing format

http://forums.fedoraforum.org – Demo comes first: coolwanglu.github.com/pdf2htmlEX/demo/demo.html (Sorry, I cannot create links, as they kept getting rejected by the forum.)
Another demo (with CJK): coolwanglu.github.com/pdf2htmlEX/demo/chn.html
Home page: github.com/coolwanglu/pdf2htmlEX

There are basically two types of PDF-to-HTML converters. One is roughly a PDF-to-text converter with a few predefined formats in HTML. The other is a render-everything-as-images converter, which loses all text and generates huge files. pdf2htmlEX combines the advantages of both, retaining both text and styling.

Features: 1.Extract and e (HowTos)