pdftohtml
=========

pdftohtml was developed by Gueorgui Ovtcharov and Rainer Dorsch. It is
based and benefits a lot from Derek Noonburg's xpdf package.

version 0.20
Jan 27th, 2000

Email: pdftohtml@ralf.informatik.uni-stuttgart.de
WWW:   http://www.ra.informatik.uni-stuttgart.de/~gosho/pdftohtml/

The xpdf software and documentation are
copyright 1996-1999 Derek B. Noonburg.

The PDF data structures, operators, and specification are
copyright 1995 Adobe Systems Inc.


What is pdftohtml.bin?
----------------------

pdftohtml.bin converts files from the Portable Document Format (PDF) files
into the HTML file format.


Images
------
pdftohtml can extract all images from a PDF file and save them as JPEG
or as PNG files. Currently pdf vector drawings are not extracted.

What is pdftohtml?
-----------------

Pdftohtml is a shell script that executes pdftohtml.bin and converts all
PBM/PPM images into PNG image format using "pnmtopng".


How to use pdftohtml.bin?
-------------------------

If HTML-file is not specified pdftohtml.bin converts file.pdf to file.html
and extracts all images.  If the -c option is not specified, only JPEG
files are saved. The conversion of the PBM/PPM images to something
which a browser can display (e.g. PNG) is left to the user. If the
-ext <abc> option is specified all PBM/PPM are referenced within the
HTML file as file with extension .<abc> (default is .png). The names
of all PBM/PPM files are saved in "images.log". The user can convert
them to GIF,JPEG or PNG format.


How to compile pdftohtml.bin ?
------------------------------

To compile pdftohtml.bin, type : 
   
    make

Pdftohtml is tested on Linux and Solaris 2.6 .


Running pdftohtml
-----------------

To run pdftohtml type : 
    
    pdftohtml file.pdf


Distribution
------------

Pdftohtml is licensed under the GNU General Public License (GPL), version
2. See COPYING. 


Getting pdftohtml
----------------- 

The latest version of pdftohtml is available from:

  http://www.ra.informatik.uni-stuttgart.de/~gosho/pdftohtml/


Mail Aliases:
-------------

There are two mail aliases:

For general discussions, bug reports, ideas, patches,...:
pdftohtml@ralf.informatik.uni-stuttgart.de

And an announcement alias, where new versions are announced:
pdftohtml-announce@ralf.informatik.uni-stuttgart.de

If you want to be added to one of them, please send mail to
rainer.dorsch@informatik.uni-stuttgart.de.

