pstotext – Extract ASCII from PostScript and PDF
A Unix program that extracts ASCII text from PostScript and PDF (Acrobat) files. pstotext uses Ghostscript, but does a more careful job with kerned characters and nonstandard font encodings than Ghostscript's ps2ascii utility.
pstotext is no longer hosted on CTAN; documentation and downloads are available from its home page.
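As a rough illustration of how the program might be driven from a script, the sketch below wraps the command line in Python. It assumes a `pstotext` binary is on the `PATH`, that it prints extracted text to standard output by default, and that it accepts an `-output FILE` option as described in its manual page; the helper names `build_command` and `extract_text` are hypothetical and not part of pstotext itself.

```python
import shutil
import subprocess


def build_command(source, output=None):
    """Return the argv list for a pstotext run (assumed invocation)."""
    cmd = ["pstotext"]
    if output is not None:
        # Assumption: "-output FILE" writes to FILE instead of stdout.
        cmd += ["-output", output]
    cmd.append(source)
    return cmd


def extract_text(source):
    """Run pstotext on a PostScript or PDF file and return the text.

    Returns None when pstotext is not installed, so callers can fall back
    to another extractor.
    """
    if shutil.which("pstotext") is None:
        return None
    result = subprocess.run(
        build_command(source),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit status
    )
    return result.stdout
```

Calling `extract_text("paper.ps")` would then return the extracted text as a string, provided both pstotext and Ghostscript are installed.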
License: Free license not otherwise listed, or more than one free license applies
Topic: convert one format of file to another
You may also be interested in the following packages:
- catdoc: Text extractor for Microsoft Word files
- transfig: Transform xfig pictures into many other formats
- latex2html: Convert LaTeX into HTML documents
- tex2page: Produce HTML from TeX/LaTeX