ps2ascii - Man Page

Ghostscript translator from PostScript or PDF to text

Synopsis

ps2ascii [ input.ps [ output.txt ] ]
ps2ascii input.pdf [ output.txt ]

Description

ps2ascii uses gs(1) to extract text from PostScript(tm) or Adobe Portable Document Format (PDF) files. If no files are specified on the command line, gs reads from standard input.  If no output file is specified, the ASCII text is written to standard output.

The old ps2ascii.ps program was deprecated and removed some years ago, the scripts now use the txtwrite device to extract text from the input. This does a generally better job than the old PostScript program and can extract Unicode not just ASCII. However it no longer supports the COMPLEX feature.

See Also

Further documentation on the txtwrite device can be found at https://ghostscript.readthedocs.io/en/latest/Devices.html#text-output

Version

This document was last revised for Ghostscript version 10.04.0.

Author

Artifex Software, Inc. are the primary maintainers of Ghostscript. David M. Jones <dmjones@theory.lcs.mit.edu> made substantial improvements to ps2ascii.

Referenced By

ps2ps(1).

18 Sept 2024 10.04.0 Ghostscript Tools