pdfinfo - Man Page

Portable Document Format (PDF) document information extractor (version 3.03)

Examples (TL;DR)

Synopsis

pdfinfo [options] [PDF-file]

Description

Pdfinfo prints the contents of the ´Info' dictionary (plus some other useful information) from a Portable Document Format (PDF) file.

If PDF-file is ´-', it reads the PDF file from stdin.

The ´Info' dictionary contains the following values:

title
subject
keywords
author
creator
producer
creation date
modification date

In addition, the following information is printed:

custom metadata (yes/no)
metadata stream (yes/no)
tagged (yes/no)
userproperties (yes/no)
suspects (yes/no)
form (AcroForm / XFA / none)
javascript (yes/no)
page count
encrypted flag (yes/no)
print and copy permissions (if encrypted)
page size
file size
linearized (yes/no)
PDF version
metadata (only if requested)

The options -listenc, -meta, -js, -struct, and -struct-text only print the requested information. The 'Info' dictionary and related data listed above is not printed. At most one of these five options may be used.

Options

-f number

Specifies the first page to examine.  If multiple pages are requested using the "-f" and "-l" options, the size of each requested page (and, optionally, the bounding boxes for each requested page) are printed. Otherwise, only page one is examined.

-l number

Specifies the last page to examine.

-box

Prints the page box bounding boxes: MediaBox, CropBox, BleedBox, TrimBox, and ArtBox.

-meta

Prints document-level metadata.  (This is the "Metadata" stream from the PDF file's Catalog object.)

-custom

Prints custom and standard metadata.

-js

Prints all JavaScript in the PDF.

-struct

Prints the logical document structure of a Tagged-PDF file.

-struct-text

Print the textual content along with the document structure of a Tagged-PDF file.  Note that extracting text this way might be slow for big PDF files. (Implies -struct.)

-url

Print all URLs in the PDF. Only the URL types supported by Poppler are listed. Currently, this is limited to Annotations. Note: only URLs referenced by the PDF objects such as Link Annotations are listed. pdfinfo does not attempt to extract strings matching http://... from the text content.

-isodates

Prints dates in ISO-8601 format (including the time zone).

-rawdates

Prints the raw (undecoded) date strings, directly from the PDF file.

-dests

Print a list of all named destinations. If a page range is specified using "-f" and "-l", only destinations in the page range are listed.

-enc encoding-name

Sets the encoding to use for text output. This defaults to "UTF-8".

-listenc

Lits the available encodings

-opw password

Specify the owner password for the PDF file.  Providing this will bypass all security restrictions.

-upw password

Specify the user password for the PDF file.

-v

Print copyright and version information.

-h

Print usage information. (-help and --help are equivalent.)

Exit Codes

The Xpdf tools use the following exit codes:

0

No error.

1

Error opening a PDF file.

2

Error opening an output file.

3

Error related to PDF permissions.

99

Other error.

Author

The pdfinfo software and documentation are copyright 1996-2011 Glyph & Cog, LLC.

See Also

pdfdetach(1), pdffonts(1), pdfimages(1), pdftocairo(1), pdftohtml(1), pdftoppm(1), pdftops(1), pdftotext(1) pdfseparate(1), pdfsig(1), pdfunite(1)

Referenced By

gdcmpdf(1), pdfattach(1), pdfdetach(1), pdffonts(1), pdfimages(1), pdfseparate(1), pdfsig(1), pdftocairo(1), pdftohtml(1), pdftopng(1), pdftoppm(1), pdftops(1), pdftotext(1), pdfunite(1), xpdf(1), xpdfrc(5).

15 August 2011