Wednesday, November 2, 2011
2:02 AM

Extracting Images from PDF files - Pdfimages

Pdfimages is an open source command-line utility for extracting images from PDF files. It is freely available as part of poppler-utils and xpdf-utils, and included by default with many Linux distributions.

Pdfimages saves images from a Portable Document Format  (PDF)  file  as Portable Pixmap (PPM), Portable Bitmap (PBM), or JPEG files. Pdfimages  reads  the  PDF file, scans one or more pages, PDF-file, and writes one PPM, PBM, or JPEG file for each  image,  image-root-nnn.xxx, where  nnn  is  the image number and xxx is the image type (.ppm, .pbm,.jpg).

NB: pdfimages extracts the raw image data from the  PDF  file,  without performing  any  additional  transforms.  Any rotation, clipping, color inversion, etc. done by the PDF content stream is ignored.

Continue Reading...

0 comments:

Post a Comment