Images and Text in PDF files
PDF files can contain any type of file, just as an email can have attachments. File Juicer can extract most of the bitmap images.
File Juicer can also extract PDF which are embedded in other files.
JPEGCompressed images are stored in PDF as embedded JPEG files which can be extracted.
EPS and PostScriptAre converted to PDF the same way Preview does before they are "Juiced".
Losslessly compressed imagesFile Juicer extract them as PDF to preserve the ICC color information with the file.
Vector GraphicsIs an integrated part of the PDF files, and there is no boundary between text and graphics. To extract vector graphics from a PDF, you can use Tiger & Preview which let you place a crop and copy the contents to a new PDF.
One exception is if an EPS file is embedded in a Word document and then printed to PDF. File Juicer can extract the EPS from this PDF.
Images Mirrored, Inverted or Cut Into Strips
File Juicer extracts images, exactly as they are, no changes, from the files you search in. PDF files are made by many applications, and some of those cut images into stripes, inverts, rotates or mirrors them, scales them or covers some of them up. What ends in the PDF is decided by the application which creates the PDF.
You can deal with such PDF files by "rendering" them to a pixel based file format with Preview.
- Select the image you wish to save
- Copy it
- Pick New from the File menu
- Save as TIFF, JPG or PNG as you desire
File Juicer can extract the text both as plain text and as RTF. In the plain text case, you enable the "ascii" checkbox in the preferences. Note that the encoding of the extracted text is UTF-8 which preserve the "exotic" characters which can be found in PDF files. TextEdit understand UTF-8 if you chose it in the preferences.
RTF can also be a good format if you wish to convert simple PDF files to Word.
If you have a scanned document, File Juicer can extract the images from it, but it does not convert the images to text. You need an Optical Character Recognition application.