How can I compare PDF documents quickly and reliably?
You have two versions of a document and now you want to know: What are the differences, what has remained the same?
Are you tired of the tedious comparison by eyeballing?
Or do you already use a comparison tool, but are not satisfied with the results?
Then you are exactly right here. Read on why our software PDiff really helps you:
Why compare documents with PDiff (and not by eyeballing)?
Because you have better things to do. Not to mention the mistakes that can happen to the best reader every now and then.
Error-free and tireless, so you’ll never miss something again.
PDiff does the proofreading with care and with constant attention, even for completely changed layouts.
So you won’t overlook even the slightest text deviation: You won’t miss any legally or technically relevant difference. For sure.
Lightning fast and objective, so that the results are always the same no matter how much time is available or who the reader is.
Even for long documents, you have the results within a few seconds.
Differences are clearly and precisely documented on screen and in the proof report. The results are reproducible and objective. And can be used for internal and external communication.
Efficient, so you have more time to do what you really want.
With PDiff you see the differences at a glance. You can focus on what matters most: examining and reviewing your texts.
Why compare documents with PDiff (and not with other tools)?
Because you need a solution that you can fully rely on. Which will not let you down with complicated documents. And which allows you to shine with your results thanks to tailor-made report formats.
Synchronized display of PDF & Text provides clear insights into the comparison process, so you can fully rely on the results.
With some comparison tools, you only see the text content and completely lose sight of your original PDF documents. With other tools, you only see the PDFs, but you never know exactly which text content was really compared behind the scenes.
Not so with PDiff. Here you see everything synchronized in an ingenious combination: the two PDFs side by side and the recognized text underneath.
Thanks to this display, with PDiff you know for sure what’s actually going on with the comparison and how to find the differences in your PDF documents.
PDiff uses the Adobe® PDF Library™ for standard-compliant PDF processing without nasty surprises.
PDF is a global standard for reliable electronic documents and ISO standard for archiving electronic documents. The Adobe® PDF Library™ ensures a platform-independent, accurate display. And guarantees a reliable and consistent text analysis.
The powerful comparison engine will not let you down with complicated documents.
The most common reason that users are frustrated by comparison tools and switch back to manual proofreading: The display of too many irrelevant, unaccountable differences. And just when the software is needed the most – with more complicated documents – the benefits go to zero.
To make sure that does not happen to you with PDiff, we’ve added 6 power functions to the software that let you filter out unimportant differences in a surprisingly easy way. So, even with complicated inputs, you can rely on PDiff to do a lot of the work for you.
With know-how and horsepower to the finish, so you are quickly free again for creative tasks.
Comparing documents is probably not one of your core tasks. Your exciting, creative, interesting work takes place before or after. It is therefore all the more important that PDiff brings you quickly to your destination.
Thanks to the intuitive and field-proven interface of PDiff, you can immediately start comparing your files. Without training or reading thick manuals.
And also under the hood, everything is geared for speed: 64-bit technology and parallel processing on multi-core CPUs compare even 100-page documents in 17 seconds2.
Another speed factor: The calculations take place completely locally. Your confidential data remains on your computer and you benefit directly from the computing power of your workstation.
Flexible output formats let you deliver brilliant results.
With PDiff you can generate your comparison results not only for yourself, but also for
- internal documentation (colleagues or supervisors)
- external auditors, such as regulators
- other software systems for further processing
And if you give something out of hand, then it is not enough that the reports look only acceptable. They just have to be perfect. The format and content must be correct.
That’s why PDiff offers you 7 types of output formats. The content and look of reports can be configured to fit your needs.
So you can produce exactly the results you need to optimally present yourself and your work.
How to compare PDF files with PDiff.
Drag two PDFs into the PDiff window and you will see all text differences in the unique synchronous display. Proofreading becomes seeing instead of searching. The screencast shows you how fast you compare your PDF files with PDiff.
PDiff finds the differences.
PDiff shows clearly and precisely all text differences: Insertions, deletions, replacements, and rearranged text – even with different document layouts. That is, even if fonts, hyphenation, column breaks or page breaks were changed.
Proof reports as PDF or XML.
Need proof of matches or the exact changes made?
With PDiff, you can create 7 types of proof reports which document the results however you like:
- Tabular PDF report (for regulators like the EMA)
- Annotated PDF A or B
- Side-by-side PDF A + B
- PDiff projects
- Return code for automation via CLI
The content and appearance of the output formats are highly configurable so that they can be optimally adapted to your requirements. The reports document comprehensibly and unambiguously changes and/or similarities with position, text and optional user comments.
Compare Word files (DOC/DOCX) with PDF.
PDiff provides an import mechanism for all common document formats: DOC/DOCX, RTF, TXT, XLS, etc.
Formats other than PDF are converted automatically by calling their native applications. So you can also compare Word files with PDF.
6 Power functions to filter out unimportant differences.
PDiff offers you the following power functions, which allow you to achieve clear comparison results even with complicated documents:
Exclusion areas to hide irrelevant differences in headers and footers
Page ranges to exclude single or multiple pages such as title pages, table of contents, or appendices from the comparison
Text flow tool to synchronize different reading directions in complex, nonlinear layouts
Table tool to match the text flow of data in tables
Replace function to systematically exchange different characters or words in both documents, e.g. for lists with changed bullets (“-“ vs. “•”)
Comments and OK checkmarks to explain individual differences or to manually hide irrelevant differences via a check-off function
Checking foreign-language texts.
PDiff comes with full Unicode support. Thus, you can also easily compare texts in non-Roman writing systems: e.g. Chinese/Japanese /Korean (CJK), Arabic, Hebrew.
To highlight the differences between two documents, with PDiff you do not even have to speak the language yourself.
To review large volumes of documents, you can automate PDiff by batch processing or command line interface: see the Automation section for details.
Synchronized display of PDF & Text.
Here you can try out the synchronized display of PDF & Text yourself: Move your mouse over the words in the interactive screenshot and watch how the synchronous cursor runs through both documents at the same time.
The main window of PDiff consists of a fully synchronized display of both documents:
top: PDF view with a side-by-side display of document A and document B
bottom: Text view with a synoptic comparison of the extracted texts from document A and document B, i.e. corresponding text passages each at the same height
Differences found are clearly highlighted in both window halves by colored markings.
This synchronized display gives you a unique insight into the comparison process: You see …
… where text was found and where not.
… whether special characters, accent marks and symbols are displayed correctly.
… whether spaces and hyphenations were recognized correctly.
… in which order the words were read and compared.
… whether differences are possibly only caused by incorrectly recognized letters of an OCR.
… on which characters style attributes such as bold, italic, underline, and strikethrough were recognized.
… whether numerical values and chemical/mathematical formulas were read correctly (sign, decimal point, order of digits, subscripts and superscripts).
Just right for your documents.
PDiff is an ideal solution to compare versions of your documents:
- important office documents
- mission critical technical manuals
- legal texts and business contracts
- financial reports
- manuscripts and thesis texts
- scientific articles, reports, and proceedings
- fictional books and non-fiction books
- copywriting to ready-to-print layout
Comparing apples and oranges? No problem.
With PDiff you can also compare documents where the layout differs widely: for example, check the print-ready layout against the copy text.
Special functions for packaging.
PDiff Professional also supports the comparison of rearranged texts. So you can check even complicated layouts of packaging against the original text. For example you can compare copy text with the text of a press-ready folding box that does not even have a defined text flow direction.
Did you know…
…that the English idiom to compare apples and oranges, as depicted in the PDiff icon, translates differently to other languages. Another common idiom - e.g. in French, German, Spanish, and Italian - uses the expression apples and pears to describe the comparison between dissimilar things.
Automation for high data volumes.
If you need to examine large document collections: PDiff Professional can be completely automated, either within the GUI or even without GUI for integration with other software systems.
Batch processing in the GUI
A few mouse clicks, and the automation does the work for entire directories. Simply create PDF reports over all document pairs.
Automation via CLI interface
Do you plan to integrate PDiff into your workflow or call it from other software systems? Through the CLI interface, PDiff may be operated as a command line program. Hence, you can integrate the powerful PDF comparison seamlessly in every possible automation solution.
1 The average human reading speed when proofreading texts is 200 words per minute.
2 Speed measurement of PDiff: Measured for a 100-page sample document with 59037 words and 590 word differences (1%) on a MacBook Pro, 2,9 GHz Intel Core i7, 16GB RAM by calling PDiff in CLI mode. The computing time including report generation (annotated PDF A + B) is 17s.
3 According to a comparative study by Ray Panko, University of Hawaii on proofreading error rates, the human recognition rate of word errors is about 75%.