![]() Side-by-side comparison makes it very easy to review both original and modified documents as they are clearly shown in one workspace and stay synchronised as you scroll through them. Remember that OCR is imperfect so comparisons of scanned documents will need more review time. Most of your changes will be accurate but there is a link to an explanation if the results are inconsistent. You can review your changes in the usual way – hover over a change to learn more about it. This alerts you to the fact that OCR has been performed prior to the comparison. When Workshare compares a scanned PDF, you are notified across the top of your comparison. How side-by-side comparison helps you deal with scanned PDFs In a regular PDF, you can select and copy text. You cannot select text in a scanned PDF, you can only select an area of image. One way of knowing whether your PDF is a scanned, image-based PDF is to try and select some text. How to distinguish scanned PDF from a regular PDF? Clicking the change will show that 450 has been deleted and £50 added. The comparison shows a change, when you can see there is none. ![]() The OCR process converts the scanned PDF and mistakenly converts the handwritten £50 to 450.00. Imagine the original document was a scanned rental agreement where the rent had been filled in by hand as £50.00 and the modified document was a regular PDF with the rent as £50. The comparison may indicate that text has been changed, while you can see that the text has not been changed. For example, when the scanned PDF is a document that has been photocopied multiple times or includes hand-written notes. While the conversion attempts to be as accurate as possible, some content may be converted incorrectly. Consequently, the comparison results may not match what you can see in the original and modified documents. You cannot see the converted original PDF. Scanned PDF OCR: When you scan a paper using an electronic scanning device, the whole content will be captured as an image. Workshare converts the PDF to a text-based PDF and then runs the comparison using this converted original PDF. Shown above, a scanned PDF is selected as the original document. This means, that the document Workshare actually compares may not be exactly the same as the document you selected. Workshare automatically runs OCR when you select to compare a scanned PDF and uses the converted version of the document for the comparison. This conversion process - OCR - is an imperfect process. To run a comparison on a scanned PDF, the images must first be converted into editable text. A scanned PDF contains images of content there’s no actual text content but only images embedded into the PDF file. If this happens, it is because Optical Character Recognition (OCR) has been performed on your PDF.Ī regular PDF contains text that can be selected, copied and edited. You may find that when you are comparing a scanned PDF, some of the changes identified by the comparison appear illogical or are unexpected.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |