Who shouldn't buy FineReader ProĪs excellent as FineReader Pro is, it is not the right OCR tool for everyone. (Or vice versa.) Seriously, if you really want an HTML copy of your document, export it as text, convert that to Markdown, and generate your HTML from that. There are some options here for "fixed" vs "flexible" layout, and there's even an exciting-until-you-see-what-it-does option for CSS, but the end result is fairly ugly and convoluted HTML that will make tidy laugh and validators weep. To those who are hoping to take complex print magazine formatting and reproduce it on the web, I have two messages: a) please don't and b) no, really, don't, but if you do, don't rely on FineReader for it. ( Markdown fans: take special note of the option to use a blank line to separate paragraphs). The simplest way to use the text from your scan is to Export as TXT. (What? No option to export a multi-page document as an animated GIF? Apparently the FineReader developers aren't familiar with Tumblr!) Export as Text or HTML I'm not sure who uses an OCR app to export the finished document as an image, but if you're one of those people, you can choose from JPEG, JPEG–2000, TIFF, PNG, BMP, JBIG2, PCX, and DCX. Those hoping for MOBI support should probably consider exporting to ePub and then either using Send To Kindle for Mac or some other solution. You can also use the first page as the book cover image and preserve/embed fonts (the latter option is for ePub only, not FB2). Options for ebooks include setting metadata such as the title, author, keywords, and an annotation (I do wish exporting to PDF had similar options). You can export to ePub (as you'd expect) and FB2, which is a format that I had never even heard of, but is apparently for something called FictionBook. Export as EbookįineReader also supports two ebook formats, but probably not the two you would expect. FineReader includes an option to ignore all text outside tables when exporting to CSV. If your scan is tabular data, you might also consider exporting as CSV. However, now that I have seen it and thought about it some more, it does now seem something like that could come in extremely handy for people who do need to process scans of documents which were originally produced in spreadsheet or presentation apps. I was surprised and somewhat perplexed to see options for Excel (.xlsx) and PowerPoint (.pptx) files, but that was mostly due to the fact that I rarely use either application. (All of those options are also available for RTF and ODT/OpenOffice.) You can also export to Word (.docx), including four layout options (exact copy, editable copy, formatted text, plain text), plus options to retain page numbers, headers and footers keep line breaks and hyphenation keep page breaks keep pictures keep text and background colors high line uncertain characters and keep line numbers. Export as Word (or RTF or ODT), Excel, CSV, or PowerPointĮxporting as PDF is only the beginning. This is especially useful if you want to edit the resulting PDF to correct any OCR mistakes, which will still happen, regardless of which app you choose.īy subscribing, you are agreeing to Engadget's Terms and Privacy Policy. However, the option for "Text over the page image" will allow you to keep the formatting close to the original, but edit the results, if needed, and see it on the screen. "Text under the page image" is what most people usually expect and want from an OCR app: the OCR'd document looks the same on the screen, but you can copy/paste from it into any other application. Its automatic analysis was generally good, but when I took the time to use its more advanced features, it rewarded me with output that was as near-perfect as anyone can expect from an OCR application.įineReader then gives you an unparalleled assortment of export options, including four different options for PDFs alone: Time and time again, FineReader came through. Some of them are pretty good quality, but a few of them have image hovers best described as "a hasty Xerox made on a Friday afternoon before Spring Break by a work-study student who was far more interested in literally anything else." Crooked, dark, speckled, you name it. I have thousands of journal articles saved as PDFs. If the most important feature of an OCR app is how well it does at recognizing text from a PDF or image file, then FineReader Pro is, by far, the best OCR app that I have ever used. If you want fine-grained control over OCR and unmatched export options to a plethora of formats, ABBYY FineReader Pro for Mac is definitely worth a close look, but the current version has some significant caveats which you should consider before spending US$100 on it.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |