ScanSoft OMNIPAGE PRO X FOR MACINTOSH Manual

For Macintosh

Summary of Contents for ScanSoft OMNIPAGE PRO X FOR MACINTOSH

  • Page 2 Legal Notices ©2001 by ScanSoft, Inc. All rights reserved. No part of this publication may be transmitted, transcribed, reproduced, stored in any retrieval system or translated into any language or computer language in any form or by any means, mechanical, electronic, magnetic, optical, chemical, manual, or otherwise, without prior written consent from the Legal Department at ScanSoft, Inc., 9 Centennial Drive, Peabody, Massachusetts 01960.
  • Page 3: Table Of Contents

    Contents Welcome Chapter outline Using this Guide How to use online Help Other online resources New features in OmniPage Pro X Installation and setup System requirements Installing the software Running the program under Mac OS 9...
  • Page 4 Processing documents Basic processing steps Automatic processing To prepare for automatic processing To process a new document automatically To process an existing document automatically Manual processing Steps for manual processing Using automatic and manual processing together Using the OCR Assistant...
  • Page 5 Listening to a document Closing a document Quitting OmniPage Pro Exporting documents Saving an OmniPage Document Saving images Saving recognition results Saving to Portable Document Format (PDF) Copying a document to the Clipboard Using drag-and-drop functionality Direct OCR Using Direct OCR Settings OCR Toolbar options Get Page options...
  • Page 6 Technical information Troubleshooting Solutions to try first Low memory situations Low disk space situations Improving accuracy Improving fax recognition Interface problems and solutions System failure during OCR Supported languages Supported saving formats Supported image file formats...
  • Page 7: Welcome

    Welcome Welcome to OmniPage Pro X™, and thank you for buying our software! This User’s Guide has been provided to help you get started and give you an overview of the program. Chapter outline Chapter 1, Installation and setup, tells you how to install and start the program and select a scanner.
  • Page 8: Using This Guide

    Using this Guide This Guide assumes that you know how to work in the Macintosh® environment. Please refer to your Macintosh help resources if you have questions about how to use dialog boxes, menus, scroll bars, and so on. The following conventions are used in this Guide. Convention Purpose Italicized text...
  • Page 9: Other Online Resources

    To get help on buttons and pop-up menus Brief help is available without opening the online Help system. Hover the cursor over any button or pop-up list in the OCR Toolbar or the palettes. A concise description of the control appears in the status line along the base of the OCR Toolbar.
  • Page 10: New Features In Omnipage Pro X

    New features in OmniPage Pro X The family of OmniPage® products is now augmented by OmniPage Pro X for Macintosh. Here we summarize its most important new features compared to OmniPage Pro 8 for Macintosh. A better recognition engine has been integrated, capable of delivering greater accuracy, particularly on degraded documents.
  • Page 11: Installation And Setup

    Chapter 1 Installation and setup This chapter provides information on installing OmniPage Pro X and selecting a scanner to use with it. Please consult the Readme file which provides the most up-to-date information on installing and running the program. Readme is supplied in plain text and PDF formats.
  • Page 12: System Requirements

    System requirements The minimum system requirements for OmniPage Pro X are:
      • iMac, iBook, PowerBook, Power Macintosh or PowerPC compatible computers with at least a G3 processor
      • Mac OS 9.0 or later, Mac OS X (10.1 or above) and QuickTime 4.1 or later (this is normally included in OS X)
      • 128 MB of memory (RAM) on Mac OS X;...
  • Page 13: Running The Program Under Mac Os 9

    Personalize your copy in the dialog box that appears. Type in your name, the name of your company and the serial number. You will find the serial number on the CD case. Click OK. Click Install in the next dialog box to proceed. A further dialog box lets you choose where the OmniPage Pro files will be installed.
  • Page 14: Starting Omnipage Pro

    Starting OmniPage Pro There are several ways of starting OmniPage Pro®: Open the OmniPage Pro X folder and double-click the OmniPage Pro X icon. The program launches and the OCR Toolbar is displayed. For quicker access, place an alias of the program icon on your Desktop. Drag and drop one or more image files onto the OmniPage Pro X icon.
  • Page 15 general scanner driver types supported by the program. You can select either a Photoshop plug-in or a TWAIN driver depending on your scanner. For specific scanner types which work with a TWAIN driver, you can choose whether to use their own interface or use OmniPage Pro’s interface.
  • Page 16 To select a scanner manually: Follow instructions 1-3 listed above. Select a scanner manufacturer under Manufacturer in the Select Scanner dialog box. Select a scanner model under Scanner. Check the driver name under Driver. If you have more than one driver, select the one you want to use.
  • Page 17 Decide which user interface you want to use for your scanner: the driver’s own interface or OmniPage Pro’s interface. See the overview table in the online Help topic Selecting a scanner which summarizes the user interface functioning for different scanner drivers.
  • Page 18: Registering Omnipage Pro

    To scan in the Classic Environment: • Select Scan in Classic Mode in the Select Scanner dialog box if it is not already selected. Please wait while the program compiles a scanner list. This option enables you to scan pages even if your scanner has a driver for Mac OS 9 only.
  • Page 19: Introduction

    Chapter 2 Introduction You probably do business correspondence and other written projects on your computer. However, certain sources of information may not be immediately available for use. For example, if you want to incorporate part of a magazine article into a document in your word processor, you somehow have to get its text into your computer.
  • Page 20: What Is Optical Character Recognition

    What is Optical Character Recognition? Optical character recognition (OCR) is the process of extracting text from images. Images can result from scanning paper documents or opening image files. Images do not have editable text characters; they have many tiny dots (pixels) that together form character shapes. These present a picture of the text on a page.
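The idea that an image holds only dots, and that OCR turns those dots back into characters, can be illustrated with a toy sketch. This is not how OmniPage Pro's recognition engine works; it is a minimal nearest-template match, and every name and bitmap in it is invented for illustration.

```python
# Toy illustration only: real OCR engines (including OmniPage Pro's) are far
# more sophisticated. All names and bitmaps below are invented for this sketch.

# Each glyph is a 5x5 grid of pixels: '#' is a dark dot, '.' is background.
TEMPLATES = {
    "I": ["..#..",
          "..#..",
          "..#..",
          "..#..",
          "..#.."],
    "L": ["#....",
          "#....",
          "#....",
          "#....",
          "#####"],
    "T": ["#####",
          "..#..",
          "..#..",
          "..#..",
          "..#.."],
}


def pixel_distance(a, b):
    """Count the pixel positions at which two equally sized bitmaps differ."""
    return sum(pa != pb
               for row_a, row_b in zip(a, b)
               for pa, pb in zip(row_a, row_b))


def recognize(bitmap):
    """Return the template character whose bitmap differs least from `bitmap`."""
    return min(TEMPLATES, key=lambda ch: pixel_distance(bitmap, TEMPLATES[ch]))


# A scanned "T" with one stray dot of noise still matches the "T" template best.
noisy_t = ["#####",
           "..#..",
           "..#.#",
           "..#..",
           "..#.."]
print(recognize(noisy_t))  # prints T
```

The picture of the letter carries no text by itself; only after the comparison step does an editable character exist.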
  • Page 21: Basic Steps In The Ocr Process

    Basic steps in the OCR process There are three main steps in OmniPage Pro’s OCR process. They correspond to three large numbered buttons in the OCR Toolbar. Documents can be processed automatically or manually. In automatic processing, the Start button takes all specified document pages through the whole process (1-2-3) without a stop.
  • Page 22: The Ocr Toolbar

    The OCR Toolbar The OCR Toolbar appears when you first start the program. It is the control center for all document processing. The OCR Toolbar can be minimized under Mac OS 9. [Figure: the OCR Toolbar, with callouts for the Start button, Get Page, Primary language, Original Layout, the OCR button and the Export button...]
  • Page 23: The Full Omnipage Pro Interface

    The full OmniPage Pro interface The full OmniPage Pro X interface appears when you start a document. The main screen areas of the interface are:
      • The OCR Toolbar
      • The Document window (with Image view and Text view)
      • The Thumbnail window
      • The Zone Info and Tools palettes
      • The Preferences dialog box
    OCR Toolbar...
  • Page 24: The Document Window

    The Document window The Document window allows you to view and work with pages in the current document. You can drag this window to different locations. Original page images are displayed in Image view and recognition results are displayed in Text view. A highlight-colored border denotes which view is active.
  • Page 25: The Zone Info And Tools Palettes

    See Working with documents on page 55 for more information on using thumbnails for page operations. The Zone Info and Tools palettes The Zone Info and Tools palettes are displayed whenever Image view is active. You can drag them to different locations. Under Mac OS 9, they can be minimized and restored.
  • Page 26: The Preferences Dialog Box

    The Preferences dialog box This dialog box is the central location for all OmniPage Pro settings not accessible through the OCR Toolbar. To open it, choose Preferences... in the Application menu (Mac OS 9: Edit menu). The Preferences dialog box has four sections: Scanner, OCR, Spelling and Miscellaneous.
  • Page 27: Omnipage Pro X User's Guide

    Chapter 3 Processing documents This chapter describes how to process documents in OmniPage Pro from start to finish. It tells you how the basic steps of OCR are linked during automatic and manual processing. It explains how you can exploit the advantages of each type of processing within a single document.
  • Page 28: Basic Processing Steps

    Basic processing steps The following diagram summarizes how the basic steps are linked, and directs you to a page in this Guide. This workflow is broadly valid for both automatic and manual processing. The steps performed by the three basic OCR Toolbar buttons have a darker border. [Diagram: workflow boxes, including "Describe page" and "Create zones"...]
  • Page 29: To Prepare For Automatic Processing

    To prepare for automatic processing 1. Select the source for one or more page images. Choose Load image to open one or more page images from file. Choose Scan in B&W to scan in black-and-white. Choose Scan in Gray to scan in grayscale. Choose Scan in Color to scan in color (with a color scanner).
  • Page 30: To Process A New Document Automatically

    To process a new document automatically We assume you have started OmniPage Pro X and can see the OCR Toolbar, but you have no document open and all settings are ready. 1. Click the Start button to launch automatic processing. 2.
  • Page 31: To Process An Existing Document Automatically

    To process an existing document automatically You can also click Start to perform automatic processing when you have a document open. It does not matter whether its pages were processed automatically or manually. To scan new pages into the document, place them in the scanner correctly.
  • Page 32: Manual Processing

    Manual processing You can use manual processing when you want greater control over the OCR process. Processing proceeds step-by-step. This allows you to view and manually zone images before you send them for recognition. It also lets you modify settings between each processing step or from page to page.
  • Page 33: Using Automatic And Manual Processing Together

    will be auto-zoned. You will see a progress indicator as the current page is recognized. After OCR, recognition results appear in Text view. If you requested proofing and there are suspect words on the page, proofing begins immediately. If you did not request proofing, you can view, edit and verify the recognized text or start proofing from any point in the text.
  • Page 34: Using The Ocr Assistant

    5. Specify a choice in the Zoning Instructions dialog box. 6. Repeat steps 4 and 5 until all pages are adequately recognized. 7. Export the finished document as required. To start manually and finish automatically: 1. Prepare settings and acquire all the images for the document by clicking the Get Page button.
  • Page 35 OCR: A training file and options for saving graphics. Spelling: A user dictionary and Language Analyst® options. Miscellaneous: Retain or drop table grids. Click the OCR Assistant button to start moving through the six steps: Step 1, Acquiring images: Choose one of the scanning modes (black-and-white, grayscale or color) or load image files.
  • Page 36: Bringing Page Images Into Omnipage Pro

    Bringing page images into OmniPage Pro This section describes the different methods for acquiring images: scanning pages, loading image files, opening OmniPage Documents and using drag-and-drop. Scanning pages You can scan a paper document to generate an electronic image. See Starting OmniPage Pro and Selecting your scanner in chapter 1. To scan pages into OmniPage Pro: 1.
  • Page 37 To load a single page image file: 1. Select Load Image as the option in the Get Page pop-up menu. 2. Click the Get Page button. The Load Images dialog box appears. It is a standard Macintosh dialog box. 3.
  • Page 38: Opening Omnipage Documents

    Opening OmniPage Documents You can open an OmniPage Document using the Open command in the File menu. An OmniPage Document (OPD) is a file in OmniPage Pro’s proprietary format. OPDs contain original page images, zones, settings and recognition results (if any). Each piece of recognized text remains linked to the image it came from, so text can still be proofed and verified when the OPD is reopened.
  • Page 39: Creating And Modifying Zones

    If you drag and then drop the image icon on Image view, the page or pages are appended to the end of the document. If you drop the image icon on Thumbnail view, you can choose where to have the page(s) placed. As you drag the icon over the pages, a black bar appears between two pages.
  • Page 40: Creating Zones Automatically

    This section presents the following topics: creating zones automatically, specifying zone types, drawing zones manually and modifying zones. Creating zones automatically OmniPage Pro can create zones automatically for you. To do so, it uses the selected page layout description to find blocks of text and graphics on the page, place these in zones and decide a reading order.
  • Page 41: Specifying Zone Types

    • Use Only Current Zones (auto-zoning will not run) • Discard Current Zones and Find New Zones • Keep Current Zones and Find Additional Zones. Specifying zone types All zones are identified as a particular type. This determines the way they are treated during OCR.
  • Page 42 Single Column Text zone type OmniPage Pro treats all contents as one block of text; it does not look for columns or detect graphics. Tabs are inserted between any side-by- side columns detected within a zone, so this zone type can be used for tables or texts in columns you do not want decolumnized or placed in a table grid.
  • Page 43 reversed in your output document, do this in your target application. These zones have black or white borders, depending on the background color. Ignore zone type OmniPage Pro ignores the zone entirely during auto-zoning. This is useful if you want OmniPage Pro to draw zones automatically but first want to identify areas to be ignored.
  • Page 44: Drawing Zones Manually

    2. Select Alphanumeric or Numeric in the Zone Contents pop-up menu. Drawing zones manually You can draw and modify zones using tools in the Tools palette. If the Tools palette does not appear, check that Image view is active and the palette is not minimized (Mac OS 9 only).
  • Page 45 You can draw up to 64 separate zones. Draw zones in the order you want them processed. A number at the top left of each zone indicates the reading order. If you draw a zone over an existing one, the borders of the new zone will wrap around the existing zone.
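The two rules stated here (a limit of 64 zones, and a reading order that follows the order of drawing) can be sketched as a small data structure. This is only an illustrative model, assuming the limit applies per page; the `Zone` and `Page` classes and their fields are hypothetical and do not describe OmniPage Pro's internals.

```python
from dataclasses import dataclass

MAX_ZONES = 64  # limit stated in the Guide; assumed here to apply per page


@dataclass
class Zone:
    order: int   # reading-order number shown at the top left of the zone
    left: int
    top: int
    right: int
    bottom: int


class Page:
    """Hypothetical model of a page that collects manually drawn zones."""

    def __init__(self):
        self.zones = []

    def draw_zone(self, left, top, right, bottom):
        """Add a zone; its reading order is its position in the drawing sequence."""
        if len(self.zones) >= MAX_ZONES:
            raise ValueError(f"cannot draw more than {MAX_ZONES} zones")
        zone = Zone(len(self.zones) + 1, left, top, right, bottom)
        self.zones.append(zone)
        return zone


page = Page()
heading = page.draw_zone(50, 40, 550, 90)    # drawn first, read first
body = page.draw_zone(50, 110, 550, 700)     # drawn second, read second
print(heading.order, body.order)  # prints 1 2
```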
  • Page 46: Modifying Zones

    Modifying zones Zones can be modified before OCR takes place. You can move, copy, resize, reorder, extend, connect, divide, and delete zones. If you modify zones after recognition, you will have to re-recognize the page for the modifications to take effect. The Modify Zones tool is for adding and subtracting zone areas.
  • Page 47 4. Continue until all the zones are appropriately ordered. If you do not number all the zones, they will be automatically numbered when you select another tool or start OCR. Unless you are using the True Page style set, the order of zones determines the order in which text will be placed on a recognized page.
  • Page 48 To connect two or more zones: 1. Click the Modify Zones tool in the Tools palette. 2. Position the mouse pointer in one of the zones you want to connect. 3. Hold the mouse button down and drag the mouse pointer onto the zone(s) you want to connect.
  • Page 49: Table Zones

    2. Select the zone you want to delete by clicking it. Handles appear on the selected zone. • Shift-click to select additional zones. • Double-click the Draw/Select Zones tool or choose Select All in the Edit menu to select all zones on the current page. 3.
  • Page 50: Performing Recognition

    Performing recognition Performing recognition involves analyzing character shapes found in an image and generating editable text from them. This is also referred to as performing OCR. After OCR, you can proofread for recognition errors and misspelled words before you export the text to another application.
  • Page 51: Proofreading Ocr Results

    Proofreading OCR results Recognized text appears in Text view after OCR so you can check for errors and misspellings in the text before exporting it. Error checking (proofing) starts automatically after OCR if you chose OCR & Proof as the OCR option. It starts from the first recognized page and continues through all recognized pages in the document.
  • Page 52 [Figure callouts] This tells why this word is offered for proofing. Click Prefs to select error checking options. This displays the word as OmniPage Pro recognized it. Its color also tells why it is displayed. Drag corner to change window size. Click in this window to enlarge the view of the...
  • Page 53: Verifying Recognized Text

    reached. The program informs you when the end of the document has been reached; all your changes are saved automatically. Note OmniPage Pro can only perform a spelling check on words that it has recognized. It cannot check words that you have manually typed into Text view. To delete unneeded characters (for instance generated by ‘noise’...
  • Page 54: Color Markers

    Color markers Words to be stopped on during proofing may appear in color (red, green or blue) in Text view and in the Proofread OCR dialog box. To temporarily hide color markers in recognized text, make Text view active and choose Hide Markers in the Edit menu. The coloring is removed from all marked words in the current document, and no marking is placed on new pages or documents.
  • Page 55: Working With Documents

    Working with documents The Thumbnail window gives an overview of all pages in the document and allows you to perform page-level operations. The Document window allows you to work with each page one after the other. This section describes the following procedures: Resizing a page display Saving a document as you work Moving to other pages...
  • Page 56: Saving A Document As You Work

    Saving a document as you work If you are working with a long or important document, or want to reopen the document in OmniPage Pro in a future session, you should save it as an OmniPage Document soon after beginning your work. To save the document to disk for the first time, choose Save or Save As...
  • Page 57: Deleting A Page

    Deleting a page You can delete a page from a document that has at least two pages. For example, you may want to delete a page that was poorly scanned. To delete the current page, choose Delete Current Page in the Edit menu.
  • Page 58: Modifying Text

    Erasing areas of an image You can erase areas of the actual image using the Erase Image tool in the Tools palette. This is useful if you want to get rid of smudges, signatures, or other types of “noise” on the page before OCR. 1.
  • Page 59: Printing A Document

    Selecting a block of text Click at the start of the desired text and drag the cursor to the desired end point. Release the mouse button. The selected text is highlighted. With the True Page style set, a selection cannot extend beyond a single frame.
  • Page 60: Listening To A Document

    To select options and print pages: 1. Choose Page Setup... in the File menu. The options available in the Page Setup dialog box depend on your printer. 2. Select the desired options and then click OK. 3. Make the view (Text or Image) from which you want to print active.
  • Page 61: Exporting Documents

    Exporting documents You can export original images or recognition results for use in other applications by: saving an OmniPage Document, saving images, saving recognition results, saving to Portable Document Format (PDF), copying a document to the Clipboard, or using drag-and-drop functionality. Saving an OmniPage Document You can save your document as an OmniPage Document file if you want to reopen it in OmniPage Pro.
  • Page 62: Saving Recognition Results

    Make Image view active and choose Save Images... from the File menu. The Save Images dialog box appears: Define a saving name and location Enter a saving format for the file(s). If you choose these, numerical suffixes will be appended to your file name, to generate unique file names.
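As a rough illustration of how numerical suffixes yield unique file names, the sketch below builds one name per page. The suffix format (zero-padded digits placed directly after the base name) and the function name `numbered_names` are assumptions made for this example; the Guide does not state the exact pattern OmniPage Pro uses.

```python
def numbered_names(base_name, extension, page_count):
    """Build one file name per page by appending a numerical suffix.

    Hypothetical naming scheme: the suffix is zero-padded so that the
    files sort in page order in a directory listing.
    """
    width = max(2, len(str(page_count)))
    return [f"{base_name}{n:0{width}d}.{extension}"
            for n in range(1, page_count + 1)]


print(numbered_names("report", "tif", 3))
# prints ['report01.tif', 'report02.tif', 'report03.tif']
```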
  • Page 63 [Figure callouts] Type in a name and define a location for your file. This is available when True Page is set, for some saving formats. Select it to maintain page layout without frames, so text can flow between columns. Select a save format. Select save options...
  • Page 64: Saving To Portable Document Format (Pdf)

    Saving to Portable Document Format (PDF) When saving to PDF, we recommend you choose the True Page style set, because this forms the basis for saving, whatever style set is chosen. Check that all text is visible within the frame borders. You have four choices when saving recognition results to PDF files.
  • Page 65: Using Drag-And-Drop Functionality

    only plain text is pasted. Graphics are retained if you selected Retain Graphics and the target application supports them. The graphics have the resolution chosen in the OCR panel of the Preferences dialog box. To copy the image from a zone to the Clipboard: 1.
  • Page 66: Direct Ocr

    Dragging from Text view You can drag a block of selected recognized text from Text view to the Desktop or another application that supports drag-and-drop functionality. Text formatting will be transferred if possible. The result appears on the Desktop as a picture clipping icon, and double-clicking on it allows you to view the text only.
  • Page 67: Using Direct Ocr

    Using Direct OCR You can run Direct OCR using automatic or manual processing. For automatic processing, all settings should be selected suitably in OmniPage Pro before using Direct OCR. If you are uncertain whether settings are suitable or not, or if you want to exclude parts of the pages, use manual processing instead.
  • Page 68 until the Direct OCR operation is finished. Proofing starts as soon as the last page is recognized, if OCR & Proof was selected. 5. When recognition or proofing is finished, the recognition results appear at the insertion point in the target application. To use Direct OCR with manual processing: 1.
  • Page 69: Settings

    Chapter 4 Settings This chapter provides more detailed information on the options available in the pop-up menus on the OCR Toolbar and settings you can select in the Preferences dialog box. Make sure that settings are appropriate for your document before you start processing it.
  • Page 70: Ocr Toolbar Options

    OCR Toolbar options The three numbered OCR Toolbar buttons allow you to take a document through each step of the OCR process. The Start button begins automatic processing. You can select options in the five pop-up menus as described below. Get Page button and Original Layout and OCR button with its...
  • Page 71 Scan in Gray Select this to scan paper documents from your scanner with grayscale scanning. Choose this if you wish to retain pictures or photos in your output document. For best OCR accuracy, choose this for lower quality pages, for example with low or varying contrast, or with text on shaded or colored backgrounds.
  • Page 72: Original Layout Options

    Original Layout options You can select from the following options in the Original Layout pop-up menu. These let you describe the incoming pages, to assist the program in auto-zoning. Auto-zoning always runs when you perform automatic processing (unless you load a zone template), and sometimes runs during manual processing.
  • Page 73: Style Set Options

    [Zone Templates] Select the name of a zone template file that you want to use to place zones on new incoming pages. Any zone templates you have created appear at the bottom of the pop-up menu. The example comes from a user who has created two templates to process standardized form-like printed reports –...
  • Page 74 Similar Formats Select this to have results similar to Similar Fonts, but with column widths maintained when multi-column pages are decolumnized. True Page Select this to have the original page layout maintained as closely as possible. Text blocks, headings, tables, graphics and other elements are placed in frames.
  • Page 75: Ocr Options

    OCR options You can select the following OCR options in the OCR pop-up menu. The selected option is activated during manual processing by clicking the OCR button. This performs recognition or training on the current page only. The option is also activated during automatic processing, in which case it may be applied to a series of pages.
  • Page 76: Preference Settings

    For more information, see Saving a document as you work on page 56, Exporting documents (page 61) and Supported file types in online Help. To Clipboard Select To Clipboard to place a copy of a document’s recognition results (text and embedded graphics) on the Clipboard. See Copying a document to the Clipboard on page 64.
  • Page 77 [Figure callouts] Click this to open the Scanner panel. Click this to select an installed scanner, set its parameters and test it. To manually adjust the brightness, drag the slider to left or right. This becomes available as soon as you change a setting. Click this to close the dialog...
  • Page 78 • Select Flipscape to have landscape images rotated by 180 degrees. Flipped and Flipscape options are useful if you are scanning pages in a book and have trouble positioning the book correctly in the scanner. You can also rotate a page image after it is loaded into OmniPage Pro.
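A rotation by 180 degrees, as the Flipscape option applies to landscape images, amounts to reversing the order of the rows and of the pixels within each row. The sketch below shows this on a tiny pixel grid; the function name `rotate_180` and the list-of-rows image representation are assumptions for illustration, not OmniPage Pro code.

```python
def rotate_180(image):
    """Rotate a row-major pixel grid by 180 degrees.

    Reversing the row order flips the image top to bottom; reversing each
    row flips it left to right. Together they give a half turn.
    """
    return [row[::-1] for row in image[::-1]]


upside_down = [[1, 2, 3],
               [4, 5, 6]]
print(rotate_180(upside_down))  # prints [[6, 5, 4], [3, 2, 1]]
```

Applying the function twice returns the original grid, which is a quick way to check that it really is a half turn.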
  • Page 79 Brightness The brightness setting for scanning a page works like that on a photocopier. This setting can compensate for variations in paper and print quality, so it can have a big influence on OCR accuracy. Click the Manual Brightness check box and move the slider to lighten or darken the brightness for your scanning.
  • Page 80: Ocr Settings

    OCR settings Click the OCR icon in the Preferences dialog box to select accuracy and output options. [Figure callouts] Click this to see the OCR panel. Use this to decide which character will replace unrecognizable characters in the output. Character Type Select a setting to characterize the printed text on your pages in the Character Type pop-up menu.
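The effect of choosing a replacement for unrecognizable characters can be pictured with a short sketch. It assumes, purely for illustration, that recognition yields a sequence in which each unrecognized position is marked with `None`; the function name `render_text` and the default `~` are hypothetical and are not taken from the Guide.

```python
def render_text(recognized, reject_char="~"):
    """Join recognized characters, substituting `reject_char` for any
    position the recognizer could not identify (marked here as None)."""
    return "".join(reject_char if ch is None else ch for ch in recognized)


# The third and ninth characters could not be recognized in this example.
result = ["O", "m", None, "i", "P", "a", "g", "e", None]
print(render_text(result))        # prints Om~iPage~
print(render_text(result, "?"))   # prints Om?iPage?
```

Picking a character that does not otherwise occur in the document makes the substituted positions easy to find with a text search afterwards.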
  • Page 81 Training files are useful for recognizing characters that prove difficult to recognize or are being regularly misrecognized. To create a training file, see Training OCR on page 97. Retain Graphics switch Select Retain Graphics if you want OmniPage Pro to retain original graphics, such as photographs or drawings, in the recognized document.
  • Page 82: Spelling Settings

    The settings have no effect on recognition accuracy, nor on the display of the embedded images in Text view. They are not used when saving to OmniPage Documents, nor when saving page images, nor when exporting single graphics zones or areas by drag-and-drop or through the Clipboard.
  • Page 83 Main Language The Main Language pop-up menu enables you to choose the main language for the page(s) you intend to recognize. Your choice determines which characters are validated for recognition and which main dictionary will be used. The languages available are Danish, Dutch, English (UK and US), Finnish, French, German, Italian, Norwegian, Portuguese (Standard and Brazilian), Spanish and Swedish.
  • Page 84 Note It is possible to read more languages than those offered as main and secondary languages, providing you disable the Language Analyst and make a suitable language selection. See Supported languages on page 110 for advice. User Dictionary Select a user (personal) dictionary in the User Dictionary pop-up menu.
  • Page 85: Miscellaneous Settings

    Miscellaneous settings Click the Miscellaneous icon on the left of the Preferences dialog box to select options for table handling, scripting and the Direct OCR feature. [Figure callout] Click this to see the Miscellaneous panel. Tables Select Retain Table Grids to have gridded tables in the original document placed in grids in Text view after they are recognized.
  • Page 86 Direct OCR settings should be selected before you use the Direct OCR feature because they influence what happens as soon as you use • Select Begin Processing Automatically on Launch if you want OmniPage Pro to trigger the Start button as soon as you activate Direct OCR.
  • Page 87: Customizing Ocr

    Chapter 5 Customizing OCR OmniPage Pro X has many features that allow you to customize the way your documents are handled during OCR and how they appear after recognition. This chapter describes how to use these facilities. Please continue reading for information on the following topics: Specifying the style set Applying and editing zone styles Zone templates...
  • Page 88 The following tables give an overview of the built-in style sets and the zone styles offered by each of them. Four of these style sets define basic formatting levels. These cannot be deleted and allow only limited editing. They are useful mainly for processing documents automatically or for applying standard formatting during manual processing.
  • Page 89 All four styles can transmit graphics. For the first three, the graphics are placed at the end of the recognized text. In True Page the graphic is placed in a frame in its location on the original page. All four styles can accept tables.
  • Page 90: Specifying A Global Style Set

    Specifying a global style set Select a style set from the Style Set pop-up menu in the OCR Toolbar. The selected style set is applied to all incoming pages until you change the setting. A new setting here has no effect on existing pages, even if you re-recognize them.
  • Page 91: Applying And Editing Zone Styles

    To create a style set: Choose Style Sets... in the Edit menu. A dialog box appears displaying all available style sets. Click the button that creates a new style set. The New Style Set dialog box appears. Enter a name for your style set. For example, you could enter Bibliography as the name if you are creating a style set for handling bibliographies.
  • Page 92 To apply styles to existing zones: Make Image view active. The Zone Info palette appears. Check that the style set for the page is suitable. Change it if desired. Click the Draw/Select Zones tool in the Tools palette if it is not already selected.
  • Page 93 The Edit Style Set dialog box lists the zone styles in the set. [Figure callouts] Click to make font mapping selections for the entire style set. The currently selected zone style. Settings for the currently selected zone style. Specimen text for the current zone style. Click the name of the zone style you want to edit.
  • Page 94: Font Mapping

    The last three settings define the left and right limits of the text area and first-line indenting. Choose Auto to let OmniPage Pro decide the values. Enter numerical values or drag the markers in the ruler to change settings. The panel below the ruler displays the effects of your settings. Repeat the above steps to edit other zone styles.
  • Page 95 Monospaced Serif: Character width is the same for each character; short lines finish off the letter strokes. The default font is Courier. Monospaced Sans-Serif: Character width is the same for each character; letter strokes do not have finishing lines. The default font is ... Note: Font mapping is not applicable to the Plain Format style set.
  • Page 96: Zone Templates

    Zone templates You can use a zone template to quickly and efficiently create zones on documents that have the same zoning requirements. For example, if you frequently process documents with layouts and content that require the same type of zoning, you can create and save a zone template and apply it to all such pages or documents.
  • Page 97: Training Ocr

    To remove a zone template: • Select a non-template setting in the Original Layout pop-up menu on the OCR Toolbar. OmniPage Pro will no longer place template zones on incoming page images. This does not remove template zones from existing zoned pages.
  • Page 98 Click the OCR button. OmniPage Pro analyzes the page and opens the Training File dialog box. Original character images are displayed along with OmniPage Pro’s interpretation of each character. Characters appear in the alphabetical order of their interpretations. [Figure callouts: original image; OmniPage Pro’s interpretation.] Most characters do not need to be trained.
  • Page 99 edit box, or click a non-keyboard character in the scrolling Code display to add it to the edit box. In our example, the ‘H’ has been cleared and ‘//’ entered. Click OK to accept the character specification. The Training File dialog box reappears. Repeat steps 5–7 to continue specifying characters.
  • Page 100 To edit a training file: Choose Training Files... in the Edit menu. The Training Files dialog box lists all training files in the Training Files folder. Double-click the training file you want to edit, or select it and click Open. The Training File dialog box displays the characters in the training file you specified.
  • Page 101: User Dictionaries

    User dictionaries Dictionaries are used to assist recognition and provide suggestions during proofing. A user dictionary is a personal dictionary that you build and customize to supplement a built-in main dictionary. Entries for a user dictionary must consist of 2 to 32 characters, without spaces or control characters, such as tabs.
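    The entry rules above (2 to 32 characters, no spaces, no control characters such as tabs) can be checked before you type or import a word list. The following is only an illustrative sketch: the function name is_valid_entry is hypothetical and is not part of OmniPage Pro, and it assumes that "characters" means Unicode code points.

```python
import unicodedata

MIN_LEN = 2   # shortest entry the guide allows
MAX_LEN = 32  # longest entry the guide allows


def is_valid_entry(entry: str) -> bool:
    """Return True if entry meets the stated user dictionary rules.

    Hypothetical helper for illustration; not an OmniPage Pro function.
    """
    if not MIN_LEN <= len(entry) <= MAX_LEN:
        return False
    for ch in entry:
        # Reject spaces and other whitespace such as tabs.
        if ch.isspace():
            return False
        # Reject control characters (Unicode general category "Cc").
        if unicodedata.category(ch) == "Cc":
            return False
    return True


candidates = ["OCR", "x", "Scan Soft", "tab\tbed", "OmniPage"]
accepted = [word for word in candidates if is_valid_entry(word)]
print(accepted)  # ['OCR', 'OmniPage']
```

    Here "x" is too short, "Scan Soft" contains a space, and "tab\tbed" contains a tab, so only two of the five candidates pass.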
  • Page 102: Settings Files

    Optionally, click Export... to save your user dictionary as a plain text file, for protection or use outside the program. Click Done to save the changed state of your user dictionary within the program and exit. User dictionaries are saved in the User Dictionaries folder within your installation folder.
  • Page 103: Technical Information

    Chapter 6 Technical information This chapter provides troubleshooting and other technical information to help you use OmniPage Pro X. Please also consult the PDF Readme file and other online help topics, or visit the Support section of the ScanSoft web pages, which answers Frequently Asked Questions (FAQ) and provides other useful guidance.
  • Page 104: Troubleshooting

    Troubleshooting Solutions to try first Try these solutions if you experience problems starting the program: Ensure that your system meets all requirements listed under System requirements in chapter 1. Make sure that your scanner is plugged in and that all cable connections are secure.
  • Page 105: Low Disk Space Situations

    Do not scan in color unless you need colored graphics in your output files. Prefer Web color or 256 colors (8-bit pixel depth) rather than True color (24-bit depth) or similar choices. To adjust preferred memory size for an application under OS 9.X: Make sure OmniPage Pro X is closed.
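    To see why color depth matters for memory, it can help to estimate the uncompressed size of one page image at different bit depths. This is a rough sketch under stated assumptions: the page size (8.5 x 11 inches) and resolution (300 dpi) are example values chosen here, the function name raw_image_bytes is hypothetical, and the figures ignore any compression or per-row padding the program may apply.

```python
def raw_image_bytes(width_in: float, height_in: float, dpi: int, bits_per_pixel: int) -> int:
    """Estimate the uncompressed size in bytes of a scanned page image.

    Hypothetical helper for illustration; not an OmniPage Pro function.
    """
    width_px = round(width_in * dpi)
    height_px = round(height_in * dpi)
    total_bits = width_px * height_px * bits_per_pixel
    # Round up to whole bytes using integer arithmetic.
    return (total_bits + 7) // 8


# Assumed example: a letter-size page scanned at 300 dpi.
for depth in (1, 8, 24):
    size = raw_image_bytes(8.5, 11, 300, depth)
    print(f"{depth:>2}-bit: {size:,} bytes")
```

    Under these assumptions an 8-bit image needs 8,415,000 bytes, while the same page at 24 bits per pixel needs three times as much, 25,245,000 bytes. A 1-bit black-and-white scan needs about one eighth of the 8-bit figure.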
  • Page 106 With low-quality originals, sometimes a good-quality photocopy can yield better OCR results. This may be true for documents with low contrast or printed on thin paper. On the other hand, poor-quality photocopies with stripes, blotches or uneven brightness will usually give worse results. Page images should be free of notes, lines, doodles or spots.
  • Page 107 accuracy. The program will not open image files with resolutions below 200 dpi. If this happens and you have the documents on paper, scan them again with better settings. Ensure zones are suitable Look at the original page images and ensure that all required text areas are enclosed by text zones.
  • Page 108: Improving Fax Recognition

    If you are getting poor results with a training file loaded, check its contents by choosing Training Files... from the Edit menu. Make sure the training file is appropriate for the current document. If it is not, either unload it or edit its contents to remove training from poorly formed character shapes.
  • Page 109: Interface Problems And Solutions

    Interface problems and solutions The Start button is disabled. Be sure Train OCR is not selected in the OCR pop-up menu. Training can only be done on a single page at a time. The Save button in the Preferences dialog box is grayed. Change a setting in one of the panels; the button then becomes available.
  • Page 110: Supported Languages

    Supported languages The program supports thirteen languages with a main dictionary and Language Analyst. The program can recognize other languages, but without these facilities. To read text in these languages, select the language(s) indicated and deselect Use Language Analyst in the Preferences dialog box.
  • Page 111: Supported Saving Formats

    The accented letters used in less widely spoken languages may vary with dialects, variants, changes over time and transcription norms. Therefore, this table can serve only as a general guide. Supported saving formats Recognition results can be saved to a wide range of target applications and saving formats.
  • Page 112: Supported Image File Formats

    Supported image file formats Page images can be acquired from image files. Scanned images can be saved to file: current page only, all document pages (one file per page or one multipage file), or each graphic zone on a page to a separate file.

This manual is also suitable for:

Omnipage pro x
