Table of Contents

Advertisement

Quick Links

OmniPage
Web
®
User's Manual

Advertisement

Table of Contents
loading
Need help?

Need help?

Do you have a question about the OMNIPAGE WEB and is the answer not in the manual?

Questions and answers

Subscribe to Our Youtube Channel

Summary of Contents for NUANCE OMNIPAGE WEB

  • Page 1 OmniPage ® User’s Manual...
  • Page 2 Version 1 Copyright© 1999 Caere Corporation. All rights reserved. The Caere logo, Caere®, OmniPage®, OmniPage Web®, PageKeeper®, Language Analyst®, 3D OCR®, AutoWeb Toolbar™, and OCR Proofreader are trademarks of Caere Corporation All other trademarks are the property of their respective companies.
  • Page 3: Table Of Contents

    Product Support ........................4 Chapter 1 Installation and Setup Minimum System Requirements..................6 Installing OmniPage Web ....................6 Setting Up Your Scanner with OmniPage Web ..............7 Starting OmniPage Web .......................8 Registering OmniPage Web ....................9 Chapter 2 Introduction to OmniPage Web What Is Optical Character Recognition (OCR)?..............12 What Is Outlining? ......................12...
  • Page 4 Deleting Pages ......................38 Printing a Document ....................38 Closing a Document ....................38 Saving a Document ......................39 Testing Your HTML Document ..................41 Chapter 4 OmniPage Web Settings Setting AutoWeb Toolbar Commands................44 AUTO Button Commands ..................45 Image Button Commands ...................46 Zone Button Commands.....................47 OCR Button Commands .....................48...
  • Page 5 Scanner Setup Issues......................78 Scanner Drivers Supplied by the Manufacturer............78 Scanner Drivers Supplied by Caere ................79 Scan Manager is Needed with OmniPage Web ............79 Problems Connecting OmniPage Web to Your Scanner ........80 Missing Scan Image Command ...................80 Scanner Message on Launch ..................81 System Crash Occurs While Scanning ..............81...
  • Page 7: Welcome

    The following documentation has been provided to help you learn about OmniPage Web. This User’s Manual This manual introduces you to the basics of using OmniPage Web. It includes installation and setup instructions, an introduction to OmniPage Web, task-oriented instructions, ways to customize processing, settings guidelines, and technical information.
  • Page 8: Using This Manual

    Using This Manual Using This Manual This manual is written with the assumption that you know how to work in the Microsoft Windows environment. Please refer to your Windows documentation if you have questions about how to use dialog boxes, menu commands, scroll bars, drag and drop functionality, shortcut menus, and so on.
  • Page 9: Getting Online Help

    Getting Online Help Getting Online Help In addition to using this manual, you can use OmniPage Web’s online Help to learn about features, settings, and procedures. Online Help is available after you install OmniPage Web. Choose How to Use Help... in OmniPage Web’s Help menu to get information on using Windows Help.
  • Page 10: Product Support

    Caere on the Web ! Caere Web site in the Help menu. Caere’s Web site address is www.caere.com. To connect to the OmniPage Web site for the latest information on using OmniPage Web, choose Caere on the Web ! OmniPage Web in the Help menu.
  • Page 11: Chapter 1 Installation And Setup

    This chapter provides information on installing and starting OmniPage Web. Please continue reading for information on these topics: • Minimum System Requirements • Installing OmniPage Web • Setting Up Your Scanner with OmniPage Web • Starting OmniPage Web • Registering OmniPage Web...
  • Page 12: Minimum System Requirements

    Before installing OmniPage Web: • If you are using a scanner with OmniPage Web, make sure it is connected, turned on, and compatible with your system. • Close all other applications, especially anti-virus programs.
  • Page 13: Setting Up Your Scanner With Omnipage Web

    Read the Scanner Setup Notes for more information on Caere Scan Manager and supported scanners. You can open the Scanner Setup Notes after OmniPage Web has been installed by clicking Start in the Windows taskbar and choosing Programs ! Caere Applications ! Caere Documents ! Scanner Setup Notes.
  • Page 14: Starting Omnipage Web

    Starting OmniPage Web Starting OmniPage Web To start OmniPage Web, click Start in the Windows taskbar and choose Programs ! Caere Applications ! OmniPage Web 1.0. Or, double-click the OmniPage Web icon on your Windows desktop. OmniPage Web’s desktop appears when you open OmniPage Web. See “The OmniPage Web Desktop”...
  • Page 15: Registering Omnipage Web

    Registering OmniPage Web Registering OmniPage Web Register your copy of OmniPage Web with Caere Corporation to receive access to product support, notification of special offers, and the best prices on product upgrades. To register OmniPage Web: Click the Register menu to open the Register dialog box.
  • Page 16 Chapter 1...
  • Page 17: Chapter 2 Introduction To Omnipage Web

    OmniPage Web outlines the document structure and creates a complete, dynamic Web site with separate Web pages for each chapter or section. OmniPage Web even creates hypertext links, navigation tools, and a hyperlinked table of contents.
  • Page 18: What Is Optical Character Recognition (Ocr)

    Outlining is the process of examining the structure of a document, detecting original document elements (called objects in OmniPage Web), and creating hypertext links. OmniPage Web can recognize and outline these objects in the original document during outlining: • Headline (the title of the document) •...
  • Page 19: Basic Steps Of Creating A Web Page

    You can scan a paper document or load an image file. The resulting image appears in OmniPage Web’s image view. See “Bringing Document Images into OmniPage Web” on page 24 for more information. Create zones to identify areas you want to recognize as text or retain as graphics.
  • Page 20: The Omnipage Web Desktop

    The OmniPage Web Desktop The OmniPage Web Desktop Before a document is outlined, OmniPage Web’s desktop displays the pages of the open document in its thumbnail view, image view, and text view. You can use buttons in the Standard, AutoWeb, and Zone toolbars to perform various tasks on the document.
  • Page 21 The OmniPage Web Desktop After a document is outlined, OmniPage Web’s desktop displays the document outline in outline view, the original image in image view, and a preview of the HTML document in HTML view. Outline The image view toolbar displays the current page’s...
  • Page 22: Autoweb Toolbar

    The OmniPage Web Desktop AutoWeb Toolbar The AutoWeb toolbar contains buttons that can activate each step of the HTML-conversion process. Image Zone Outline AUTO Export button button button button button button Click the down arrow to display the commands in a button’s drop-down list.
  • Page 23: Standard Toolbar

    The OmniPage Web Desktop Standard Toolbar The Standard toolbar contains buttons and a drop-down list for performing standard tasks. Image Save Proofread Undo HTML Straighten Zoom Editor Option Image Rotate Help View Options Open Print Copy Image Zone Toolbar The Zone toolbar contains buttons that allow you to draw and define zones on a page image.
  • Page 24: Outline Toolbar

    You can select settings for processing in the Options dialog box. To open it, click the Options button or choose Options... in the Tools menu. Click the tabs in the Options dialog box to view and select different settings. See Chapter 4, OmniPage Web Settings, for more information on settings. Chapter 2...
  • Page 25: Html Options Dialog Box

    To open it, click the HTML Options button or choose HTML Options... in the Tools menu. Click the tabs in the HTML Options dialog box to view and select different settings. See Chapter 4, OmniPage Web Settings, for more information on settings. Introduction to OmniPage Web...
  • Page 26 Chapter 2...
  • Page 27: Chapter 3 Processing Documents

    Chapter 3 Processing Documents This chapter describes how to work with documents in OmniPage Web, including each step of converting paper documents to HTML. There are different ways to accomplish the same tasks in OmniPage Web. You can use toolbar buttons or menu commands to start procedures.
  • Page 28: Ways To Process Documents

    Ways to Process Documents OmniPage Web instantly turns a paper document into an HTML file that you can publish as a Web page. The basic steps of OmniPage Web’s HTML-conversion process are explained on page 13. The following is a summary of those steps.
  • Page 29: Automatic Processing

    Each page of the document is processed and finished in order according to the selected commands. If page images in an open document already have zones, OmniPage Web will skip zoning those pages and continue with the selected OCR, outline, and export operations.
  • Page 30: Performing Multiple Tasks At Once

    When these tasks are complete, you can begin outlining your documents. Bringing Document Images into OmniPage Web You can bring document images into OmniPage Web by scanning pages or loading image files. Select the desired image resolution in the Page Format tab of the Options dialog box before loading or scanning a color or grayscale image.
  • Page 31: Loading Image Files

    Loading Image Files You can load image files into OmniPage Web. An image file is an electronic picture of text, such as a scanned paper document or an electronic fax, that is saved in an image file format such as PCX or TIFF.
  • Page 32 Image files are loaded in the order selected and combined into one working document. If you have electronic fax files that you want to convert to editable text, save the fax files in TIFF format and load them into OmniPage Web using the Load Image command. Chapter 3...
  • Page 33: Creating Zones For Ocr

    OCR. The easiest way to create zones on a page is to let OmniPage Web do it automatically for you. However, you may want to draw zones manually if you want to customize the way your page will be processed. For example, if you only want to process certain areas of a page, you would manually draw zones around the desired areas.
  • Page 34: Performing Ocr On A Document

    Click the Zone button or choose Auto Zones in the Process menu. OmniPage Web automatically draws zones on the current page in the image view. Each zone has a number indicating its order and a picture indicating its zone type.
  • Page 35: Proofreading Ocr Results

    Proofreading starts automatically if you chose OCR and Proof as the OCR process command. OmniPage Web marks suspected errors in green and inserts a red “reject” character for any character it cannot recognize. To turn off these color markers, choose Show Markers in the View menu so that it is deselected.
  • Page 36: Modifying Words

    Outlining a Document Once an image has been loaded, zoned, recognized, and proofread, OmniPage Web then examines the document structure and creates an outline of the structural elements, called objects. Before you proceed to the outlining step, make sure you have loaded all pages, placed them in the desired order, and checked the type and order of the zones.
  • Page 37: Editing Outline Results

    HTML document. OmniPage Web marks each object with an icon that shows what part of the document it is. The same icon appears next to the corresponding part of the document in HTML view so you can see a preview of how the object will appear in the HTML document.
  • Page 38 Select which objects you want to see in the outline and click this button. These are the objects that OmniPage Web looks for in your document during outlining. To change the outline hierarchy: Highlight the object that you want to change in outline view.
  • Page 39: Selecting Html Components

    Selecting HTML Components Selecting HTML Components You can make your Web site even more usable by adding HTML components. Components are parts of an HTML document that make it interesting and functional, such as a hyperlinked table of contents, copyright notice, or navigation panel. To select and format HTML Components: Click the HTML Options button in the Standard toolbar or choose HTML Options...
  • Page 40: Working With Documents

    Working with Documents Working with Documents OmniPage Web’s thumbnail, image, text, outline, and HTML views allow you to look at and work with pages in the current document. Once pages are recognized, the image, text, and thumbnail views are visible.
  • Page 41: Resizing A Page View

    Working with Documents Once recognition is complete, OmniPage Web analyzes the structure of your document and creates an outline. After outlining, the thumbnail view is hidden behind the outline view, and the text view is replaced by the HMTL view.
  • Page 42: Changing Pages

    Working with Documents You can also click your right mouse button in the view you want to resize and select a size option in the shortcut menu. (If you are resizing the image view, click outside of a zone.) Changing Pages Before outlining, the thumbnail view, image view, and text view all display the same page of a document.
  • Page 43: Reordering Pages

    • Click the Next Page or Previous Page buttons at the lower-right corner of the OmniPage Web desktop. • Choose Next Page, Previous Page, or Go to Page... in the Edit menu. Reordering Pages You can reorder pages in a document by dragging their thumbnails to different positions in the thumbnail view.
  • Page 44: Deleting Pages

    Working with Documents Deleting Pages If you delete a page from a document in OmniPage Web, the thumbnail, original image, and recognized text for that page are all deleted. To permanently delete pages: • Choose Delete Current Page in the Edit menu to delete the currently displayed page.
  • Page 45: Saving A Document

    HTML file type. Be sure to view your document on as many browsers as possible to be sure the formatting is supported. To re-open your document in OmniPage Web, save it as an OmniPage Web document (*.wmt). Type in a file name and select save options.
  • Page 46 File menu to save changes to the current document as you work. The Save As dialog box appears the first time you choose Save if a document has not been saved as an OmniPage Web Document or HTML file. To save and launch your Web browser: Set Save and Launch as the command in the Export button’s...
  • Page 47: Testing Your Html Document

    Components tab of the HTML Options dialog box, and your Web page does not appear in the browser as it was formatted in OmniPage Web, you may not have a browser that supports advanced formatting. Deselect Use style sheets, and check the Web page in your browser again.
  • Page 48 Testing Your HTML Document Chapter 3...
  • Page 49: Chapter 4 Omnipage Web Settings

    This chapter describes the settings in the AutoWeb toolbar, the Options dialog box, and the HTML Options dialog box. Please also look in OmniPage Web’s online Help for more detailed information on settings. The settings you select for processing documents can greatly affect HTML results.
  • Page 50: Setting Autoweb Toolbar Commands

    The AutoWeb toolbar buttons allow you to take a document through each step of the process. Every toolbar button has different process commands that can be set for the operations you want to perform. OmniPage Web can go through all steps automatically, or you can start each step individually. Zone...
  • Page 51: Auto Button Commands

    See “Automatic Processing” on page 23 for more information. Web Wizard For new documents, select Web Wizard to have the Web Wizard guide you through the entire HTML-conversion process. See “Using the Web Wizard” on page 22 for information. OmniPage Web Settings...
  • Page 52: Image Button Commands

    Setting AutoWeb Toolbar Commands Image Button Commands Use the Image button to bring a document image into OmniPage Web’s image view. The Image button’s drop-down list contains the Load Image and Scan Image commands. Load Image Select Load Image to load existing image files such as TIFF, DCX, BMP, JPG, or PCX files.
  • Page 53: Zone Button Commands

    Spreadsheet Pages Select Spreadsheet Pages to have OmniPage Web automatically draw and order zones on pages that have information arranged in rows and columns such as spreadsheets.
  • Page 54: Ocr Button Commands

    Perform OCR and OCR and Proof commands. Perform OCR Select Perform OCR to recognize text on document images. During OCR, OmniPage Web analyzes the image and identifies characters to produce editable text. See “Performing OCR on a Document” on page 28 for more information.
  • Page 55: Outline Button Commands

    Outline and Defer Outlining commands. Outline Select Outline to outline the recognized document structure. During outlining, OmniPage Web detects original objects such as headings, body text, headers and footers, and links cross-references, e-mail addresses, and URLs to their destinations. See “Outlining a Document”...
  • Page 56: Export Button Commands

    Defer Export Select Defer Export if you do not want to save your document right after automatic processing. OmniPage Web will process your document up to the point of export and then stop. This gives you the opportunity to proofread and edit your document before export.
  • Page 57: Selecting Options

    To get the best results, learn how to identify document characteristics and make selections for them. You may have to experiment with different settings to get the results you want. OmniPage Web Settings...
  • Page 58: Accuracy Settings

    Selecting Options Accuracy Settings Click the Accuracy tab to select settings that affect OCR accuracy. The Language Analyst evaluates and replaces unknown words with Select the type words most likely to be of characters correct during OCR. that are in your document.
  • Page 59: Scanner Settings

    This is recommended for feeder. pages with colored backgrounds, colored text, or pages containing Use the slider to grayscale grphics. adjust the brightness. This is recommended for pages with color graphics that you want to save. OmniPage Web Settings...
  • Page 60: Page Format Settings

    The resolution is the number of dots, or pixels, that make up an image. A higher resolution will produce a better quality image. The resolution cannot be changed after an image has been loaded into OmniPage Web. Chapter 4...
  • Page 61: Language Settings

    OmniPage Web is intended for English-only documents. If you are processing a foreign-language document, it may be difficult for OmniPage Web to accurately determine the document structure, and your outline results may be incorrect.
  • Page 62: Process Settings

    Selecting Options Process Settings Click the Process tab to set commands and settings for each step of OCR. The Web Wizard will guide you through the HTML-conversion process when you click the AUTO These specify button on the AutoWeb the processing toolbar.
  • Page 63: Selecting Html Options

    Click the HTML Options button or choose HTML Options... in the Tools menu to open the HTML Options dialog box. This is the central location for HTML settings. Click each tab to view and select different settings. Click for a description of each setting. OmniPage Web Settings...
  • Page 64: General Settings

    Selecting HTML Options General Settings Click the General tab to set commands and settings for your HTML document. Select this to Select this if you do have OmniPage Web create a not want your HTML link to the document formatted. original page image in your Specifies what you...
  • Page 65: Components Settings

    HTML document, and where you want the components to appear on the final Web page. Select options Select the order in for each which you want the component. components to appear on your Web page by clicking the up and down arrow buttons. OmniPage Web Settings...
  • Page 66: Component Styles Settings

    Selecting HTML Options Component Styles Settings Click the Component Styles tab to select formatting options for each component in your HTML document. Select this for more formatting options if you know your visitors have browsers that support cascading style sheets. Available formatting Select the...
  • Page 67: Chapter 5 Customizing Your Web Page

    Please continue reading this chapter for information on these topics: • Making Your Web Page More Effective • Using Themes • Making Your Web Page More Effective • Customizing Zones • Creating User Dictionaries • OmniPage Web’s user dictionaries are saved in the data folder in your installation folder.
  • Page 68: Making Your Web Page More Effective

    Making Your Web Page More Effective Making Your Web Page More Effective Organizing electronic documents is a challenge, but if done well, can allow your Web page visitors to quickly navigate through large amounts of information and cross-reference other topics without having to dig through unnecessary text.
  • Page 69 Making Your Web Page More Effective If your image is dark, make sure you change the text colors to light shades so that they show up, and that you make the document background color dark. Otherwise, if the image fails to load (or takes a long time to load), the text will be unreadable.
  • Page 70: Using Themes

    Themes allow you to instantly format your HTML document, and are useful to create consistantly-formatted Web pages. OmniPage Web provides a selection of fun and professional themes for you to use, or you can create and save one of your own.
  • Page 71 Using Themes To save a new theme: Open the HTML Options dialog box and select one of the provided themes, or begin selecting your own settings. Click Save Themes... to open the Save Themes dialog box. Type in a file name for the new theme. All the current settings in the HTML Options dialog box are saved as a theme file with an .hfo extension.
  • Page 72: Adjusting Page Images Before Ocr

    Or, choose Straighten Image in the View menu. OmniPage Web straightens the page image up to a maximum of 10 degrees. OmniPage Web will not straighten a page if it determines that it is unnecessary. It is recommended that you have OmniPage Web automatically...
  • Page 73: Customizing Zones

    Customizing Zones Customizing Zones Zones are borders created around areas of a page image to identify what will be recognized as text or retained as a graphic during the HTML- conversion process. Zones play a big part in determining outline results. You can create zones automatically, manually, or with a template.
  • Page 74: Reordering Zones

    Customizing Zones Reordering Zones The numbered order of zones determines the order in which text will be placed on a recognized page, objects will be placed in the outline, and components will appear in the HTML document. Make sure the zone order is acceptable before performing OCR and outlining your document.
  • Page 75: Deleting Zones

    Customizing Zones Hold down the mouse button and drag the handle in the direction that you want to enlarge or reduce the zone. Release the mouse button when you are done. The zone border changes to display the modified zone area. Deleting Zones You can delete the current zones if you want to create new zones.
  • Page 76 Zone Content All text zones on a page also have a zone-content setting. This specifies the characters OmniPage Web looks for within a zone during OCR. You can select Alphanumeric or Numeric as the zone-content setting. For example, if a particular zone only contains numbers and mathematical signs, you can specify the contents of that zone to be Numeric.
  • Page 77 Select a zone type for the selected zones. If you change an irregular-shaped zone to a Table type zone, OmniPage Web substitutes the largest rectangle that fully encloses the irregular area. Select a zone content for the selected zones.
  • Page 78: Creating User Dictionaries

    Delete All to remove all words from the dictionary. • Click Import... to add words from a text file. Click Close when you are finished editing the user dictionary. OmniPage Web’s user dictionaries are saved in the data folder in your installation folder. Chapter 5...
  • Page 79: Chapter 6 Technical Information

    Scanner Setup Notes list all supported scanners and any connection or software-driver issues. The Readme file contains last-minute information relating to OmniPage Web. To open these documents, click Start in the Windows taskbar and choose: Programs ! Caere Applications ! Caere Documents ! Scanner Setup Notes or Readme.
  • Page 80: General Troubleshooting Solutions

    General Troubleshooting Solutions General Troubleshooting Solutions Although OmniPage Web is designed to be easy to use, problems sometimes occur. Many of the onscreen error messages contain self- explanatory descriptions of what to do — check connections, close other applications to free up memory, and so on. Sometimes that is all the troubleshooting help you need.
  • Page 81: Testing Omnipage Web

    OmniPage Web has stopped running altogether. See Windows online help for more information. Your scanner will not run with OmniPage Web in safe mode or VGA mode, so do not test scanner problems in this configuration. To test OmniPage Web in safe mode (Windows 95 or 98): Restart your computer in safe mode by pressing F8 immediately after you see the “Starting Windows”...
  • Page 82: Low Memory Problems

    General Troubleshooting Solutions Low Memory Problems OmniPage Web may run poorly under low-memory conditions. This may be indicated by various error messages or if OmniPage Web works slowly and accesses the hard drive often. Try these solutions for low memory conditions: •...
  • Page 83: Supported File-Format Types

    OmniPage Web Document (*.wmt) Saving Image Files OmniPage Web saves each page of a multiple-page image separately. If you select Save all pages in the Save Image dialog box, Page#### (where #### is the four digit page number) is appended to file names to distinguish separately saved pages.
  • Page 84: Scanner Setup Issues

    † HTML (*.htm) OmniPage Web document(*.wmt) † When OmniPage Web saves a document in HTML format, additional files are created. These files may include graphics files, image map files, or cascading style sheet files (*.css). Scanner Setup Issues This section contains information on setting up your scanner and solutions for scanning problems you may encounter.
  • Page 85: Scanner Drivers Supplied By Caere

    Scanner Setup Issues Scanner Drivers Supplied by Caere OmniPage Web is shipped with special scanner drivers that allow it to communicate with supported scanners. These scanner driver files are installed on your computer when you install Caere Scan Manager. These drivers often work in conjunction with the drivers from your scanner manufacturer.
  • Page 86: Problems Connecting Omnipage Web To Your Scanner

    Web and your scanner or if you receive a scanner error message when you launch OmniPage Web. • Make sure the scanner is supported by OmniPage Web with your version of Windows 95 or 98, or Windows NT. A list of tested scanners is provided in the Scanner Setup Notes.
  • Page 87: Scanner Message On Launch

    Scanner Setup Issues Scanner Message on Launch The first time you launch OmniPage Web after installing or changing your current scanner in the Caere Scan Manager, you may get this message: This scanner’s configuration is set using the system-level driver. If it asks for no more information, click OK in the dialog box.
  • Page 88: Scanning Tips

    Scanner Setup Issues Scanning Tips OCR results will be poor if an image is not scanned properly. Remember the following tips when you scan: • Take the color and quality of your document into account when scanning. High-quality documents return better recognition results than low-quality documents.
  • Page 89: Ocr Problems

    • Restart Windows 95 or 98 in safe mode or Windows NT in VGA mode and test OmniPage Web by performing OCR on the included Sample.tif. See “Testing OmniPage Web” on page 75. • If you are performing multiple tasks at once, such as recognizing and printing, OCR may take longer.
  • Page 90: Problems With Fax Recognition

    • Make sure the correct main and secondary document languages are selected in the Language settings. Omnipage Web is intended for English-only documents, but can sometimes recognize text in other languages. For best results, only process English documents.
  • Page 91 13 user dictionaries 72 AutoWeb toolbar Creating zones Editing graphics described 16 automatically 27 see OmniPage Web’s online Export button 50 Current document, finishing 23 help Image button 46 Custom Editor, HTML 6 location of 8, 14...
  • Page 92 40 see OmniPage Web’s online saving recognized text 39 help New documents, automatically Image files processing 23 loading 25 saving 40 supported types 77 Fax files 26 Image viewer 8, 14, 15 Faxes Objects Images improving recognition...
  • Page 93 The basic steps of creating a Web page 13 missing Scan Image command Themes 64 Thumbnail viewer 14, 15 RAM requirements 76 pages into OmniPage Web 24 changing pages in 36 Recognized text system crash during 81 reordering pages in 37 modifying 30...
  • Page 94 13 making it more effective 62 testing 13 Web Wizard using 22 Windows NT memory requirement for 6 testing OmniPage Web on 75 Wizard, for OCR 45 Wizard, Web 22 Word see Microsoft Word Working with documents 34 Index...

Table of Contents