154
By default, each indexing thread uses as much memory as is available from the
system.
-maxnumdoc
Syntax:
Specifies the maximum number of documents to be downloaded or submitted for
indexing. The value for num_docs does not necessarily correspond exactly to the
number of documents indexed. The following factors affect the actual number.
Whether or not the value of
-submitsize
Whether or not documents retrieved are actually indexed because they are invalid or
corrupt.
-mimemap
Syntax:
Specifies a control file (simple ASCII text) that maps file extensions to MIME-types.
This allows you to make custom associations and override defaults.
The format for the control file is:
#file_ext_no_dot
abc
-nocache
Type: Web crawling only
Used with
Web site indexing. This has the effect of decreasing the demands on your disk space.
Normally, Verity Spider downloads URLs and then writes them to a bulk insert file
and downloads the documents themselves. When indexing occurs, once
-submitsize
use
so the documents are not deleted until indexing occurs takes over. This will usually
be
-processbif option.
By using
files locally at all. Files are downloaded only when indexing actually occurs.
See also -noindex.
-nodupdetect
Type: Web crawling only.
Disables checksum-based detection of duplicates when indexing Web sites.
URL-based duplicate detection is still performed.
-maxnumdoc num_docs
. If it does, the entire block of documents must be processed.
-mimemap path_and_filename
or
-noindex
has been reached, the cached files are indexed and then deleted. If you
, the bulk insert file is submitted but not processed by Verity Spider, and
-noindex
or
, or you can subsequently use Verity Spider again with the
mkvdk
collsvc
in conjunction with
-nocache
falls within a block of documents dictated by
num_docs
mime-type
application/word
, this option disables the caching of files during
-nosubmit
-noindex
Chapter 8 Verity Spider
or
, you avoid storing
-nosubmit
Need help?
Do you have a question about the COLDFUSION 5-ADVANCED ADMINISTRATION and is the answer not in the manual?