168 lines
17 KiB
HTML
168 lines
17 KiB
HTML
<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML//EN">
|
|
|
|
<html>
|
|
|
|
<head>
|
|
<title>Microsoft Index Server Guide: Frequently Asked Questions</title>
|
|
<meta name="FORMATTER" content="Microsoft FrontPage 1.1">
|
|
<meta name="GENERATOR" content="Microsoft FrontPage 1.1">
|
|
</head>
|
|
|
|
<body bgcolor="#FFFFFF">
|
|
<!--Headerbegin--><p align=center><a name="TOP"><img src="onepix.gif" alt="Space" align=middle width=1 height=1></a> <a href="default.htm#Top"><img src="toc.gif" alt=" Contents" align=middle border=0 width=89 height=31></a> <a href="glossary.htm"><img src="previous.gif" alt="Previous" align=middle border=0 width=32 height=31></a> <a href="license.htm"><img src="next.gif" alt="Next" align=middle border=0 width=32 height=31></a> </p>
|
|
<hr>
|
|
<!--Headerend--><p><a name="FrequentlyAskedQuestions"><font size=6><strong>Frequently Asked Questions</strong></font></a></p>
|
|
<p><!--Chaptoc--></p>
|
|
<blockquote>
|
|
<p><a href="faq.htm#IissuedaqueryanditdidntmatchadocumentthatIknowexistsWhy"><font size=3>I issued a query, and it didn’t match a document that I <em>know</em> exists. Why?</font></a> <br>
|
|
<a href="faq.htm#WhydoesmySERVER_NAMEshowupasanIPaddress"><font color="#000000"><font size=3>Why does my SERVER_NAME show up as an IP address?</font></font></a> <br>
|
|
<a href="faq.htm#SometimestheFilestobefilteredcounterisgreaterthantheTotaldocumentscounter"><font color="#000000"><font size=3>Sometimes the <em>Files to be filtered</em> counter is greater than the <em>Total # documents</em> counter. How is that
|
|
possible?</font></font></a> <br>
|
|
<a href="faq.htm#Ihavealargenumberofdocumentsinmycorpusandapowerfulcomputer"><font color="#000000"><font size=3>How do I handle a large number of documents in my corpus when I have a powerful computer?</font></font></a><br>
|
|
<a href="faq.htm#Therearealotofventlogmessages"><font color="#000000"><font size=3>There are a lot of event log messages. Can I turn them off?</font></font></a> <br>
|
|
<a href="faq.htm#IgettheeventVerylowdiskspaceevenafterfreeingupdiskspace"><font color="#000000"><font size=3>Why do I get the event <i>Very low disk space</i>… even after freeing up disk space?</font></font></a> <br>
|
|
<a href="faq.htm#HowmuchdiskspaceshouldIallocateforIndexServerdata"><font color="#000000"><font size=3>How much disk space should I allocate for Index Server data?</font></font></a> <br>
|
|
<a href="faq.htm#MycomputerisidlebuttheFilestobefilteredcounterisnotzero.Whatisthereason"><font color="#000000"><font size=3>My computer is idle but the <em>Files to be filtered</em> counter is not zero. What is the reason?</font></font></a> <br>
|
|
<a href="faq.htm#Whyareallmyfloatingpointnumbersformattedwiththesamenumberofdecimalplaces"><font color="#000000"><font size=3>Why are all my floating point numbers formatted with the same number of decimal places?</font></font></a> <br>
|
|
<a href="faq.htm#WhydounreadablefilesinMicrosoftFrontPage_vtidirectoriesshowupinmyqueryresults"><font color="#000000"><font size=3>Why do unreadable files in Microsoft® FrontPage™ _vti directories show up in my query results?</font></font></a> <br>
|
|
<a href="faq.htm#HowcanIexcludecertaindirectoriesfromsearchresults"><font color="#000000"><font size=3>How can I exclude certain directories from search results?</font></font></a> <br>
|
|
<a href="faq.htm#HowcanIindexremoteUNCshares"><font color="#000000"><font size=3>How can I index remote UNC shares?</font></font></a> <br>
|
|
<a href="#IndexRemoteVirtualRoots"><font size=3>Why doesn’t Index Server index remote virtual roots?</font></a><br>
|
|
</p>
|
|
</blockquote>
|
|
<hr>
|
|
<!--ChaptocEnd--><h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="IissuedaqueryanditdidntmatchadocumentthatIknowexistsWhy"><font size=3><strong>I issued a query, and it didn’t match a document that I </strong><em><strong>know</strong></em><strong> exists. Why</strong></font></a><font size=3><strong>?</strong></font></h1>
|
|
<ul>
|
|
<li><p align=left><em>Different languages were used to filter the page and issue the query.</em> The split of a page into words (the
|
|
fundamental unit stored in the index) is a language-dependent process. When a query is issued from one language and
|
|
the document was filtered in another, query results can be unpredictable. The locale used to filter a document is
|
|
dependent on the filter. Some file formats, such as Microsoft Word, mark documents with a language; this mark is used
|
|
during filtering. Other formats, such as plain text, contain no language specifier. Most filters default to the system locale
|
|
for these files. The locale for the query is specified by use of the <strong>CiLocale</strong> variable. If <strong>CiLocale</strong> is not specified, the
|
|
locale of the browser is used (if available) or the default locale of the server. </p>
|
|
</li>
|
|
<li><p align=left><em>The page has not yet been filtered.</em> There is often a delay between the time a page is modified and the time it is
|
|
indexed. You can use the Performance Monitor or an administrative query to determine the number of pending pages.
|
|
The <strong>CiOutOfDate</strong> variable is set to TRUE if one or more files remain to be filtered. </p>
|
|
</li>
|
|
<li><p align=left><em>An error occurred during filtering of this document.</em> There are a number of reasons why a document may not have
|
|
been filtered. See the <a href="filtrhlp.htm">filtering</a> page for a detailed discussion of filtering. You can check for errors either in the event log
|
|
or by issuing the a query over the “unfiltered property,” using the <a href="adminhlp.htm#UnfilteredFiles">administrative query</a> for unfiltered files. </p>
|
|
</li>
|
|
<li><p align=left><em>The query was too complex. </em>Certain classes of resource intensive queries are turned off when <strong>CiForceUseCi</strong> is set to
|
|
TRUE. When results are missing for this reason, the <strong>CiQueryIncomplete</strong> variable is set to TRUE. </p>
|
|
</li>
|
|
<li><p align=left><em>The query took too much CPU time.</em> An administrator can limit the amount of CPU time a single query can consume.
|
|
When this limit is reached, the <strong>CiQueryTimedOut</strong> variable is set to TRUE. </p>
|
|
</li>
|
|
<li><em>You do not have permission to read the page.</em> A query result will not be returned for a page that you do not have
|
|
permission to read. See <a href="sechelp.htm">Security</a> for a detailed discussion of security as it relates to Index Server.</li>
|
|
</ul>
|
|
<hr>
|
|
<h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="WhydoesmySERVER_NAMEshowupasanIPaddress"><font color="#000000"><font size=3><strong>Why does my SERVER_NAME show up as an IP address</strong></font></font></a><font color="#000000"><font size=3><strong>?</strong></font></font></h1>
|
|
<ul>
|
|
<li><p align=left><font color="#000000"><em>The browser being used does not set the Host: HTTP header.</em> The <strong>SERVER_NAME CGI</strong> variable is set from the
|
|
Host: HTTP header, but only if this is set by the browser. (Note that this header is not set by Internet Explorer 2.0.)</font> </p>
|
|
</li>
|
|
<li><font color="#000000"><em>The IIS parameter ReturnUrlUsingHostName is not set.</em> Set the registry parameter at
|
|
W3SVC\Parameters\ReturnURLUsingHostName to allow using the fully qualified domain name of the local host. See
|
|
the IIS documentation about the ReturnURLUsingHostName registry parameter.</font></li>
|
|
</ul>
|
|
<hr>
|
|
<h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="SometimestheFilestobefilteredcounterisgreaterthantheTotaldocumentscounter"><font color="#000000"><font size=3><strong>Sometimes the </strong><em><strong>Files to be filtered</strong></em><strong> counter is greater than the </strong><em><strong>Total # documents</strong></em><strong> counter. How is that possible</strong></font></font></a><font color="#000000"><font size=3><strong>?</strong></font></font></h1>
|
|
<ul>
|
|
<li><font color="#000000"><em>Some files have been modified more than once before being filtered. </em>If a file is modified multiple times, there may
|
|
be multiple entries for the same file. The <strong>Files to be filtered </strong>counter only specifies how big the change queue is.</font></li>
|
|
</ul>
|
|
<hr>
|
|
<h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="Ihavealargenumberofdocumentsinmycorpusandapowerfulcomputer"><font color="#000000"><font size=3><strong>How do I handle a large number of documents in my corpus when I have a powerful computer</strong></font></font></a><font color="#000000"><font size=3><strong>?</strong></font></font></h1>
|
|
<ul>
|
|
<li>If there is sufficient RAM (more than 48MB) and the <a href="reghelp.htm#MaxFreshCount">MaxFreshCount</a> is 5000 or less, try increasing the
|
|
MaxFreshCount to 10,000 or 15,000. If there is a lot of disk thrashing, adding more RAM might help.</li>
|
|
</ul>
|
|
<hr>
|
|
<h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="Therearealotofventlogmessages"><font color="#000000"><font size=3><strong>There are a lot of event log messages. Can I turn them off</strong></font></font></a><font color="#000000"><font size=3><strong>?</strong></font></font></h1>
|
|
<ul>
|
|
<li><font color="#000000"><em>Some of the verbose events can be turned off. </em>Please refer to the </font><a href="reghelp.htm#EventLogFlagsMasks"><font color="#000000">“EventLogFlags”</font></a><font color="#000000"> section in the “Registry
|
|
Parameters.”</font></li>
|
|
</ul>
|
|
<hr>
|
|
<h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="IgettheeventVerylowdiskspaceevenafterfreeingupdiskspace"><font color="#000000"><font size=3><strong>Why do I get the event </strong></font></font></a><a href="errorhlp.htm#LowDiskEvent"><font color="#000000"><font size=3><em><strong>Very low disk space</strong></em><strong>....</strong></font></font></a><font color="#000000"><font size=3><strong> even after freeing up disk space?</strong></font></font></h1>
|
|
<ul>
|
|
<li><font color="#000000"><em>If this event is logged during a MasterMerge, freeing up additional space will help. </em>During a master merge,
|
|
additional free disk space (up to 15% of the size of all the documents that are being indexed) may be required. After the
|
|
master merge completes, disk space is</font> usually freed up.</li>
|
|
</ul>
|
|
<hr>
|
|
<h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="HowmuchdiskspaceshouldIallocateforIndexServerdata"><font color="#000000"><font size=3><strong>How much disk space should I allocate for Index Server data</strong></font></font></a><font color="#000000"><font size=3><strong>?</strong></font></font></h1>
|
|
<ul>
|
|
<li><font color="#000000"><em>The minimum disk space allocated should be at least 30% of the size of your corpus. </em>If you increase the registry
|
|
value of </font><a href="reghelp.htm#MaxCharacterization"><font color="#000000">MaxCharacterization</font></a><font color="#000000"> , then more space should be allocated. Temporarily during a </font><a href="indexhlp.htm#MasterMerge"><font color="#000000">master merge</font></a><font color="#000000">, Index Server
|
|
may need up to 45% of the corpus size.</font></li>
|
|
</ul>
|
|
<hr>
|
|
<h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="MycomputerisidlebuttheFilestobefilteredcounterisnotzero.Whatisthereason"><font color="#000000"><font size=3><strong>My computer is idle but the </strong><em><strong>Files to be filtered</strong></em> <strong>counter is not zero. What is the reason</strong></font></font></a><font color="#000000"><font size=3><strong>?</strong></font></font></h1>
|
|
<ul>
|
|
<li><font color="#000000"><em>Some files failed to filter. </em>If a file is in use by another user process when the </font><a href="filtrhlp.htm#CiDaemon"><font color="#000000">CiDaemon</font></a><font color="#000000"> process tries to filter its
|
|
contents, the file is moved to a lower-priority queue and will be retried for filtering at a later time. The interval between
|
|
the retries is controlled by the registry parameter </font><a href="reghelp.htm#FilterRetryInterval"><font color="#000000">FilterRetryInterval</font></a><font color="#000000">. This typically happens with applications such as
|
|
Microsoft Word that keep the file open for exclusive access and do “fast saves.”</font></li>
|
|
</ul>
|
|
<hr>
|
|
<h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="Whyareallmyfloatingpointnumbersformattedwiththesamenumberofdecimalplaces"><font color="#000000"><font size=3><strong>Why are all my floating point numbers formatted with the same number of decimal places</strong></font></font></a><font color="#000000"><font size=3><strong>?</strong></font></font></h1>
|
|
<ul>
|
|
<li><p align=left><font color="#000000"><em>Index Server uses the settings specified in the Regional Settings application of Control Panel.</em> There is no way in
|
|
this version of Index Server to format individual floating point properties. Users from different locales will see a view of
|
|
the data specific to their locale. U.S. users may see a period used as a decimal point, and French users would see a
|
|
comma, for example. Likewise date, time, and currency values are formatted according to the parameters in Regional
|
|
Settings.</font>  </p>
|
|
</li>
|
|
</ul>
|
|
<hr>
|
|
<h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="WhydounreadablefilesinMicrosoftFrontPage_vtidirectoriesshowupinmyqueryresults"><font color="#000000"><font size=3><strong>Why do unreadable files in Microsoft® FrontPage™ _vti directories show up in my query results</strong></font></font></a><font color="#000000"><font size=3><strong>?</strong></font></font></h1>
|
|
<ul>
|
|
<li><p align=left><font color="#000000"><em>This is a limitation of scoping in Microsoft Index Server. </em>It is not possible to exclude an unreadable root that is
|
|
physically located below a readable root. A workaround is to append to the user’s restriction. For example, FrontPage
|
|
roots can be removed by setting the <strong>CiRestriction</strong> in the .idq file to: <tt>CiRestriction=(%UserRestriction%) &!
|
|
#vpath *\_vti_*</tt>.</font>  </p>
|
|
</li>
|
|
</ul>
|
|
<hr>
|
|
<h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="HowcanIexcludecertaindirectoriesfromsearchresults"><font color="#000000"><font size=3><strong>How can I exclude certain directories from search results</strong></font></font></a><font color="#000000"><font size=3><strong>?</strong></font></font></h1>
|
|
<ul>
|
|
<li><p align=left><font color="#000000">Indexing and searching in Microsoft Index Server is based on virtual roots, and there is no explicit way of excluding
|
|
certain physical directories from being indexed. However, in your .idq files you can specify files to be prevented from
|
|
appearing in search results. For example, if the virtual root /Docs points to C:\Docs but you do not want results from
|
|
c:\Docs\Private to appear in the result set, you can modify the <strong>CiRestriction</strong> parameter in your .idq file as follows:<br>
|
|
<tt><br>
|
|
CiRestriction=%CiRestriction% AND NOT #path c:\docs\private*<br>
|
|
<br>
|
|
</tt>You can append more <strong>AND NOT</strong> #path <em>path</em>* clauses to the CiRestriction to exclude more directories.</font>  </p>
|
|
</li>
|
|
</ul>
|
|
<hr>
|
|
<h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="HowcanIindexremoteUNCshares"><font color="#000000"><font size=3><strong>How can I index remote UNC shares</strong></font></font></a><font color="#000000"><font size=3><strong>?</strong></font></font></h1>
|
|
<ul>
|
|
<li><p align=left><font color="#000000">If you add a virtual root pointing to the remote UNC share, it will automatically be indexed by Index Server. Make sure
|
|
that you specify the user ID (<em>domain\user</em>) and password correctly. Note the backslash (\) in the <em>domain\user</em> format
|
|
of the user ID.</font> </p>
|
|
</li>
|
|
</ul>
|
|
<hr>
|
|
<h1 align=left><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="IndexRemoteVirtualRoots"><font size=3>Why doesn’t Index Server index remote virtual roots</font></a><font size=3>?</font></h1>
|
|
<ul>
|
|
<li><p align=left>When specifying the logon ID for a remote virtual root, type both the domain name and the user name, separated by a
|
|
backslash (\): </p>
|
|
<p align=left><em>domain\username</em> </p>
|
|
<p align=left>If you do not give the domain name, Microsoft Index Server will not index the remote virtual roots. Note that the
|
|
domain name may actually be the name of the computer, if the account is local to that computer. </p>
|
|
</li>
|
|
</ul>
|
|
<!--Footerbegin--><hr>
|
|
<p align=center><a href="default.htm#Top"><img src="toc.gif" alt=" Contents" align=middle border=0 width=89 height=31></a> <a href="glossary.htm"><img src="previous.gif" alt="Previous" align=middle border=0 width=32 height=31></a> <a href="#TOP"><img src="up_end.gif" alt="To Top" align=middle border=0 width=32 height=31></a> <a href="license.htm"><img src="next.gif" alt="Next" align=middle border=0 width=32 height=31></a> </p>
|
|
<hr>
|
|
<p align=center><em>© 1996 by Microsoft Corporation. All rights reserved.<!--Footerend--></em></p>
|
|
</body>
|
|
|
|
</html>
|