DragonFly On-Line Manual Pages
mifluzsearch(1) DragonFly General Commands Manual mifluzsearch(1)
NAME
mifluzsearch - search the content of an inverted index.
SYNOPSIS
mifluzsearch -f words [options]
DESCRIPTION
mifluzsearch searches a mifluz index for documents matching a Alt*Vista
expression (simple syntax). The result of the search is output on the
standard output.
SEARCH SYNTAX
The search syntax uses a + to indicate a mandatory word, a - to
indicated a forbiden word and double quotes (") to specify a literal.
OPTIONS
-v Increase verbosity.
-M config_file
use config_file instead of the default configuration file.
-B dbfile (default test)
use dbfile inverted index.
-f expr
search for documents that match expr.
-H print a HTML header before the results so that mifluzsearch can
be used as a cgi-bin.
-o word
return the number of occurencs of word in the inverted index.
-c number (default 10)
retrieve at most number documents.
-d number (default 0)
the index of the first retrieved document starts at number. The
number of documents retrieved after this document depends on the
-c option. If number is higher than the maximum document that
matches the query, it will be rounded to match the last chunk of
document.
-n only parse the search expression and print it.
-P proximity (default 1)
words are considered near to each other if they are at a maximum
distance of proximity words. If the number is negative the word
ordering does not matter.
-S return at most one match per server.
-l key searching will start at this position in the index. The key is
the ascii representation of a key.
-l key searching will end at this position in the index. The key is the
ascii representation of a key.
CONFIGURATION FILE
wordlist_wordkey_document
wordlist_wordkey_location
wordlist_wordkey_uniq
RESULTS
Here is a sample text result:
match: <UNDEF> <UNDEF> 0 5 <UNDEF> (world the proximity)
match: <UNDEF> <UNDEF> 0 21 <UNDEF> (world the proximity)
match: <UNDEF> <UNDEF> 0 51 <UNDEF> (world the proximity)
match: <UNDEF> <UNDEF> 0 81 <UNDEF> (world the proximity)
count: 100
time: 20
base: 0
words: the an world
Here is a sample xml result (provided the -x option was given):
<?xml version="1.0" encoding="ISO-8859-1" ?>
<searchresults>
<match><document>0 5 </document><info>world the proximity</info></match>
<match><document>0 21 </document><info>world the proximity</info></match>
<match><document>0 51 </document><info>world the proximity</info></match>
<match><document>0 81 </document><info>world the proximity</info></match>
<count>100</count>
<time>30</time>
<words>
<verbatim>the an world </verbatim>
<unaccent>the an world </unaccent>
</words>
<base>0</base>
</searchresults>
match is the document number that matches, as defined by
wordlist_wordkey_document.
info is a human readable information explaining why the document
matched.
count is an estimation of the total number of documents that match the
query.
time is the number of milliseconds that were necessary to process the
query.
words verbatim
is the list of accented words of the query.
words unaccent
is the list of unaccented words of the query.
base is the absolute index of the first document returned in the
total number of possible documents. For instance, if
mifluzsearch was called with -d 50 -c 10 to require ten
documents after the 50th document, the base will be 50. However,
if there are only 35 documents that match the query, the base
will be rounded to the 30 and only 5 documents will be returned.
CGI
Each option may be given in the QUERY_STRING environment variable so
that mifluzsearch can be used as a cgi-bin instead of a line command.
The options are processed in the order in which they appear in
QUERY_STRING.
QUERY_STRING='v=1&f=a+word&M=/etc/mifluz.conf' mifluzsearch
is strictly equivalent to
mifluzsearch -v -f 'a word' -M /etc/mifluz.conf
DEBUG
Debugging information interpretation. A cursor is open in the index for
every word and they are stored in a list. The list of cursors is always
processed in the same order, as a single link list. With -v, each block
is an individual action on behalf of the word shown on the first line.
The last line of the block is the conclusion of the action described in
the block. REDO means the same cursor must be examined again because
the conditions have changed. RESTART means we go back to the first
cursor in the list because it may not match the new conditions anymore.
NEXT means the cursor and all the cursors before it match the
conditions and we may proceed to the next cursor. ATEND means the
cursor cannot match the conditions because it is at the end of the
index.
ENVIRONMENT
QUERY_STRING options are extracted from QUERY_STRING in a CGI like
fashion.
MIFLUZ_CONFIG file name of configuration file read by WordContext(3).
Defaults to ~/.mifluz.
AUTHORS
Loic Dachary loic@gnu.org
The Ht://Dig group http://dev.htdig.org/
SEE ALSO
htdb_dump(1), htdb_stat(1), htdb_load(1), mifluzdump(1), mifluzload(1),
mifluzdict(1), WordContext(3), WordList(3), WordDict(3),
WordListOne(3), WordKey(3), WordKeyInfo(3), WordType(3), WordDBInfo(3),
WordRecordInfo(3), WordRecord(3), WordReference(3), WordCursor(3),
WordCursorOne(3), WordMonitor(3), Configuration(3), mifluz(3)
local mifluzsearch(1)