DataparkSearch Engine 4.28 reference manual
The Web searching software
Copyright © 2003-2005 by Datapark corp.
Copyright © 2001-2003 by Lavtech.com corp.
Table of Contents
1. Introduction
DataparkSearch Features
Where to get DataparkSearch
Disclaimer
Authors
Contributors
2. Installation
SQL database requirements
Supported operating systems
Tools required for installation
Installing DataparkSearch
Possible installation problems
Installation registration
3. Indexing
Indexing in general
Configuration
Running indexer
How to create SQL table structure
How to drop SQL table structure
Subsection control
How to clear database
Database Statistics
Link validation
Parallel indexing
Supported HTTP response codes
Content-Encoding support
indexer configuration
Specifying WEB space to be indexed
Aliases
ServerTable
FlushServerTable
External parsers
Other commands used in indexer.conf
Extended indexing features
Indexing SQL database tables (htdb: virtual URL scheme)
Indexing binaries output (exec: and cgi: virtual URL schemes)
Mirroring
Using syslog
Storing compressed document copies
Configure stored
How stored works
Using stored during search
4. DataparkSearch HTML parser
Tag parser
Special characters
META tags
Links
Comments
5. Storing data
SQL storage types
General storage information
Various modes of words storage
Storage mode - single
Storage mode - multi
Storage mode - crc
Storage mode - crc-multi
Storage mode - cache
SQL structure notes
Additional features of non-CRC storage modes
Cache mode storage
Introduction
Cache mode word indexes structure
Cache mode tools
Starting cache mode
Optional usage of several splitters
Using run-splitter script
Doing search
Using search limits
DataparkSearch performance issues
searchd usage recommendation
Memory based filesystem (mfs) usage recommendation
MySQL performance
Post-indexing optimization
SearchD support
Why use searchd
Starting searchd
Oracle notes
Compilation, Installation and Configuration
6. Subsections
Tags
Tags in SQL version
Categories
7. Languages support
Character sets
Supported character sets
Character sets aliases
Recoding
Recoding at search time
Document charset detection
Automatic charset guesser
Default charset
Default Language
Recoding during search
Making multi-language search pages
How does it work?
Possible troubles
Segmenters for Chinese, Japanese, Korean and Thai languages
Japanese language phrase segmenter
Chinese language phrase segmenter
Thai language phrase segmenter
Korean language phrase segmenter
Multilingual servers support
8. Searching documents
Using search front-ends
Performing search
Search parameters
Changing different document parts weights at search time
Using front-end with an shtml page
Using several templates
Advanced boolean search
How search handles expired documents
mod_dpsearch module for Apache httpd
Why use mod_dpsearch
Configuring mod_dpsearch
How to write search result templates
Template sections
Variables section
Includes in templates
Conditional template operators
Security issues
Designing search.html
How the results page is created
Your HTML
Forms considerations
Relative links in search.htm
Adding Search form to other pages
Relevancy
Ordering documents
Relevancy calculation
Popularity rank
Boolean search
Crosswords
Search queries tracking
Search results cache
Fuzzy search
Ispell
Synonyms
Accent insensitive search
9. Miscellaneous
Reporting bugs
Core dump reports
Using libdpsearch library
dps-config script
DataparkSearch API
Database schema
Donations
Index
List of Tables
3-1. Verbose levels
5-1. Cache limit types
7-1. Language groups
7-2. Charsets aliases
8-1. Available search parameters
9-1. server table schema
9-2. Values of several server parameters in the srvinfo table