The ht://Dig system is a complete World Wide Web indexing and searching
system for a small domain or intranet. It is not meant to replace
powerful internet-wide search systems like Lycos, Infoseek,
Webcrawler and AltaVista. Instead, it is meant to cover the search needs of
a single company, campus, or even a particular subsection of a web site.
Unlike some WAIS-based or web-server-based search engines, ht://Dig can
span several web servers at a site. The type of web server doesn't
matter, as long as each one understands the HTTP/1.0 protocol.
ht://Dig is also used by KDE to search KDE's HTML documentation.
ht://Dig was developed at San Diego State University as a way to search the
various web servers on the campus network.
|htdig||retrieve HTML documents for ht://Dig search engine|
|htdig-pdfparser||parse a PDF document (wrapper script for htdig)|
|htdump||write out an ASCII-text version of the document database|
|htfuzzy||fuzzy command-line search utility for the ht://Dig search engine|
|htload||read in an ASCII-text version of the document database|
|htmerge||create document index and word database for the ht://Dig search engine|
|htnotify||send email notifications about outdated web pages discovered by htmerge|
|htpurge||remove unused documents from the database (general maintenance script)|
|htsearch||search the document index and word database of the ht://Dig search engine (CGI program)|
|htstat||returns statistics on the document and word databases, much like the -s option...|
|rundig||sample script to create a search database for ht://Dig|
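
The tools listed above are normally run in a fixed order: htdig retrieves the documents first, then htmerge builds the index and word database from them. The Python sketch below only assembles those two command lines and prints them; it runs nothing. It uses the `-i` (start from scratch) and `-c` (configuration file) options of htdig and htmerge. The helper name `build_pipeline` and the configuration path are hypothetical and chosen purely for illustration. The rundig script shipped with a given installation may run further steps.

```python
import shlex


def build_pipeline(config_path):
    """Return the indexing commands in the order they would run (dry run only)."""
    return [
        # Retrieve documents, discarding any existing databases (-i).
        ["htdig", "-i", "-c", config_path],
        # Build the document index and word database from what htdig fetched.
        ["htmerge", "-c", config_path],
    ]


if __name__ == "__main__":
    # Hypothetical configuration path; substitute the one used at your site.
    for cmd in build_pipeline("/path/to/htdig.conf"):
        print(shlex.join(cmd))
```

Printing the commands rather than executing them makes it possible to check the sequence before running it against a live site.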