Package htdig

ht://Dig - Web search engine

http://www.htdig.org/

The ht://Dig system is a complete world wide web indexing and searching
system for a small domain or intranet. This system is not meant to replace
the need for powerful internet-wide search systems like Lycos, Infoseek,
Webcrawler and AltaVista. Instead it is meant to cover the search needs for
a single company, campus, or even a particular sub section of a web site. As
opposed to some WAIS-based or web-server based search engines, ht://Dig can
span several web servers at a site. The type of these different web servers
doesn't matter as long as they understand the HTTP 1.0 protocol.
ht://Dig is also used by KDE to search KDE's HTML documentation.

ht://Dig was developed at San Diego State University as a way to search the
various web servers on the campus network.

General Commands
Command Description
htdig retrieve HTML documents for ht://Dig search engine
htdig-pdfparser parse a PDF document (wrapper script for htdig)
htdump write out an ASCII-text version of the document database
htfuzzy fuzzy command-line search utility for the ht://Dig search engine
htload reads in an ASCII-text version of the document database
htmerge create document index and word database for the ht://Dig search engine
htnotify sends email notifications about out-dated web pages discovered by htmerge
htpurge remove unused odocuments from the database (general maintenance script)
htsearch create document index and word database for the ht://Dig search engine
htstat returns statistics on the document and word databases, much like the -s option...
rundig sample script to create a search database for ht://Dig
System Administration
Command Description
htdigconfig script to create fuzzy databases for ht://Dig