http://www.htdig.org/
The ht://Dig system is a complete world wide web indexing and searching
system for a small domain or intranet. This system is not meant to replace
the need for powerful internet-wide search systems like Lycos, Infoseek,
Webcrawler and AltaVista. Instead it is meant to cover the search needs for
a single company, campus, or even a particular sub section of a web site. As
opposed to some WAIS-based or web-server based search engines, ht://Dig can
span several web servers at a site. The type of these different web servers
doesn't matter as long as they understand the HTTP 1.0 protocol.
ht://Dig is also used by KDE to search KDE's HTML documentation.
ht://Dig was developed at San Diego State University as a way to search the
various web servers on the campus network.
Version: 3.2.0
General Commands | |
htdig | retrieve HTML documents for ht://Dig search engine |
htdig-pdfparser | parse a PDF document (wrapper script for htdig) |
htdump | write out an ASCII-text version of the document database |
htfuzzy | fuzzy command-line search utility for the ht://Dig search engine |
htload | reads in an ASCII-text version of the document database |
htmerge | create document index and word database for the ht://Dig search engine |
htnotify | sends email notifications about out-dated web pages discovered by htmerge |
htpurge | remove unused odocuments from the database (general maintenance script) |
htsearch | create document index and word database for the ht://Dig search engine |
htstat | returns statistics on the document and word databases, much like the -s option to htdig or htmerge. |
rundig | sample script to create a search database for ht://Dig |
System Administration | |
htdigconfig | script to create fuzzy databases for ht://Dig |