ht://Dig

Not Rated
Description
The ht://Dig system is a complete World Wide Web indexing and searching system for a small domain or intranet. This system is not meant to replace the need for powerful internet-wide search systems like Lycos, Google, or Yahoo!. Instead it is meant to cover the search needs of a single company, campus, or even a particular subsection of a website.

As opposed to some WAIS-based or web-server based search engines, ht://Dig can span several web servers at a site. The type of these different web servers doesn't matter as long as they understand the HTTP 1.0 protocol.

Features:
* Intranet searching
* It is free
* Robot exclusion is supported
* Boolean expression searching
* Configurable search results
* Fuzzy searching (different algorithms supported)
* Searching of HTML and text files
* Keywords can be added to HTML documents
* Email notification of expired documents
* A Protected server can be indexed
* Searches on subsections of the database
* Full source code included
* The depth of the search can be limited
* Full support for the ISO-Latin-1 character set

Please note that ht://Dig is a resource-hog, with respect to processor usage, when indexing.

Disk space requirements:

13.000 documents indexed: 150MB disk space with a 'wordlist database'
93MB disk space without a 'wordlist'

Multiplying the number of documents to index by 12.000 comes pretty close to the real disk space used.
Interface: Command Line
Associated Programs
htdig-doc Documentation for the htdig package
Perl Larry Wall's Practical Extraction and Report Language
wwwoffle World Wide Web OFFline Explorer
Available deb Repositories (how-to add a respository)
Debian 32-bit 64-bit
sarge 1:3.1.6-11 1:3.1.6-11
etch 1:3.2.0b6-3.1etch1 1:3.2.0b6-3.1etch1
sid 1:3.2.0b6-6 1:3.2.0b6-6

Ubuntu 32-bit 64-bit
dapper 1:3.1.6-11.1ubuntu1 1:3.1.6-11.1ubuntu1
edgy 1:3.2.0b6-1 1:3.2.0b6-1
edgy-updates 1:3.2.0b6-1ubuntu0.1 1:3.2.0b6-1ubuntu0.1
feisty 1:3.2.0b6-3 1:3.2.0b6-3
gutsy 1:3.2.0b6-3.1 1:3.2.0b6-3.1
hardy 1:3.2.0b6-4 1:3.2.0b6-4

Available rpm Repositories


Rating: Not Rated (0 votes)


Login or Register to rate ht://Dig, add a Tag, or designate as an alternative to a Windows app



Upload Screenshots
Images must be in GIF, JPG, or PNG formats and can be no larger than 2 MB. Only one file can be uploaded at a time. A description can be included, but it is optional.
Desc:
File:
You must login or register to upload a screenshot.
Submit Web Links
Submit the title and link (including http://) to an article pertaining to ht://Dig and it will appear in the Web Links section of the right banner. Contact us here if an entry needs to be removed.
Title:
Link:
You must login or register to post links.