http://www.htdig.org
--------------------
1. Download and unpack:
> cd /root/archive
> wget http://www.htdig.org/files/htdig-3.2.0b5.tar.gz
> tar -xvzf htdig-3.2.0b5.tar.gz
2. Configure and compile:
> cd htdig-3.2.0b5
> SITE=/home/httpd/{SITE_NAME}; ./configure --prefix=/usr/local \
--with-config-dir=$SITE/conf \
--with-search-dir=$SITE/www/htdig \
--with-database-dir=$SITE/htdig \
--with-cgi-bin-dir=$SITE/cgi-bin \
--with-image-dir=$SITE/www/htdig/images \
--with-image-url-prefix=/htdig/images
> make
> make install
3. Edit config file:
Edit /home/httpd/{SITE_NAME}/conf/htdig.conf and set "start_url" to the
URL(s) to index and "maintainer" to the e-mail address of the person
responsible for the index.
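To confirm that both attributes are set before indexing, the config file
can be checked from the shell. This is a minimal sketch: the function name
show_htdig_settings is hypothetical, and it assumes each attribute is
written on a single line in the usual "name: value" form.

```shell
# Hypothetical helper: print the start_url and maintainer lines of an
# ht://Dig config file so they can be checked by eye.
# Assumes each attribute is written on one line as "name: value".
show_htdig_settings() {
    grep -E '^[[:space:]]*(start_url|maintainer)[[:space:]]*:' "$1"
}
```

For example, "show_htdig_settings /home/httpd/{SITE_NAME}/conf/htdig.conf"
should print two lines; if it prints fewer, one of the attributes is missing
or commented out.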
4. Build database:
> rundig
5. Set up a cron job to reindex the database nightly:
Create a file at /etc/cron.daily/htdig with the following:
#!/bin/sh
/usr/local/bin/rundig
Make the file executable:
> chmod 755 /etc/cron.daily/htdig
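If a record of each nightly run is wanted, the cron script can be made a
little more defensive. The following is only a sketch under assumptions: the
log path /var/log/htdig-rundig.log and the function name reindex are
illustrative and not part of ht://Dig; the rundig path is the one used above.

```shell
#!/bin/sh
# Sketch of a more defensive /etc/cron.daily/htdig.
# LOG is an illustrative default (assumption); RUNDIG matches the path above.
RUNDIG=${RUNDIG:-/usr/local/bin/rundig}
LOG=${LOG:-/var/log/htdig-rundig.log}

reindex() {
    # Fail with a message if rundig is not installed where expected.
    [ -x "$RUNDIG" ] || { echo "rundig not found at $RUNDIG" >&2; return 1; }
    # Append a timestamp and all rundig output to the log.
    { date; "$RUNDIG"; } >> "$LOG" 2>&1
}
```

To use it as the cron script, end the file with a single "reindex" line.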
6. Test:
Open a browser and go to:
http://{SITE_NAME}/htdig/search.html
Try searching to verify that the content has been indexed.
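The same check can be scripted from the command line. This sketch assumes
that curl is installed and that the web server maps the cgi-bin directory
configured above to /cgi-bin/, so that the search CGI is reachable at
/cgi-bin/htsearch; the function names are hypothetical, and only a single
search word without special characters is handled.

```shell
# Build the htsearch query URL for a host and one search word.
# Assumption: the CGI is served at /cgi-bin/htsearch on that host.
htsearch_url() {
    printf 'http://%s/cgi-bin/htsearch?words=%s\n' "$1" "$2"
}

# Fetch the results page for a word; curl -f makes HTTP errors
# produce a non-zero exit status.
check_search() {
    curl -fsS "$(htsearch_url "$1" "$2")"
}
```

Running "check_search {SITE_NAME} someword" should print an HTML results
page; an error from curl suggests the CGI or its URL mapping is not set up.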