Jens Krämer

RDig 0.3.0

 |  search, rdig, ferret, ruby

In addition to crawling web sites, RDig now can index local documents. Just give it one or more file:/ URLs pointing to the directories to index, optionally define some filename inclusion/exclusion patterns and there you go.

Document locations can be rewritten to ease linking to them in a web based search frontend. To rewrite all file:/base/* URIs to http://www.mydomain.com/virtual_dir/, you say

cfg.index.rewrite_uri = lambda do |uri| uri.path.gsub!(/^\/base\//, '/virtual_dir/') uri.scheme = 'http' uri.host = 'www.mydomain.com' end

in your RDig config file.

Also there’s a new feature for PDF content extraction: titles are now extracted from PDF meta data with the help of the pdfinfo utility.

Have fun!

Comments

You can use Markdown here.

For the sake of spam checking any data you submit, including your IP address, will be transferred to the US based Akismet web service (akismet.com). If that's not acceptable for you, you can also reach me by other means.