Re: [Q]: Intranet tools




first of all, i leave the original poster's email address in the from
line so that his/her mailer can flag this as a letter sent directly to
them; that way they can skip all the list's noise and jump straight to
their personal mail ;0

now, to the letter itself:

On 13 Jan 1998, Alexander L. Belikoff wrote:

> Actually you've raised another question: how one would implement a
> global Intranet search facility? There are many issues about this and
> the most important is that documents don't share the same format. You
> may have: manpages (roff), mailing list archives (mailfolder), web
> pages (HTML), raw text documents, PostScript, PDF, DVI, TeX etc. Add
> the fact that documents above may be also compressed (let's assume
> gzip compression only) and you'll get the picture.

you've just described the perfect grounds for using harvest. check out
harvest.cs.colorado.edu for more info on their indexing and searching
facility.

it cannot do the all-to-html conversion, but it can do the automatic
indexing and searching (though not searching by categories).
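
to make the mixed-format problem a bit more concrete, here is a rough
sketch in python of the first step any such indexer has to take: undo
gzip if it is there, then guess the document type from the first bytes so
the right text extractor can be picked. this is not harvest's code or its
api; the names maybe_gunzip and guess_format are made up for illustration,
and the heuristics are deliberately crude assumptions.

```python
import gzip
import re

GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of every gzip stream


def maybe_gunzip(data: bytes) -> bytes:
    """Return decompressed bytes if data looks like gzip, else data unchanged."""
    if data[:2] == GZIP_MAGIC:
        return gzip.decompress(data)
    return data


def guess_format(data: bytes) -> str:
    """Guess a document type from its leading bytes (hypothetical heuristics)."""
    head = data[:1024].lstrip()
    if head.startswith(b"%PDF-"):
        return "pdf"
    if head.startswith(b"%!PS"):
        return "postscript"
    if head.startswith(b"\xf7\x02"):  # DVI preamble opcode plus format id
        return "dvi"
    if head.startswith(b"From "):  # mbox-style separator line
        return "mailfolder"
    lower = head.lower()
    if lower.startswith(b"<!doctype html") or b"<html" in lower:
        return "html"
    if re.search(rb"^\.(TH|SH|Dd)\b", head, re.M):  # common man page macros
        return "roff"
    if b"\\documentclass" in head or b"\\documentstyle" in head:
        return "tex"
    return "text"  # fall back to treating it as raw text
```

you would call it as guess_format(maybe_gunzip(raw_bytes)) and then hand
the document to whatever extractor matches the returned label.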

you could work out the categories section using a different tool (that
maybe someone else can recommend...).

guy