[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Hebrew Lynx: How much Hebrew and how much Lynx ?



On Tue, 9 Feb 1999, Matan Ziv-Av wrote:

>> I know that many .ac.il sites and some ISPs run Hebrew Lynx, and that it
>> must be here somewhere. Where ? Is it a PD job or not and does it run on
>> Linux ?
>
>I did not know that, and as far as I can tell the lynx used by bgu math
>and cs department is the standard one.

Huh ? Hebrew Lynx looks standard but it has all the goodies to display
Hebrew for Windows etc set up. afaik, no bidi. The menus are in English
but must not be.

>I did that, with little success: I could not find the place to apply the
>reversing, andso, I have them applied at three places, and it works
>only sometimes, i.e some strings are reversed, some are not, and some
>are changed when selected/un-selected.
>
>I believe the reversing should be done at the libWWW part, rather than
>at the lynx part, but I could not find the right place to do it.

;) The libWWW is probably harder to hack. You should find the HTML stream
and apply changes there. I do not claim to understand the whole Lynx
sources, or to have ever read them. BTW Lynx is a moving target (more than
pine f.ex).

>> One more thing: That button text thing is a pest. imho an algorythm must
>> be developed to vote for the right direction based on a few well-known
>> words.Am I way wrong ? 
>
>There might be a way to know if a given html document uses "visual" or
>"logical" encoding.

The only way I trust, is to look at a reasonable amount of words (like,
one page), and sieve this against a small list of often-used words, give
marks, and choose what wins. Even this is not perfect, but it should work
more or less. I have no stats but there are about 12 words per line in
English. The probability of a word having a meaning read backwards is less
than 1:6 and thus one line analyzed should determine is pretty well. I'd
look at more than one line because HTML has embedded comments titles and
other gory details that may be misleading (can u say Javascript ?). I've
done this before for other purposes (language recognition to select proper
display charset and font in DOS).

Peter