DRAFT of specification of Indexing By Words for strings.
Ruslan Zasukhin
sunshine at public.kherson.ua
Wed Sep 22 23:10:12 CDT 2004
On 9/22/04 10:32 PM, "Erik Mueller-Harder"
<valentina-list at vermontsoftworks.com> wrote:
Hi Erik,
> On Sep 22, 2004, at 10:41, Ruslan Zasukhin wrote:
>
>> Btw, guys, if anybody needs some other functions,
>> please tell us.
>>
>> We have already made many new ones.
>>
>> We need to make at least a short list of them in 2.0 ASAP,
>> so you can check it.
>
> Have you been thinking about word boundaries? In 1.x, we have no
> control over what code points signify word boundaries, and I find that
> IndexByWords often breaks strings into more "words" than I would wish.
>
> Alternatively, I suppose, -- or by default -- I think there are Unicode
> libraries that describe word breaks....
>
> Still, it would be nice to be able to override these somehow for
> IndexByWords.
IBM ICU has a BreakIterator class.
It ensures that, for a specified language, we get the correct words.
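
A rough sketch of how a BreakIterator splits a string into words. It uses the JDK's java.text.BreakIterator as a stand-in, because it runs without extra libraries; ICU4J's com.ibm.icu.text.BreakIterator offers the same getWordInstance/setText/first/next calls. The class name WordBreakSketch and the helper words() are hypothetical, and dropping segments that contain no letter or digit is only an assumption about what an index by words would keep. It is not how Valentina does it.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class WordBreakSketch {

    // Hypothetical helper: split text at the word boundaries that the
    // BreakIterator reports for the given locale.
    static List<String> words(String text, Locale locale) {
        BreakIterator it = BreakIterator.getWordInstance(locale);
        it.setText(text);

        List<String> out = new ArrayList<>();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String segment = text.substring(start, end);
            // Assumption: an index wants only segments with at least one
            // letter or digit, so spaces and punctuation are skipped.
            if (segment.codePoints().anyMatch(Character::isLetterOrDigit)) {
                out.add(segment);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(words("Index by words, please.", Locale.ENGLISH));
    }
}
```

The boundaries come from the iterator's rules for the locale, not from a fixed list of separator characters, so the caller does not have to decide which code points end a word.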
--
Best regards,
Ruslan Zasukhin [ I feel the need...the need for speed ]
-------------------------------------------------------------
e-mail: ruslan at paradigmasoft.com
web: http://www.paradigmasoft.com
To subscribe to the Valentina mail list go to:
http://lists.macserve.net/mailman/listinfo/valentina
-------------------------------------------------------------