DRAFT of specification if Indexing By words for strings.

Ruslan Zasukhin sunshine at public.kherson.ua
Wed Sep 22 23:10:12 CDT 2004


On 9/22/04 10:32 PM, "Erik Mueller-Harder"
<valentina-list at vermontsoftworks.com> wrote:

Hi Erik,

> On Sep 22, 2004, at 10:41, Ruslan Zasukhin wrote:
> 
>> Btw, guys, if somebody have need in some other functions,
>> Please tell us.
>> 
>> We already have made many new.
>> 
>> We need ASAP make at list short list of them in 2.0
>> So you can check it.
> 
> Have you been thinking about word boundaries?  In 1.x, we have no
> control over what code points signify word boundaries, and I find that
> IndexByWords often breaks strings into more "words" than I would wish.
> 
> Alternatively, I suppose, -- or by default -- I think there are Unicode
> libraries that describe word breaks....
> 
> Still, it would be nice to be able to override these somehow for
> IndexByWords.

IBM ICU have class BreakIterator

It make sure that for specified language we will get the correct words.


-- 
Best regards,
Ruslan Zasukhin      [ I feel the need...the need for speed ]
-------------------------------------------------------------
e-mail: ruslan at paradigmasoft.com
web: http://www.paradigmasoft.com

To subscribe to the Valentina mail list go to:
http://lists.macserve.net/mailman/listinfo/valentina
-------------------------------------------------------------



More information about the Valentina-beta mailing list