Searching large amounts of text

Ruslan Zasukhin ruslan_zasukhin at valentina-db.com
Sun Oct 21 11:19:27 CDT 2012


On 10/18/12 6:26 PM, "Beatrix Willius" <bwillius at gmx.de> wrote:

Hi Beatrix,

> Since a while sorting and searching my database has become really fast.

> However, there is one area where searching is still too slow. And this is
> searching large amounts of text, which in my case can be GBs of data. But it's
> mainly one field for the database, typically in the range of a few KB to
> seldom larger than one MB.
> 
> I'm using index by words on one side and regex on the other side. Thinking
> about this: does this make sense??? Code is quite simple:
> 
> where Message.MessageBody REGEX '(?i)something'
> 
> Using something like a * for wildcard searching isn't acceptable in the times
> of Google finding everything with the part of a second.
 
> What are my options here? Does anyone have an acceptable solution? Can I use
> an additional software like Lucene together with Valentina? Would it make a
> good feature request to integrate Lucene? Is there something better available?

1) can you formulate task?
What is a key?  You need find words? In your texts?
I know you keep emails there.

So you have searches as word1 & word2 & word3 ?

Or we talk about searches CONTAINS?


2) I will take a look on Lucene now


-- 
Best regards,

Ruslan Zasukhin
VP Engineering and New Technology
Paradigma Software, Inc

Valentina - Joining Worlds of Information
http://www.paradigmasoft.com

[I feel the need: the need for speed]




More information about the Valentina mailing list