Searching large amounts of text
Ruslan Zasukhin
ruslan_zasukhin at valentina-db.com
Sun Oct 21 11:19:27 CDT 2012
On 10/18/12 6:26 PM, "Beatrix Willius" <bwillius at gmx.de> wrote:
Hi Beatrix,
> Since a while sorting and searching my database has become really fast.
> However, there is one area where searching is still too slow. And this is
> searching large amounts of text, which in my case can be GBs of data. But it's
> mainly one field for the database, typically in the range of a few KB to
> seldom larger than one MB.
>
> I'm using index by words on one side and regex on the other side. Thinking
> about this: does this make sense??? Code is quite simple:
>
> where Message.MessageBody REGEX '(?i)something'
>
> Using something like a * for wildcard searching isn't acceptable in the times
> of Google finding everything with the part of a second.
> What are my options here? Does anyone have an acceptable solution? Can I use
> an additional software like Lucene together with Valentina? Would it make a
> good feature request to integrate Lucene? Is there something better available?
1) can you formulate task?
What is a key? You need find words? In your texts?
I know you keep emails there.
So you have searches as word1 & word2 & word3 ?
Or we talk about searches CONTAINS?
2) I will take a look on Lucene now
--
Best regards,
Ruslan Zasukhin
VP Engineering and New Technology
Paradigma Software, Inc
Valentina - Joining Worlds of Information
http://www.paradigmasoft.com
[I feel the need: the need for speed]
More information about the Valentina
mailing list