Searching large amounts of text

Beatrix Willius bwillius at gmx.de
Sun Oct 21 11:38:52 CDT 2012


Hi Ruslan,

On 21.10.2012, at 18:19, Ruslan Zasukhin <ruslan_zasukhin at valentina-db.com> wrote:

> 1) can you formulate task?
> What is a key?  You need find words? In your texts?
> I know you keep emails there.
> 
> So you have searches as word1 & word2 & word3 ?

Yes, emails need to be searched. Searching From and To is lightning fast now, but searching the text of the emails is pretty slow (>10 seconds in a database with about 50,000 mails).
> 
> Or we talk about searches CONTAINS?

At the moment I'm using a simple regex.
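
To make the difference concrete, here is a minimal Python sketch (for illustration only; it is not the code used in Mail Archiver X or Valentina). It compares a regex scan over every mail with a lookup in an inverted index, which is the basic idea behind full-text engines such as Lucene. All names (`regex_scan`, `build_index`, `search_all`) and the sample mails are made up for the example.

```python
import re
from collections import defaultdict

WORD = re.compile(r"\w+")

def tokenize(text):
    """Lower-case the text and split it into words."""
    return WORD.findall(text.lower())

def regex_scan(mails, words):
    """Baseline: test every mail body against every word.
    The cost grows with the total amount of text on each search."""
    patterns = [re.compile(r"\b" + re.escape(w) + r"\b", re.IGNORECASE)
                for w in words]
    return sorted(mail_id for mail_id, body in mails.items()
                  if all(p.search(body) for p in patterns))

def build_index(mails):
    """Inverted index: word -> set of ids of the mails containing it.
    Built once, then reused for every search."""
    index = defaultdict(set)
    for mail_id, body in mails.items():
        for word in tokenize(body):
            index[word].add(mail_id)
    return index

def search_all(index, words):
    """word1 & word2 & word3: intersect the id sets of all words."""
    id_sets = [index.get(w.lower(), set()) for w in words]
    if not id_sets:
        return []
    return sorted(set.intersection(*id_sets))

# Hypothetical sample data
mails = {
    1: "Invoice for the Valentina license",
    2: "Meeting notes: search performance and Lucene",
    3: "Valentina search is slow on large mail archives",
}
index = build_index(mails)
print(search_all(index, ["valentina", "search"]))  # [3]
```

The regex version has to read every mail body on every search, while the indexed version only touches the id sets of the words asked for. A real engine adds stemming, ranking and on-disk storage on top of this.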
> 
> 
> 2) I will take a look on Lucene now


On 21.10.2012, at 18:21, Ruslan Zasukhin <ruslan_zasukhin at valentina-db.com> wrote:

> On 10/18/12 6:26 PM, "Beatrix Willius" <bwillius at gmx.de> wrote:
> 
>> What are my options here? Does anyone have an acceptable solution? Can I use
>> an additional software like Lucene together with Valentina? Would it make a
>> good feature request to integrate Lucene? Is there something better available?
> 
> Apache Lucene(TM) is a high-performance, full-featured text search engine
> library written entirely in Java. It is a technology suitable for nearly any
> application ...

No, I haven't tried it. I was just throwing around an idea. There are two reasons to have a look at Lucene:

1. It's available for many languages, including C dialects.
2. It has an Apache license.




Regards

Trixi Willius

http://www.mothsoftware.com
Mail Archiver X: The email archiving solution for professionals
