Searching large amounts of text
Beatrix Willius
bwillius at gmx.de
Sun Oct 21 11:38:52 CDT 2012
Hi Ruslan,
On 21.10.2012, at 18:19, Ruslan Zasukhin <ruslan_zasukhin at valentina-db.com> wrote:
> 1) can you formulate task?
> What is a key? You need find words? In your texts?
> I know you keep emails there.
>
> So you have searches as word1 & word2 & word3 ?
Yes, emails need to be searched. Searching From and To is lightning fast now. But searching for text in the emails is pretty slow (>10 seconds in a db with about 50.000 mails).
>
> Or we talk about searches CONTAINS?
At the moment I'm using a simple regex.
>
>
> 2) I will take a look on Lucene now
On 21.10.2012, at 18:21, Ruslan Zasukhin <ruslan_zasukhin at valentina-db.com> wrote:
> On 10/18/12 6:26 PM, "Beatrix Willius" <bwillius at gmx.de> wrote:
>
>> What are my options here? Does anyone have an acceptable solution? Can I use
>> an additional software like Lucene together with Valentina? Would it make a
>> good feature request to integrate Lucene? Is there something better available?
>
> Apache Lucene(TM) is a high-performance, full-featured text search engine
> library written entirely in Java. It is a technology suitable for nearly any
> application ...
No, haven't tried it. Was just throwing around an idea. There a 2 reasons to have a look at Lucene:
1. It's available for many languages like C dialects.
2. It has an Apache license.
Mit freundlichen Grüßen/Regards
Trixi Willius
http://www.mothsoftware.com
Mail Archiver X: The email archiving solution for professionals
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.macserve.net/pipermail/valentina/attachments/20121021/dbbc2853/attachment.html>
More information about the Valentina
mailing list