SitePoint Sponsor

User Tag List

Results 1 to 4 of 4

Hybrid View

  1. #1
    SitePoint Member
    Join Date
    Feb 2005
    Posts
    13
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)

    Fulltext indexing - Xapian vs. Swish-e (vs others) benchmarking

    Hello,

    Just curious if anybody has looked at some of the more specialized searching packages to do an evaluation of them? I'm specifically interested in performance with large indexes > 1 million documents and the ability to handle large numbers of searches in a minute.

    Anybody have any experience in this area?

    I originally thought that MySQL would be a good idea but I've heard that for my needs, MySQL probably wouldn't do as well in throughput as a binding to a dedicated full-text search engine.

  2. #2
    SitePoint Guru asterix's Avatar
    Join Date
    Jun 2003
    Posts
    847
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    What kind of documents are we talking about here?
    Small (<1Kb), multilingual, web pages...

    It kinda depends, a lot.

  3. #3
    SitePoint Member
    Join Date
    Feb 2005
    Posts
    13
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Two different fields -- The first will be a maximum of 1k, and the second will be at most 2-3k.

    I'd like to enable UTF-8/unicode so there will be an option to store multibyte characters as there will be some foreign language stuff.

    All text-based, no html tags, but I'm not sure if that matters when you are dealing with foreign language charsets.

  4. #4
    SitePoint Member
    Join Date
    Feb 2005
    Posts
    13
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    any ideas about advanced indexing?


Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •