DataparkSearch Engine is an open source search engine written in C. It is distributed under the GNU General Public License and designed to organize search within a website, group of websites, intranet or local system.
Egothor is an open source search engine implementation written entirely in Java to ensure cross platform compatibility. Egothor is able to recognize many common file formats: HTML, PDF, PS, DOC and XLS. Its architecture means that other file formats can be easily added. The engine can index about 50 pages a second and comes with a high capacity crawler robot which is compatible with the robots.txt standard.
Gonzui is a source code search engine designed specifically for searching code written in a variety of programming languages. As of 2005, it supports C, C++, Java, JavaScript, Ruby, Python, PHP, Perl, Objective Caml, Brainfuck, CSS, Shell and plain text.
Terrier is software for the rapid development of Web, intranet and desktop search engines. More generally, it is a modular platform for the rapid development of large-scale Information Retrieval applications, providing indexing and retrieval functionalities.
Puggle is an open-source graphical desktop search engine written exclusively in Java. It provides full text and metadata search over files, folders, music, photos, web pages and more that are stored locally on your computer.

Wikia is a selective free web hosting service for wikis (or wiki farm) operated by Wikia, Inc., a for-profit Delaware company founded in late 2004 by Jimmy Wales and Angela Beesley. Wikia particularly targets communities, both those established offline and those with a purely online following. Wikia is free of charge for readers and editors and licenses user-provided content under the GNU Free Documentation License or, in the case of Memory Alpha and Uncyclopedia, a Creative Commons license.







December 22nd, 2007 at 9:53 pm
Hi! Nice collection of open source search engines. How about adding Lucene to the list? It’s one of the leading open source SEs.
Also, although Wikia is reportedly going to be free for use, I don’t know if I would classify it as “open source”. And if you’re going to include it in the list, you may also want to include Google Desktop, a free application that provides desktop search.
December 23rd, 2007 at 5:24 pm
NItink, thanks for your comments.
December 24th, 2007 at 9:00 pm
It depends on what you mean by “search engine”, of course. There are low level information retrieval packages like Lucene, all the way up to distributed web crawler/search engine combos like Nutch.
For example, this search returns over 7000 hits, but I’m guessing that many of them wouldn’t qualify as a “search engine” given the list you posted above.
– Ken