Open Source Search Engines

Table of Contents

  • Apache Lucene
    • Lucene++
  • Apache Solr
  • Open Semantic Search
    • Subprojects
      • Solr PHP UI
  • Elasticsearch
    • Other Projects
      • dejavu
      • Fess
      • Searchkit
  • OpenSearch
    • Other Projects
  • Gigablast
  • YaCy
    • Articles
  • Vald
  • Weaviate
  • MWMBL
  • Alexandria
  • Wiby
  • OpenSearchServer
  • Metasearch
    • MetaGer
  • Not Web Scale
    • meilisearch
    • Typesense
  • Smaller Engines
    • Sonic
    • ZincSearch

Apache Lucene

  • https://lucene.apache.org/
  • The open source Java library that powers Apache Solr and Elasticsearch, among many other search projects.

Lucene++

Apache Solr

Open Semantic Search

  • https://opensemanticsearch.org/
  • Under the hood it runs Apache Solr, but the changes are significant enough to make listing Open Semantic Search separately worthwhile.[1]

Subprojects

Elasticsearch

Other Projects

  • dejavu - Open source, JS web-based UI for Elasticsearch and OpenSearch.
  • Fess - Open source, enterprise search server with web crawler and GUI. Written in Java.
  • Searchkit - Updated: 3/2023 - Checked: 3/2023 - Stars: 4.6k - Open source library for building search UIs with JS, React, Vue, Angular, etc. Written primarily in TypeScript.

OpenSearch

Other Projects

  • Please see Other Projects under Elasticsearch. Only projects that are for OpenSearch exclusively will be listed here.

Gigablast

  • https://gigablast.com/
  • GitHub Repo
  • Founded in 2000 by Matt Wells as a closed source search engine, it was later open sourced. It is written in C++, is distributed, and includes both the engine and a crawler.

YaCy

Vald

Weaviate

MWMBL

Alexandria

Wiby

OpenSearchServer

Metasearch

MetaGer

Not Web Scale

meilisearch

Typesense

Smaller Engines

  • Sonic - Updated: 1/2023 - Checked: 3/2023 - Stars: 18k - A lightweight, speedy search backend written in Rust.
  • ZincSearch - Updated: 3/2023 - Checked: 3/2023 - Stars: 14.7k - Lightweight alternative to Elasticsearch, written in Go. Includes a web UI.

Footnotes

  1. It isn't meant specifically for web search, but it offers a number of features that could be useful in a search engine, e.g. exploratory search as well as collaborative annotation and tagging.

  2. The fork was started following controversial licensing changes to Elasticsearch. For more on the history of this controversy, see Graham Gillen's Elasticsearch vs OpenSearch series. For a brief evaluation of OpenSearch's progress, see Matt Asay's One year of OpenSearch: Grading AWS’ open source effort.

  3. The project has some similarities with what I'm looking to do with Phoebe. It is open source, a non-profit, and the code is written in Python.

  4. Some interesting functionality includes tunable ranking, sorting, faceting & filtering, grouping & distinct, federated search, and curation. It doesn't appear to be in web scale usage, but they've expressed interest in benchmarking larger datasets, so I submitted an issue requesting that CommonCrawl be benchmarked.