Closely related to the probabilistic stuff is Xapian, a.k.a. Omseek, a.k.a.
Omsee, a.k.a. Open Muscat, an open-source project intended as a
probabilistic search-engine framework. It was initially financed by
Brightstation, but was left on its own some time ago. It now lives on
Sourceforge.
More on the research side, there’s libbow/rainbow by Andrew McCallum et al.
from CMU, including Bayesian classifiers, vector-space algorithms, and other
Here at gtd, we’re experimenting internally with some new vector-space based
search and classification algorithms. What we have so far looks quite
promising, but AFAIK it’s not going to be open-sourced, at least for now.
htdig-dev mailing list