I have created and published FreeEed, open source software for eDiscovery. The project is hosted on GitHub here, and its discussion group is here.
The software is in a working state, but it is an early release, following the common open source approach of "commit early, commit often." At this time, I am looking for feedback on what the next incremental improvements should be.
The software has been tested on Ubuntu, and it may also work on Windows. It runs in local mode or on a cluster, and it is scalable: the same code runs in both settings without any change.