Say I write a post. This one.
The plugin (which may suffer from CPU and storage issues, but screw that) compiles a trigram-based linguistic vector of the post. That is, it counts how many times each 3-letter substring occurs in the entire text. It also notes the length of the post. Or maybe it creates some other sort of computational-linguistic indexing key, like a Statistically Improbable doohickey, or a discrete wavelet alphabetotron. One of those.
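For the trigram version, the counting step might look something like this. It is only a rough sketch: the function name `trigram_vector` is made up for illustration, and the lowercasing and whitespace-collapsing are my own assumptions about what "every 3-letter substring" should mean in practice.

```python
from collections import Counter


def trigram_vector(text):
    """Count every overlapping 3-character substring in the text.

    Assumption: case is folded and runs of whitespace are collapsed to a
    single space, so formatting differences do not change the counts.
    """
    normalized = " ".join(text.lower().split())
    return Counter(normalized[i:i + 3] for i in range(len(normalized) - 2))


post = "Say I write a post. This one."
vector = trigram_vector(post)   # maps trigram -> number of occurrences
post_length = len(post)         # the plugin also notes the length
```

A `Counter` is a sparse representation, which matters here because most of the possible trigrams never show up in a single blog post.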
The plugin then takes this trigram frequency vector (or whatever), and scans the entire corpus of Project Gutenberg text files for the most similar passages of the same length.
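Here is one hedged way the scan could work on a single eText. "Most similar" is not pinned down above, so this sketch assumes cosine similarity between trigram counts and slides a window the same length as the post across the text. The names `trigrams`, `cosine`, and `best_passage`, and the half-window default step, are all hypothetical choices.

```python
import math
from collections import Counter


def trigrams(text):
    """Sparse trigram counts (lowercased, whitespace collapsed: an assumption)."""
    text = " ".join(text.lower().split())
    return Counter(text[i:i + 3] for i in range(len(text) - 2))


def cosine(a, b):
    """Cosine similarity between two trigram Counters; 0.0 if either is empty."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm = (math.sqrt(sum(c * c for c in a.values()))
            * math.sqrt(sum(c * c for c in b.values())))
    return dot / norm if norm else 0.0


def best_passage(post, etext, step=None):
    """Return (score, start) of the post-length window of etext most like post."""
    size = len(post)
    if step is None:
        step = max(1, size // 2)  # assumed default: half-overlapping windows
    target = trigrams(post)
    best = (0.0, 0)
    for start in range(0, max(1, len(etext) - size + 1), step):
        score = cosine(target, trigrams(etext[start:start + size]))
        if score > best[0]:
            best = (score, start)
    return best


score, start = best_passage("the whale", "xxxxxxxxxthe whalezzzzzzzzz", step=1)
```

Recounting trigrams for every window of every eText is exactly the CPU problem mentioned earlier; a real version would presumably precompute and store the window vectors ahead of time.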
So in the end, you get a link like “Public-domain eTexts which may be interesting,” and it links to the eText, with a same-length passage.
Maybe a few processing and storage considerations. Like I said. But offline processing would work….

