DuckDB wrongly applies stoplists

The default DuckDB indexer can be used with an optional stoplist, for instance 'english'. The stoplist is used before an optional stemmer, for instance 'porter'. During searching however, the stoplist is not used. This may result in stopwords like 'the' matching stemmed terms in your index, for instance terms that are included by stemming a word like 'thes'. This word 'the' (originally 'thes') will have a very low document frequency (df), and therefore an unwanted and large impact on the final ranking.

Edited by Djoerd Hiemstra