Updated filters for Recoll
This page describes new and updated filters. They will be part of the next release, but can be installed on the current release if you need them.
For updated filters, you just need to copy the script to the filters directory, which is typically either /usr/share/recoll/filters or /usr/local/share/recoll/filters. Please check that the script is executable after copying it, and make it so if needed (chmod a+x scriptname).
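The copy-and-chmod step can be sketched as a small shell function. This is only an illustration: the name install_filter is made up here and is not part of Recoll, and the filters directory is passed explicitly because it depends on how Recoll was installed.

```shell
#!/bin/sh
# Hypothetical helper (not part of Recoll): copy a filter script into a
# filters directory and make sure the copy is executable.
install_filter() {
    script=$1       # path to the downloaded filter, e.g. ./rclchm
    filters_dir=$2  # e.g. /usr/share/recoll/filters

    if [ ! -d "$filters_dir" ]; then
        echo "no such directory: $filters_dir" >&2
        return 1
    fi
    cp "$script" "$filters_dir/" || return 1
    chmod a+x "$filters_dir/$(basename "$script")"
}
```

For a system-wide installation this would typically be run as root, for example: install_filter ./rclchm /usr/share/recoll/filters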
For new filters, you'll need to copy the script file as above, possibly install the supporting application, and usually edit the mimemap, mimeview and mimeconf files, either in the shared directory (/usr[/local]/share/recoll/examples), or in your personal configuration directory ($HOME/.recoll or $RECOLL_CONFDIR).
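To find the personal copies of those three files, a sketch along the following lines may help. It assumes that $RECOLL_CONFDIR, when set, takes precedence over $HOME/.recoll; the function names are invented for this example.

```shell
#!/bin/sh
# Print the personal configuration directory: $RECOLL_CONFDIR if it is
# set and non-empty, else $HOME/.recoll (assumed order of precedence).
recoll_confdir() {
    printf '%s\n' "${RECOLL_CONFDIR:-$HOME/.recoll}"
}

# Print the paths of the three files that a new filter usually touches.
local_config_files() {
    dir=$(recoll_confdir)
    for name in mimemap mimeconf mimeview; do
        printf '%s/%s\n' "$dir" "$name"
    done
}
```

The printed paths are only candidates: a personal file may not exist yet if you have never overridden the shared configuration.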
Alternatively, you can replace your system files with these updated and complete versions: mimemap mimeconf mimeview
rclchm needs to be updated for all Recoll versions up to and including 1.17.1.
All filters are up to date in Recoll 1.16.2, except rclchm, and the new ones for gnumeric and Okular annotations.
Recoll 1.15 may benefit from some of the newer filters linked below.
If you are running an older Recoll version, you really should upgrade.
DIA files
rcldia is a new filter for Dia files, contributed by Stefan Friedel.
CHM files
rclchm. The previous version of the filter mishandled files which had encoded internal URLs (not very frequent, but it happens).
Okular annotations
rclokulnote. Okular lets you create annotations for PDF documents and stores them in XML format somewhere under ~/.kde. This filter does not do a nice job of formatting the data, but will at least let you find it...
Gnumeric
rclgnm. Needs xsltproc and gunzip. As .gnumeric was in the list of explicitly ignored suffixes, you can't just add the mime and indexer script lines to your local mimemap and mimeconf: you also need to define recoll_noindex in the local mimemap (to override the system one, which contains .gnumeric). The simplest approach may be to just replace the system files with those above.
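Because rclgnm relies on external programs, it can be worth checking that they are on the PATH before indexing. The helper below is a generic sketch, not something shipped with Recoll; missing_helpers is a hypothetical name.

```shell
#!/bin/sh
# Hypothetical helper: print each named command that cannot be found on
# PATH, and return non-zero if at least one is missing.
missing_helpers() {
    status=0
    for cmd in "$@"; do
        if ! command -v "$cmd" >/dev/null 2>&1; then
            echo "missing: $cmd"
            status=1
        fi
    done
    return $status
}
```

Calling missing_helpers xsltproc gunzip prints nothing and succeeds when both programs are installed.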
Rar archive support
rclrar. This is up to date in Recoll 1.16.2 but may be added to Recoll 1.15. It needs the Python rarfile module.
Mimehtml support
This is based on the internal mail filter; you just need to download and install the configuration files (mimemap and mimeconf). It will only work with 1.15 and later.
Konqueror webarchive (.war) filter
Updated zip archive filter
The filter is corrected to handle UTF-8 paths in zip archives: rclzip. Up to date in Recoll 1.16, but may be useful with Recoll 1.15.
Updated audio tag filter
The mutagen-based rclaudio filter delivered with Recoll 1.14.2 used a very recent mutagen interface which will only work with mutagen versions after 1.17 (probably: it at least works with 1.19, and does not with 1.15). You can download the corrected script here. Not useful with Recoll 1.5 or 1.6.
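To decide whether the corrected script is needed, one can compare the installed mutagen version with a known-good one. The sketch below uses 1.19 as the threshold only because that is the version reported as working above; the exact cut-off is not known. The function names are invented for this example, and only the major and minor fields are compared.

```shell
#!/bin/sh
# First numeric field of a dotted version string ("1.19.2" -> 1).
major_of() { printf '%s\n' "${1%%.*}"; }

# Second numeric field, or 0 when there is none ("1.19.2" -> 19, "2" -> 0).
minor_of() {
    case $1 in
        *.*) rest=${1#*.}; printf '%s\n' "${rest%%.*}" ;;
        *)   echo 0 ;;
    esac
}

# Return 0 if version $1 >= version $2, looking at major.minor only.
version_ge() {
    a_major=$(major_of "$1"); a_minor=$(minor_of "$1")
    b_major=$(major_of "$2"); b_minor=$(minor_of "$2")
    [ "$a_major" -gt "$b_major" ] && return 0
    [ "$a_major" -lt "$b_major" ] && return 1
    [ "$a_minor" -ge "$b_minor" ]
}

# Assumption: the installed version can be read from Python with
#   python -c 'import mutagen; print(mutagen.version_string)'
# and the result passed as the first argument, e.g. version_ge "$v" 1.19
```

If version_ge fails for your mutagen version, the original rclaudio script may not work and the corrected one is the safer choice.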