This php script sorts your documents (by using hardlinks) into subfolders based on the hashtags it finds in your documents filenames.
Why do you think that https://github.com/thiagoalessio/tesseract-ocr-for-php is a good alternative to FileBasedMiniDMS