I created slackbuilds for the following programs (Compiled in Fatdog810b):
xapian-core-1.4.17-x86_64-1_SBo.tgz
https://slackbuilds.org/repository/14.1 ... pian-core/
*slackbuild modified to use newer source
xapian-omega-1.4.17-x86_64-1_SBo.tgz
https://slackbuilds.org/repository/14.2 ... ian-omega/
*slackbuild modified to use newer source
xapian-bindings-1.4.17-x86_64-2_SBo.tgz
https://slackbuilds.org/repository/14.2 ... -bindings/
*slackbuild modified to use newer source
These programs can be used for things like indexing directories and web pages. Supports many document types such as htm, pdf and office documents. I tested this and it works
Create an index with the following command
Code: Select all
omindex -p --db info --url documents /mnt/data0/Documents.
https://www.ibm.com/developerworks/libr ... index.html
https://manpages.ubuntu.com/manpages/ar ... dex.1.html
Query the database as follows:
Code: Select all
quest --db=info redbook
https://www.ibm.com/developerworks/libr ... index.html
Some related links:
https://wiki.python.org/moin/HelpOnXapi ... g_an_index
https://xapian.org/download
https://github.com/xapian/xapian-docspr ... de/python3
https://getting-started-with-xapian.rea ... ample-code
https://xapian.org/docs/
alternativesto xapian:
https://unix.stackexchange.com/question ... t-indexing
https://www.tecmint.com/count-word-occu ... text-file/
http://swishplusplus.sourceforge.net/
https://web.archive.org/web/20061223111 ... ish-e.org/
https://en.wikipedia.org/wiki/SWISH-E
https://metacpan.org/pod/SWISH
https://www.linuxjournal.com/article/6652