antiword - show the text and images of MS Word documents
I built the slackbuild for antiword:
antiword-0.37-x86_64-1_SBo.tgz
https://slackbuilds.org/repository/12.2 ... /antiword/
I wanted it to convert office documents to text files for an indexing program that I'm using. See thread:
xapian indexing software for searching
For some reason, antiword is being used when I'm building my indexes but would rather use a different program (e.g. soffice) because antiword can't handle rtf files which might have the ".doc" extension. In my efforts to get around this issue, I'm working on a script to try to find the most suitable utility on a system to convert an office document to text: