wv-1.2.9 - Covert Word docs to: text, html, dvi, ps, pdf, wml, rtf, latex

Moderators: kirk, jamesbond, p310don, JakeSFR, step, Forum moderators

Post Reply
s243a
Posts: 501
Joined: Mon Dec 09, 2019 7:29 pm
Has thanked: 90 times
Been thanked: 37 times

wv-1.2.9 - Covert Word docs to: text, html, dvi, ps, pdf, wml, rtf, latex

Post by s243a »

I compiled (on Fatdog 810b) a slackbuild for "wv" which is a utility for converting word documents to other formats.

wv-1.2.9-x86_64-1_SBo.tgz

I'm testing out various tools that can convert documents (mostly office documents) to text (see post) for purposes of indexing (see post) and searching my file system. This is one such tool.

Here's an overview of this package:

From sourcefordge:

wvHtml: convert your Word document into HTML4.0. Use "AbiWord --to=html" instead.
wvLatex: convert your Word document into visually (pretty) correct LaTeX. Use "AbiWord --to=tex" instead.
wvCleanLatex: convert into 'cleaner' LaTeX containing less visual mark-up, more suitable for further use and LyX import. Work in progress. Use "AbiWord --to=tex" instead.
wvSummary prints out metadata from your Microsoft Office documents. Not deprecated; actually useful.
wvDVI: converts word to DVI. Requires 'latex'. "AbiWord --to=tex" will get you halfway there.
wvPS: converts word to PostScript. Requires 'dvips'. Use "AbiWord --print=file.ps" instead.
wvPDF: converts word to Adobe PDF. Use "AbiWord --to=pdf" or "AbiWord --print=file.ps && ps2pdf file.ps" instead.
wvText: converts word to plain text. Textually correct output requires 'lynx.' For poor output, this doesn't require anything special. Use "AbiWord --to=txt" instead.
wvAbw: converts word to Abiword format. Use "AbiWord --to=abw" instead.
wvWml: converts word to WML for viewing on portable devices like WebPhones and Palm Pilots. Use "AbiWord --to=wml" instead.
wvRtf: a basic version exists to convert DOC to RTF. Use "AbiWord --to=rtf" instead.

http://wvware.sourceforge.net/#resource

Note, that the sourcefordge project is outdated, and this project is now part of abiword.
https://www.abisource.com/downloads/wv/1.2.9/

I got the slackbuild from:
https://slackbuilds.org/repository/13.1/libraries/wv/

but modified it to use more recent source.

**Note, while this package appears to be maintained by the abiword team, I'm not sure if it is as up to date as libreoffice. Also, I don't think it can handle rtf files as input files (need to verify).

Post Reply

Return to “Software”