Hello Puppians,
After some adventures I downloaded and installed Tesseract OCR on my desktop
The single line above shows the end result of my work.
Everything below is a record of everything I did wrong. - but this can be ignored because I discarded ALL my previous effort and installed the package using the Puppy Package Manager. The moral of this: Always work with the latest version of your Puppy.
I would like to run the OCR package called tesseract.
Found and loaded the package - more detail below.
The package runs from bash/CLI
The general form of a command is: tesseract OCR_image.tif outfile.txt
The manual advises I can learn something by typing tesseract --help.
You see the command and the result:
# tesseract --help
tesseract: error while loading shared libraries: libtesseract.so.4: cannot open shared object file: No such file or directory
#
When I invoke tesseract on an image file the result (I mean the error message) is identical. It can't find the shared object.
My method: Info about downloading and installing Tesseract is sparsely spread around the web.
The info I found applied to other versions of Linux where 'god' (i.e. sudo) had to be invoked to install Tesseract.
In other places a package installer was recommended.
I found tesseract binary (version: tesseract-ocr_4.1.1-2build2_i386.deb) on this page:
Link: http://nz.archive.ubuntu.com/ubuntu/poo ... _i386.deb
I saved the file to: /home/spot/downloads/
Installing tesseract: Using ROX file manager I clicked on the binary. I saw: do you want to install tesseract? <oh, yes please!>
A moment later I saw a message from the Puppy Package Manager: The following package has been installed:
(and then the name tesseract-ocr_4.1.1 CATEGORY: Utility.)
Then (still from PPM) Note: this package does not have a menu. <OK>
The final <OK> is just to 'shrink' the message box when it has been read.
Help file: Various pages suggested the command: Tesseract --help would provide the usual help msg posted by bash.
For me, --help resulted in a so.4 error message as below:
# tesseract --help
tesseract: error while loading shared libraries: libtesseract.so.4: cannot open shared object file: No such file or directory
Looking for an explanation about shared object errors I found some comment (for Unix installations) that suggest a path variable has not been set. At the bottom of this page: https://www.pythonanywhere.com/forums/topic/28295/ I see this:
"You probably need to set the LD_LIBRARY_PATH environment variable to point to the directory that contains the library file."
I read similar comments on other pages about the so.4 error message. Speaking generally the file structure of Puppy differs from 'common' Linux. I do not know where (in the Puppy file system) the shared object should be placed.
Can anyone here advise me where to go for info about this problem?
Ditto for any help in solving this problem.
I have a large pile of paper documents. I wish to see if they can be transmogrified to UTF-8 text.
Please help! Thanks to all!
cobaka
Distro is: Distro: BionicPup32 19.03
Hardware is: Dell laptop running P4 + 2GiBytes RAM. Old but reliable.
Mostly I will run a desktop box/I3.