
tesseract-ocr_3.04.01-4 with Xenial 32

Posted: Thu Oct 22, 2020 2:29 pm
by gilles
Hello,
From the PPM I downloaded this OCR system, together with:
- gimagereader_3.1.2+git368fa8f
- language files for French and Dutch (Dutch: tesseract-ocr-nld_3.04.00-1)
It is quite a good system for converting a PDF file to a text file that I can open in LibreOffice and edit.
With French, it works.
My only problem is that the Dutch language file doesn't work: I get a message telling me that the Dutch dictionary is not there.

What can I do?

Thank you

Re: tesseract-ocr_3.04.01-4 with Xenial 32

Posted: Sat Mar 20, 2021 7:13 pm
by wanthinker
You can get the missing language file at https://github.com/tesseract-ocr/tessdata
I think the one for Dutch is https://github.com/tesseract-ocr/tessda ... raineddata

Either download or move it to the same directory as the other language file, or specify its location on the command line.
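
For anyone who wants to check where the language files actually are before copying anything, here is a minimal Python sketch of both options. It is only an illustration under assumptions: the directories in CANDIDATE_DIRS are guesses at common locations (not confirmed for Xenialpup), the helper names are made up for this post, and page.png is a placeholder. Tesseract uses the codes fra for French and nld for Dutch, and takes the language with -l; the --tessdata-dir option is listed in the 3.04 man page, but whether it expects the tessdata directory itself or its parent is worth checking with `tesseract --help` on your install. The downloaded file also has to match the installed Tesseract version.

```python
from pathlib import Path

# Assumed locations only; the real tessdata directory on your system may differ.
CANDIDATE_DIRS = [
    Path("/usr/share/tesseract-ocr/tessdata"),
    Path("/usr/share/tessdata"),
    Path("/usr/local/share/tessdata"),
]


def dirs_with_language(lang, candidate_dirs=CANDIDATE_DIRS):
    """Return the candidate directories that contain <lang>.traineddata."""
    return [d for d in candidate_dirs if (d / (lang + ".traineddata")).is_file()]


def build_command(image, output_base, lang, tessdata_dir=None):
    """Build a tesseract argument list; nothing is executed here."""
    cmd = ["tesseract", str(image), str(output_base), "-l", lang]
    if tessdata_dir is not None:
        # Optional: point tesseract at a non-default language data location.
        cmd += ["--tessdata-dir", str(tessdata_dir)]
    return cmd


if __name__ == "__main__":
    # Show where (if anywhere) the French and Dutch files were found.
    for lang in ("fra", "nld"):
        found = dirs_with_language(lang)
        print(lang, "->", [str(d) for d in found] or "not found")
    # Print the command you could then run by hand in a terminal.
    print(" ".join(build_command("page.png", "page", "nld")))
```

If fra is found in a directory and nld is not, copying nld.traineddata into that same directory is the first option above; passing a directory as the fourth argument of build_command corresponds to the second.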


unable to install tesseract

Posted: Tue Mar 21, 2023 7:37 pm
by xdv

PPM package search: No matching package name
my repositories are:

  • ubuntu-xenial-universe
  • ubuntu-xenial-main
  • ubuntu-xenial-multiverse
  • puppy-xenial
  • puppy-noarch

and my machine runs xenialpup 32


Re: tesseract-ocr_3.04.01-4 with Xenial 32

Posted: Wed Mar 22, 2023 7:34 am
by dogcat

Ubuntu moves the repositories it no longer supports. The Xenial files should still be available, but at a different URL from the one PPM is looking at. You probably need to point PPM at that new URL to find all the old packages. Someone here should know the new URL.


Re: unable to install tesseract

Posted: Wed Mar 22, 2023 10:53 am
by forthuser
xdv wrote: Tue Mar 21, 2023 7:37 pm

and my machine runs xenialpup 32

Did you do a PPM update?


Re: tesseract-ocr_3.04.01-4 with Xenial 32

Posted: Wed Mar 22, 2023 11:22 am
by bigpup

Also try Quickpet -> Info -> Xenialpup updates to fix the repository location info for PPM.

Then do a PPM database update.


Re: tesseract-ocr_3.04.01-4 with Xenial 32

Posted: Wed Mar 22, 2023 11:31 am
by pp4mnklinux

I used software (Adobe Acrobat PRO XI) until I discovered online services.

https://www.ilovepdf.com

Give it a try