tesseract-ocr_3.04.01-4 with Xenial 32
Moderator: Forum moderators
-
- Posts: 186
- Joined: Fri Aug 21, 2020 11:07 pm
- Location: France
- Has thanked: 44 times
- Been thanked: 13 times
tesseract-ocr_3.04.01-4 with Xenial 32
In the PPM I downloaded this OCR system, together with;
- gimagereader_3.1.2+git368fa8f
- languages files for french and dutch (dutch language : tesseract-ocr-nld_3.04.00-1 )
It is a quite good system to convert a PDF file to a text file I can put in libreoffice, to modify it
In french, it works.
My only problem is that the dutch language file doesn't work. I have a message telling me that the dutch dictionary is not there.
How could I do ?
Thank you
-
- Posts: 33
- Joined: Thu Mar 18, 2021 11:35 pm
- Has thanked: 17 times
- Been thanked: 1 time
Re: tesseract-ocr_3.04.01-4 with Xenial 32
gilles wrote: Thu Oct 22, 2020 2:29 pmHello
In the PPM I downloaded this OCR system, together with;
- gimagereader_3.1.2+git368fa8f
- languages files for french and dutch (dutch language : tesseract-ocr-nld_3.04.00-1 )
It is a quite good system to convert a PDF file to a text file I can put in libreoffice, to modify it
In french, it works.
My only problem is that the dutch language file doesn't work. I have a message telling me that the dutch dictionary is not there.How could I do ?
Thank you
You can get the missing language file at https://github.com/tesseract-ocr/tessdata
Think the one for Dutch is https://github.com/tesseract-ocr/tessda ... raineddata
Either download/move that to the same directory as the other language file, or specify it using the commandline.
unable to install tesseract
PPM package search: No matching package name
my repositories are:
ubuntu-xenial-universe
ubuntu-xenial-main
ubuntu-xenial-multiverse
puppy-xenial
puppy-noarch
and my machine runs xenialpup 32
XenialPup 32-bit
Re: tesseract-ocr_3.04.01-4 with Xenial 32
Ubuntu moves the old repositories that they no longer support, Xenial files should still be available but at a different url than where PPM is looking. Probably need to point PPM at that new / different url to find all the old packages. Someone here should know the new url.
Μακάριοι οι καθαροί στην καρδιά * επειδή, θα δουν τον Θεό.
- bigpup
- Moderator
- Posts: 7298
- Joined: Tue Jul 14, 2020 11:19 pm
- Location: Earth, South Eastern U.S.
- Has thanked: 951 times
- Been thanked: 1615 times
Re: tesseract-ocr_3.04.01-4 with Xenial 32
Also try Quickpet -> Info -> Xenialpup updates to fix the repositories location info for PPM.
then do PPM database update
The things you do not tell us, are usually the clue to fixing the problem.
When I was a kid, I wanted to be older.
This is not what I expected
- pp4mnklinux
- Posts: 1239
- Joined: Wed Aug 19, 2020 5:43 pm
- Has thanked: 661 times
- Been thanked: 321 times
Re: tesseract-ocr_3.04.01-4 with Xenial 32
I used software (Adobe Acrobat PRO XI) till I discovered online services.
Give it a try
PP4MNK