Honestly, I know nothing about this. The files my partner receives come from a Spanish speaking country. I think I converted the filenames using (but forgot to take special note of the command):
Code: Select all
convmv -r -f ISO-8859-1 -t utf8 /directory_to_convert
Above is the test without renaming the files. Use --notest option to include the rename. Include -i option if you want interactive (yes/no) mode.
Despite still knowing next to nothing about such matters, at least I found a reference that may explain matters to me once I digest it more fully:
http://dwheeler.com/essays/fixing-unix- ... .html#utf8
Mojibake: https://en.wikipedia.org/wiki/Mojibake
MochiMoppel wrote: Sat Jun 26, 2021 3:59 am
Code: Select all
> /tmp/testfiles/$'Malm\xf6' #Malmö invalid UTF-8
This is an interesting construct, though I'm lost as to how it works. I can get by with most bash constructs, but I am certainly no genius at it. $'Malm\xf6' ... how does that do what it does???! (EDIT: found out - per my next post)
Anyway, main question I have is "what is that script supposed to do"? When I run it I get the Xdialog, but all it says is per the screenshot attached.
doesn't seem to be working; meaning, it doesn't find anything so what was to be expected? EDIT: it works now, I had not set LANG locale correctly (previously just had LANG C; not that I really have a clue about locale issues...).
Maybe I am missing some language-related component on my installation - that is perfectly possible since my own usage isn't usually concerned about such matters, but it would be good to sort out and I'm unlikely myself to know how to fix that; the ignorance that comes from only knowing and using one language, being English... well, I do know quite a bit of Norwegian, and a sliight bit of French, and a minute bit of German, and once studied Russian for about six weeks because I liked chess (though found I was using more time on the optional Russian course than on my main Electronic Engineering degree study courses), but never bothered configuring any computer to work with the various non-english-alphabet characters involved with any of these.
Note: I only had locales C and POSIX (per locale -a command). I tried adding locales:
en_GB.UTF-8 UTF-8
en_GB ISO-8859-1
by uncommenting these two in /etc/locale.gen (on my WDL_Arch64 system). Of course I would be better with NZ (since I'm there...) or maybe US, but force of old habits made me use GB... So, if wanting US... could manually edit /etc/locale.conf or use command: (per: arch wiki Locale link below)
Code: Select all
localectl set-locale LANG=en_US.UTF-8
followed by running locale-gen command. But this made no difference to my results...
https://wiki.archlinux.org/title/Locale
I find it all so frustrating that for tonight's bed-time reading I'm annoying myself further with:
"The UNIX-Haters Handbook"
http://web.mit.edu/~simsong/www/ugh.pdf
EUREKA!!! Okay, I was daft. After the above, I forgot to set LANG=en_GB.utf8 (in /etc/locale.conf), which I have now done and maybe the new attached image (fixed_I_think.png) is what I'm supposed to get... Mind you, what a weird but working idea to use command: