Character maps and code pages


Re: Character maps and code pages

Post by MochiMoppel

Update

Someone must have "fixed" the old and trusted find command :evil:
With version 4.9.0, found in @radky's BW64, the trick for finding file names with invalid UTF-8 encoding no longer works.

Code:

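#!/bin/bash
# Create test files whose names are single-byte Latin-1/CP1252, i.e. invalid UTF-8
mkdir -p /tmp/testfiles
touch /tmp/testfiles/$'Malm\xf6'  #Malmö invalid UTF-8
touch /tmp/testfiles/$'Malm\xf8'  #Malmø invalid UTF-8
touch /tmp/testfiles/$'\x80uro'   #€uro  invalid UTF-8

# The trick: "*" does not match names with invalid UTF-8, so -not -name "*" lists them
FOUND=$(find /tmp/ -not -name "*")
Xdialog -left -back "Invalid UTF-8 filenames:" -msg "$FOUND" 200x200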

Version 4.7.0 in @rockedge's F96-CE still works fine. However, running F96-CE's version of find in BW64 does not work either, which suggests that find itself is not the culprit. I suspect that other changes have been made in BW64, because ROX-Filer also interprets '*' names differently.

[EDIT] The problem concerning BW64 was discussed in viewtopic.php?p=102155#p102155 where @Burunduk posted a smart workaround.
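
For anyone who wants a check that does not depend on how '*' is matched at all, here is a rough sketch. It is not @Burunduk's workaround from the linked post, just an alternative that assumes an iconv binary is installed (glibc normally ships one). The function name list_invalid_utf8 is made up for this example. It validates each base name by asking iconv to convert it from UTF-8 to UTF-8, which fails on invalid byte sequences.

```shell
# list_invalid_utf8 [DIR]: print every path under DIR (default: .) whose
# base name is not valid UTF-8. Hypothetical helper; assumes iconv exists.
list_invalid_utf8() {
    find "${1:-.}" -exec sh -c '
        for path in "$@"; do
            name=${path##*/}    # validate the last path component only
            # iconv exits non-zero when the input is not valid UTF-8
            printf "%s" "$name" | iconv -f UTF-8 -t UTF-8 >/dev/null 2>&1 ||
                printf "%s\n" "$path"
        done' sh {} +
}
```

In the script above, FOUND=$(list_invalid_utf8 /tmp/testfiles) could stand in for the find line. It starts one iconv process per file, so it is noticeably slower than the find trick on large trees.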
