This lightweight tool converts non-UTF-encoded (such as GB2312, GBK, BIG5 encoded) files to UTF-8 encoded files. It can either be executed from command line (CLI), or imported into other Python code.
In the previous article, we discussed essential knowledge about Unicode encoding and decoding. This article continues by exploring how Python handles encoding errors. >> s1='cà phê' >> ...
OK, so long story short, I have a set of files that contain exported binary values from the windows registry. The values look something like this: hex:f3,33,02,f1,02,83,a4 [and so on, multiple lines] ...