Thursday, August 5, 2010

Encoding: Convert between character sets with iconv

The Linux shell command iconv converts the contents of a file from one character encoding to another.

If you know exactly which encoding the file is in:

iconv --from-code=LATIN1 --to-code=UTF8 -c --output=~/test/xml/input_clean.xml ~/test/xml/input.xml

The "-c" option tells iconv to drop any characters that cannot be represented in the target encoding. Without it, iconv stops with an error at the first such character it encounters.
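To see what "-c" changes, here is a small sketch you can run in a scratch directory. The file names and the sample word are made up for illustration, and it uses the short options -f, -t and -c in place of --from-code and --to-code. It assumes a typical Linux iconv; other implementations may report errors differently.

```shell
# Work in a throwaway directory (all file names here are hypothetical).
tmpdir=$(mktemp -d)

# "café" in Latin-1: the é is the single byte 0xE9 (octal 351).
printf 'caf\351\n' > "$tmpdir/input.txt"

# Latin-1 to UTF-8: every Latin-1 byte has a UTF-8 equivalent, so nothing is lost.
iconv -f LATIN1 -t UTF-8 "$tmpdir/input.txt" > "$tmpdir/utf8.txt"
utf8_bytes=$(wc -c < "$tmpdir/utf8.txt")

# UTF-8 to ASCII without -c: é has no ASCII equivalent, so iconv reports an error.
if iconv -f UTF-8 -t ASCII "$tmpdir/utf8.txt" > "$tmpdir/strict.txt" 2>/dev/null; then
    strict_status=0
else
    strict_status=$?
fi

# Same conversion with -c: the unconvertible character is dropped.
# Some iconv builds still exit non-zero here, hence the "|| true".
iconv -c -f UTF-8 -t ASCII "$tmpdir/utf8.txt" > "$tmpdir/lossy.txt" || true
lossy=$(cat "$tmpdir/lossy.txt")

echo "strict exit status: $strict_status"
echo "with -c: $lossy"

rm -rf "$tmpdir"
```

Note that "-c" discards data silently, so it is worth comparing the input and output before throwing the original file away.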
