Topic: A note on handling UTF-8 texts
Regarding the post with id 240:
Maybe it is worth to mention that if you have the config
clean_ms_char = 1
turned on, this will destroy korean or japanese characters, despite the fact that the text is properly UTF-8 encoded.
As soon as you turn this config off, everything works as expected.