The interesting thing is - as @rdentato
points out in his recent post - that the UTF-8 encoding plays nicely with plain old C, as long as you don't need any special properties of Unicode.
E.g. UTF-8 never introduces a null byte, so strlen works; UTF-8 never introduces ASCII bytes (all bytes used for encoding are > 0x7F), so searching for ASCII characters in a UTF-8 encoded string still works by iterating over the strings bytes; etc.
For further actions, you may consider blocking this person and/or reporting abuse
We're a place where coders share, stay up-to-date and grow their careers.
The interesting thing is - as @rdentato points out in his recent post - that the UTF-8 encoding plays nicely with plain old C, as long as you don't need any special properties of Unicode.
E.g. UTF-8 never introduces a null byte, so
strlen
works; UTF-8 never introduces ASCII bytes (all bytes used for encoding are > 0x7F), so searching for ASCII characters in a UTF-8 encoded string still works by iterating over the strings bytes; etc.