missing messages when DC ships bad utf8 #20
We're dealing with raw bytes almost everywhere. It's assumed these are UTF-8 strings.
When DC sends us a message that's not valid UTF-8 (e.g. today, a pasted em-dash in Windows-1252 encoding(?)), TG silently(?) drops the message.
We should either
Attempt to parse all incoming messages as UTF-8. If parsing fails, replace each invalid byte sequence with U+FFFD REPLACEMENT CHARACTER.
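A minimal sketch of that replacement approach, in Python purely for illustration (the bridge's actual language and message-handling code aren't shown here, and `decode_lossy` is a hypothetical name). It relies on the standard `bytes.decode` with `errors="replace"`, which substitutes U+FFFD for undecodable bytes instead of raising:

```python
def decode_lossy(raw: bytes) -> str:
    """Decode raw bytes as UTF-8, substituting U+FFFD for invalid sequences.

    Hypothetical helper: assumes incoming messages arrive as raw bytes.
    """
    return raw.decode("utf-8", errors="replace")


# Windows-1252 encodes an em dash as the single byte 0x97.
# On its own, 0x97 is a stray continuation byte and not valid UTF-8.
bad = b"foo \x97 bar"
text = decode_lossy(bad)

# Valid UTF-8 passes through unchanged.
good = decode_lossy("héllo".encode("utf-8"))
```

With this, the message would still be forwarded, with the unreadable byte shown as a replacement character, rather than being dropped.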