]> Kevux Git Server - fll/commit
Update: finish implementing f_utf_character_is_valid() and related UTF-8 changes
authorKevin Day <thekevinday@gmail.com>
Wed, 18 Sep 2019 00:09:44 +0000 (19:09 -0500)
committerKevin Day <thekevinday@gmail.com>
Wed, 18 Sep 2019 00:09:44 +0000 (19:09 -0500)
commit0b913142dff7ac3a3a968b1b138acc7af9f1e5d6
tree41280fd78585e4525229400631064e2df5e07fe0
parent1804dfe3d010684a1181cc39702158e2b1322616
Update: finish implementing f_utf_character_is_valid() and related UTF-8 changes

UTF-8 BOM is actually not a thing but only a suggestion, see RFC 3629.
I consider it a very bad practice now that I have learned that it is also the zero width space.
Get rid of the UTF-8 BOM support, it is a bad idea and is not to be supported by this project.

The referenced rfc also provides an easier way to view the valid ranges that my previous resources (such as wikipedia).
This helped me finish this function.

Updated byte_dump to better utilize this and to remove no longer necessary code.
Fix an accidental incorrect "invalid detection" check use before calling f_utf_character_is_valid() in byte_dump.
Explicitly print a "." or " " for UTF-8 control characters (ASCII control characters are already handled before this point so it is safe to call f_utf_character_is_control()).
level_0/f_fss/c/fss.h
level_0/f_utf/c/utf.c
level_0/f_utf/c/utf.h
level_1/fl_console/c/console.h
level_1/fl_fss/c/fss.c
level_1/fl_fss/c/fss.h
level_1/fl_string/c/string.h
level_1/fl_utf/c/utf.h
level_3/byte_dump/c/byte_dump.h
level_3/byte_dump/c/private-byte_dump.c
level_3/byte_dump/c/private-byte_dump.h