Unicode / ASCII
Unicode Character Table / Miscellaneous Symbols and Pictograph
Emoji
Usefull char
- star emoji - β β β β β
- Check mark - Tick symbol - β β
- line - βββ¬ β β
- circle - β¦Ώ β β‘ β’ β£ β€ β₯ β¦ β§ βΆ β© πβ π« β½
- square - π° π
- π ±π Ύππ ΄π ³
- β±ancy - β³
- Braille β ³ β ΄ β ΅ β Ά β ·
- Arrow β β β β
- Battery π πͺ« / power β» βΌ β β β‘ π π‘π¦
- Temperature π‘
- Droplet π§
- House βπ‘
- β‘ Bourbaki dangerous bend symbol / HN
- robot π€
- table or grid αααα
- computer πΎ β¨ πΏ π½ π¨ π» π₯ π³ π₯οΈ
- Link π
- pencils π π π ποΈ βοΈ βοΈ β π
UTF-8, UTF-16, and UTF-32 ?
Stick to UTF-8 and these three character sets - coding policy for software devlopment
Thatβs one of the great features of UTF-8. You can move forwards and backwards through a UTF-8 string without having to start from the beginning. - UTF-8 is a brilliant design
UTF-8 has an advantage in the case where ASCII characters represent the majority of characters in a block of text, because UTF-8 encodes these into 8 bits (like ASCII). It is also advantageous in that a UTF-8 file containing only ASCII characters has the same encoding as an ASCII file.
UTF-16 is better where ASCII is not predominant, since it uses 2 bytes per character, primarily. UTF-8 will start to use 3 or more bytes for the higher order characters where UTF-16 remains at just 2 bytes for most characters.
UTF-32 will cover all possible characters in 4 bytes. This makes it pretty bloated. I canβt think of any advantage to using it.
ASCII
Four Column ASCII / HN
Remember that ASCII is a 7 bit encoding. Letβs say the following:
- The first two bits denote the group of the character (2^2 so 4 possible values)
- The remaining five bits describe a character (2^5 so 32 possible values)
| 00 | 01 | 10 | 11 | 5last |
|---|---|---|---|---|
| NUL | Spc | @ | ` | 00000 |
| SOH | ! | A | a | 00001 |
| STX | β | B | b | 00010 |
| β¦ | Β | Β | Β | Β |