Character Sets - Unicode Conversion Chart

ADVERTISEMENT

Character Sets – Unicode - Reading
R 47
The older character sets of ASCII and extended ASCII use 7 and 8 bits. Unicode is another character
set that maps binary combinations to characters.
The problem with ASCII is that it has a maximum of 128 characters that it can store. Japanese people
need to know over 2000 characters and the Chinese alphabet contains around 50 000 characters.
Therefore the character sets need to have more bits to store them. Unicode is an international
system of storing these characters.
Unicode currently stores over 100 000 characters. There is a 16 bit (2 byte) version of Unicode which
16
32
contains 65536 (2
) characters. A 32 bit (4 byte) version can store over 4 billion (2
) characters, far
more than required for every language in the world.
Unicode uses the same codes as ASCII from 0~127.
The following table shows a sample of characters available in Unicode. The black lines in the table
show sections of the character set which have been left out.
Binary
Hex
Decimal
Char
Binary
Hex
Decimal
Char
Basic Latin
0000 0000 0100 0100
0044
68
D
0000 0000 0010 0000
0020
32
SPACE
0000 0000 0100 0101
0045
69
E
0000 0000 0010 0001
0021
33
!
0000 0000 0100 0110
0046
70
F
0000 0000 0010 0010
0022
34
0000 0000 0100 0111
0047
71
G
Basic Latin
0000 0000 0100 1000
0048
72
H
0000 0000 0010 1100
002C
44
,
Basic Latin - Lowercase
0000 0000 0010 1101
002D
45
-
0000 0000 0110 0001
0061
97
a
0000 0000 0010 1110
002E
46
.
0000 0000 0110 0010
0062
98
b
Basic Latin - Numeric
0000 0000 0110 0011
0063
99
c
0000 0000 0011 0000
0030
48
0
0000 0000 0110 0100
0064
100
d
0000 0000 0011 0001
0031
49
1
Greek - Lowercase
0000 0000 0011 0010
0032
50
2
0000 0011 1011 0001
03B1
945
0000 0000 0011 0011
0033
51
3
0000 0011 1011 0010
03B2
946
0000 0000 0011 0100
0034
52
4
0000 0011 1011 0011
03B3
947
0000 0000 0011 0101
0035
53
5
Japanese – Hiragana
0000 0000 0011 0110
0036
54
6
0011 0000 0110 1001
3069
12393
0000 0000 0011 0111
0037
55
7
0011 0000 0110 1010
306A
12394
0000 0000 0011 1000
0038
56
8
0011 0000 0110 1011
306B
12395
0000 0000 0011 1001
0039
57
9
Arabic
Basic Latin - Uppercase
1111 1100 0010 1000
FC28
64552
0000 0000 0100 0001
0041
65
A
1111 1100 0010 1001
FC29
64553
0000 0000 0100 0010
0042
66
B
1111 1100 0010 1010
FC2A
64554
0000 0000 0100 0011
0043
67
C
Question: What is the binary in Unicode that represents the letter D?
Answer: 0000 0000 0100 0100
Question: What letter does the hexadecimal 3069 represent in Unicode?
Answer:
Produced by
7

ADVERTISEMENT

00 votes

Related Articles

Related forms

Related Categories

Parent category: Life
Go
Page of 3