Unicode System
Unicode is a universal, international character encoding standard capable of representing most of the world's written languages.
Before Unicode, there were several encoding systems:
1. ASCII - supports English (United States).
2. ISO 8859-1 - supports Western European languages.
3. KOI-8 - supports Russian.
4. GB18030 and BIG-5 - support Chinese.
This caused two problems:
1. A particular code value corresponds to different letters in the various encoding standards.
2. The encodings for languages with large character sets have variable length: some common characters are encoded as a single byte, while others require two or more bytes.
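The first problem can be seen directly in Java. The following is a minimal sketch, not part of the original tutorial: the class name LegacyDecodeDemo and the helper decodeByte are hypothetical, and the byte value 0xE4 is chosen only for illustration. ISO-8859-1 is guaranteed on every Java platform; KOI8-R is assumed to be available on typical JDKs but is not guaranteed, so the code checks for it first.

```java
import java.nio.charset.Charset;

public class LegacyDecodeDemo {
    // Hypothetical helper: decode one byte value under the named charset.
    static String decodeByte(int value, String charsetName) {
        byte[] bytes = { (byte) value };
        return new String(bytes, Charset.forName(charsetName));
    }

    // Format the first character of s as a Unicode code point, e.g. U+00E4.
    static String describe(String s) {
        return String.format("U+%04X", s.codePointAt(0));
    }

    public static void main(String[] args) {
        int value = 0xE4; // the same code value in both encodings

        // In ISO 8859-1 this byte is a Latin letter.
        System.out.println("ISO-8859-1: " + describe(decodeByte(value, "ISO-8859-1")));

        // In KOI8-R the same byte is a Cyrillic letter (if the charset is installed).
        if (Charset.isSupported("KOI8-R")) {
            System.out.println("KOI8-R:     " + describe(decodeByte(value, "KOI8-R")));
        }
    }
}
```

Because the byte alone does not say which encoding produced it, a program has to know the encoding in advance to read the text correctly.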
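To solve these problems, a new encoding standard called Unicode was developed, which supports most of the world's languages. Unicode was originally designed as a 2-byte (16-bit) code, so Java also uses 2 bytes for a char. Unicode has since grown beyond 16 bits (code points now run up to U+10FFFF), so a Java char holds one UTF-16 code unit, and characters outside the 16-bit range are stored as a pair of char values (a surrogate pair).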
Lowest value of a Java char - \u0000
Highest value of a Java char - \uFFFF
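These limits can be checked with the standard java.lang.Character constants. The sketch below is illustrative only: the class name CharRangeDemo and the helper unitsFor are hypothetical, and the code point 0x1F600 is simply one example of a character that does not fit in a single 16-bit char.

```java
public class CharRangeDemo {
    // Hypothetical helper: how many 16-bit char units Java needs for a code point.
    static int unitsFor(int codePoint) {
        return Character.charCount(codePoint);
    }

    public static void main(String[] args) {
        // Range of a single Java char (one UTF-16 code unit).
        System.out.println((int) Character.MIN_VALUE);   // 0      (\u0000)
        System.out.println((int) Character.MAX_VALUE);   // 65535  (\uFFFF)

        // Range of Unicode itself is larger than one char can hold.
        System.out.println(Character.MAX_CODE_POINT);    // 1114111 (U+10FFFF)

        // A code point above \uFFFF is stored as two chars (a surrogate pair).
        String s = new String(Character.toChars(0x1F600));
        System.out.println(s.length());                      // 2 char units
        System.out.println(s.codePointCount(0, s.length())); // 1 character
    }
}
```

When text may contain such characters, counting with codePointCount rather than length gives the number of actual characters.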