When we are using Unicode database, and if we are say in chinese language then each character takes 3 bytes of storage and similarly for english-US (en-US) it takes 1 bye of storage.
I want to know how many bytes are taken by each character in different languages. A comprehensive list would be more helpful for documenting purpose and development purpose for me.
I did some checking, and I dont think your going to find out how many bytes each character takes up in all languages. but if you did each of the main languages this would be the faster way of doing it. I hope this following link helps you.
http://www.isoc.org/inet97/proceedings/E1/E1_2.HTM
Unicode using more bytes is not language dependent but the implementation, generally RDBMS(relational database management systems) Unicode is UTF16 which is actually 4bytes instead of three, but there is UCS which uses less bytes but you could run into character
conversion issues with languages like Chinese or Japanese. Hope this helps.
pmd_Areef
Member
210 Points
42 Posts
Unicode DataBase
Dec 02, 2005 05:09 PM|LINK
Hi Guys,
When we are using Unicode database, and if we are say in chinese language then each character takes 3 bytes of storage and similarly for english-US (en-US) it takes 1 bye of storage.
I want to know how many bytes are taken by each character in different languages. A comprehensive list would be more helpful for documenting purpose and development purpose for me.
Can anyone tell me where can I get this list.
Thanks and Regards,
Areef.
Blake05
Contributor
2561 Points
511 Posts
Re: Unicode DataBase
Dec 03, 2005 01:10 AM|LINK
I did some checking, and I dont think your going to find out how many bytes each character takes up in all languages. but if you did each of the main languages this would be the faster way of doing it. I hope this following link helps you.
http://www.isoc.org/inet97/proceedings/E1/E1_2.HTM
Blog - Website: windowscoding.com
Caddre
All-Star
26581 Points
5308 Posts
Re: Unicode DataBase
Dec 05, 2005 11:49 AM|LINK
Gift Peddie