-
Front end to several search engines and portals that allows you to enter queries in various character sets.
-
A concise history of the development of character encoding in Western and East Asian languages, including ASCII, EBCDIC, Unicode and TRON.
-
A comparison of two of these two basic encoding systems, with tables.
-
Covers the beginnings of the ASCII standards from ASCII-1963 onwards and information on Cyrillic, Japanese, Korean, Thai and Vietnamese encoding systems, including various localized versions of EBCDIC. With tables and links to other resources.
-
A wide range of articles on Unicode, East Asian localization and Internationalization issues.
-
A tutorial on character code issues in digital processing and transfer of text data, on the Internet or otherwise. Includes tables and a detailed listing of control codes. In English and Finnish.
-
A character set conversion component for Unicode, Japanese, Chinese, Korean, Cyrillic, Arabic, Hebrew, Thai, Vietnamese and all Western languages.
-
Hints and tips about character sets and fonts in web development. Includes links to related resources.
-
Specifies the structure of ECMA-35, for 8-bit codes and 7-bit codes which provide for the coding of character sets, with a detailed PDF document.
-
Query character sets, encoding, codepages and Unicode information in an easy-to-use web form. Held at the Institute of the Estonian Language.
-
Chapter covering document character sets and encodings in HTML from the World Wide Web Consortium's HTML 4.0 Specification.
-
How to validate HTML documents in various character encodings.
-
The official names for character sets that may be used in the Internet and referred to in Internet documentation - held at the Internet Assigned Number Authority.
-
The standard names for use in SGML and XML, including a complete list of language name codes.
-
Codetables for ISO 8859-6, ASMO 449 plus, ASMO 708 (Arabic) and ISO 8859-8 (Hebrew) and further information about the company's work in multilingual UNIX.
-
A review of the HTML authoring problems caused by some special characters which belong to MS Windows character set but not to ISO Latin 1. Includes technical details and substitution tables. In English and Finnish.
-
Mirror of Roman Czyborra's work on character sets and encoding systems. In English and German.
-
Pennsylvania State University's guide to reading and publishing different languages on the web. Includes details of various encoding systems and links.
-
A tutorial that explains HTML character sets, character encodings and character references from Webreference.com.
-
Quick reference and searchable ASCII code and conversion tables.
-
Covers code tables, Unicode, HTML and XML and links to other resources and discusses internationalization and localization issues relating to character sets.
-
A library for Windows developers that allows applications to encode binary data and files into text and vice-versa.