Sample Unicode Test Pages and Script Links



Some of the following links point to other pages on this web site, and others link directly to other sites. The pages on this web site are (mostly) provided in pairs. The first link in each pair is for a page encoded with decimal numeric character references (decimal NCR format). The second link is for the page encoded in UTF-8 format, with the UTF-8 charset tag in the file header.
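As a rough illustration of the two formats used for the paired pages, here is a minimal Python sketch. The helper name `to_decimal_ncr` and the exact wording of the header tag are assumptions made for this example; they are not taken from the test pages themselves.

```python
import html


def to_decimal_ncr(text: str) -> str:
    """Return an ASCII-only string with non-ASCII characters as decimal NCRs."""
    # The "xmlcharrefreplace" error handler emits &#NNNN; for every character
    # that the target codec (ASCII here) cannot represent.
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")


# Assumed form of the "UTF-8 tag in the file header" for the UTF-8 pages.
UTF8_META = '<meta http-equiv="Content-Type" content="text/html; charset=utf-8">'

sample = "БЕЛАРУСЬ"
ncr_form = to_decimal_ncr(sample)    # plain ASCII; no charset declaration needed
utf8_form = sample.encode("utf-8")   # raw bytes; relies on the charset tag

print(ncr_form)
print(utf8_form)
# A browser (or html.unescape) turns the NCRs back into the original text.
print(html.unescape(ncr_form) == sample)
```

The NCR version survives any ASCII-compatible transport unchanged, while the UTF-8 version is more compact for non-Latin text but depends on the browser honoring the declared encoding.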

(This web page is from my old web site. Many of the external links no longer work. Updates are in progress...)




LINKS OF INTEREST:

Specific Languages: A B C D E F G H I J K L M N O P Q R S T U V W X Y Z

Multiple Language Resources | Unicode Editors | Unicode-based Fonts | Unicode Resources | Search Engine—the Next Generation

ARABIC
http://www.ayna.com/help/index.utf8.html


BASSA LANGUAGE / VAH SCRIPT

Visit Varnie Karmo's site to learn about the Bassa people of Liberia and Sierra Leone. There is currently a freeware beta TTF for the Vah script available for download. There is now an on-line dictionary in the works!

http://www.ie-inc.com/vkarmo/index.html


BELARUS - БЕЛАРУСЬ

The Belarusian Language: Using Unicode (UTF-8) Encoding by Peter Kasaty. Site includes charts of the alphabet as well as Belarusian poetry with English translations and much more.

http://www.belarus-misc.org/unicode.htm (Unicode UTF-8 format)


BENGALI - বাঙালী
http://lekho.sourceforge.net/banglapage.html (Unicode UTF-8 format)


BOPOMOFO - ㄅㄆㄇㄈ
Bopomofo Test. (Unicode decimal NCR format)
Bopomofo Test. (Unicode UTF-8 format)


CHEROKEE - ᏣᎳᎩ
Tsalagi Test and Resource Link. (Unicode decimal NCR format)
Tsalagi Test and Resource Link. (Unicode UTF-8 format)


CHINESE
The Chinese Radicals. (Unicode UTF-8 format)


CIRTH (Unofficial Encodings)
Cirth.


CONVERSION TOOLS (Encoding Conversion)
http://www.sil.org/nrsi/teckit/


CZECH
Some Handy Czech Phrases. (Unicode decimal NCR format)
Some Handy Czech Phrases. (Unicode UTF-8 format)


DANISH
Birger Langkjers hjemmeside. (Includes some useful Unicode-related links.)


DARI (Persian, Eastern Farsi, Parsi)
http://www.afghanpedia.com/ (Unicode UTF-8 format)

Learn about the history and culture of the region. This site has UTF-8 format content in the following languages: Dari (Persian, Farsi, Parsi), Pashto, Ozbeky, Turkmany, English, and French.


DESERET - 𐐜 𐐔𐐇𐐞𐐊𐐡𐐇𐐓 𐐈𐐢𐐙𐐊𐐒𐐇𐐓
The Deseret Alphabet (Unicode decimal NCR format)


EDEN'S PAGE: SCRIPTS OF ALL OF ASIA
http://www.geocities.com/Athens/Academy/9594/index.html

Please visit this award-winning site. This site was formerly Scripts of all India, and it has grown and is still growing! A superb reference, rich with readable graphics.


ENGLISH
Does Your Browser Support English? (Unicode UTF-8 format)


ESPERANTO
Esperanto Test and Resource Links. (Unicode decimal NCR format)
Esperanto Test and Resource Links. (Unicode UTF-8 format)


ETHIOPIAN - የኢትዮጵያ ፊደል
"http://news.com.et/" ወደ መነሻው ገጽ (Ethiopian News Headlines) የሣምንቱ ዜናዎች
This link appears to be down as of early September 2006.


ETRUSCAN - ‮𐌓𐌀𐌔𐌍𐌀‬ - and other scripts
Tex Texin's Plane One page (Unicode NCR version)
...based on his Unicode Example Celebrity List
Sorin Paliga's Old Italic keyboard for Mac


GOTHIC
Gothic Test and Resource Links. (Unicode decimal NCR format)


GREEK - ΕΛΛΗΝΙΚΆ
Greek Test and Resource Link. (Unicode decimal NCR format)
Greek Test and Resource Link. (Unicode UTF-8 format)


INDIC SCRIPTS
http://www.tamil.net/people/sivaraj/links.html (Links Related to Indian Language Computing)


INPUT
http://people.w3.org/rishida/scripts/pickers/ Script pickers from Richard Ishida

Unicode Input - Some old on-line Java-based screen keyboards plus links to other resources.

Andrew C. West's freeware BabelMap program for Windows.


IPA - ɪntəˈnæʃənəl fəˈnɛtɪk
http://www.phon.ucl.ac.uk/home/wells/ipa-unicode.htm (The International Phonetic Alphabet in Unicode)


JAPANESE - 日本語
Konnichiwa Around the World. (Unicode decimal NCR format)
Konnichiwa Around the World. (Unicode UTF-8 format)


KAYAH LI - ꤊꤢ꤬ꤛꤢ꤭ ꤜꤟꤤ꤬
Kayah Li Test and Resource Link. (Unicode UTF-8 format)


KLALLAM - nəxʷsƛʼáyəm
http://www.elwha.org/language.htm (Graphics)


KLINGON -  -  
Klingon Test and Resource Link. (Unicode decimal NCR format)
Klingon Test and Resource Link. (Unicode UTF-8 format)


KOREAN - 한글
Korean Test. (Unicode decimal NCR format)
Korean Test. (Unicode UTF-8 format)
http://jshin.net/i18n/korean/hunmin.html (Unicode UTF-8 format) - Test page for Hangul Jamo range by Jungshik Shin.
Making Home Pages in Korean (In Japanese, Shift_JIS)


LAO - ລາວ
Lao Test Page from Tavultesoft. (Unicode UTF-8 format)
Note: this page was not available as of 2006/09/05.


MĀORI
Māori Test. (Unicode decimal NCR format)
Māori Test. (Unicode UTF-8 format)


MATH
Math Test. (Unicode decimal NCR format)


NAVAJO

Navajo Test. (Unicode UTF-8 format)


NEPALI

http://www.nepali.info/nepali/ (Unicode UTF-8 format)


NUMBERS - 1٢੩四൫Ⅵ௭೮୯
A Table of Numbers. (Unicode decimal NCR format)
A Table of Numbers. (Unicode UTF-8 format)


OGHAM - ᚁᚓᚈᚆ ᚂᚒᚔᚄ ᚅᚔᚑᚅ
Ogham Test and Resource Link. (Unicode decimal NCR format)
Ogham Test and Resource Link. (Unicode UTF-8 format)


OLD PERSIAN CUNEIFORM
Behistun Inscription Portion (Unicode decimal NCR format)


PHAGS-PA
BabelStone : Phags-pa Script

Andrew West has released the first Unicode font for the Phags-pa script. The above linked page has a link for the font download. This page (and other pages linked therein) contains a wealth of information about this fascinating historic script.


PHOENICIAN
A Bequest Unearthed - Phoenicia

A huge site full of interesting information about Phoenicia and its heritage, including pages about the language, the Phoenician script, and the lore of this fascinating culture of historic importance.


PHOENICIAN (MOABITE)
The Moabite Stone. (Unicode UTF-8 format)


PUNJABI - ਪੰਜਾਬੀ - (Poetry by ਪ੍ਰੀਤਮ ਸਿਘ ਧੰਜਲ)
http://www.dhanjal.com/ (Graphics)
Punjabi Computing Resource Centre Unicode 4.0 Gurmukhi Unicode font download!


RUNIC - ᚠᚢᚦᚫᚱᚴ
The Björketorp Inscription. (Unicode decimal NCR format)
The Björketorp Inscription. (Unicode UTF-8 format)


RUSSIAN - РУССКИЙ
Some Phrases In Russian. (Unicode decimal NCR format)
Some Phrases In Russian. (Unicode UTF-8 format)


SOUTHERN TUTCHONE
Southern Tutchone Orthography Example (Unicode decimal NCR format)
Southern Tutchone Orthography Example (Unicode UTF-8 format)


SYRIAC
Sample Syriac Text (Unicode decimal NCR format)


TAMIL - தமிழ்
Sample Tamil Text (Unicode decimal NCR format)
Sample Tamil Text (Unicode UTF-8 format)


TENGWAR (Unofficial encodings)
Tengwar Test and Links


THAANA - ަނ​ާތ  ިޅ​ުބ​ަގ
Sample Thaana Text and Resource Links. (Unicode UTF-8 format)


THAI - ไทย
Title Page from English-Thai Dictionary. (Unicode decimal NCR format)
Title Page from English-Thai Dictionary. (Unicode UTF-8 format)


UCAS - as used in ᐅᒥᐅᔭᕐᒃ, ᓄᓇᕕᒃ and other places.
- Unified Canadian Aboriginal Syllabics

Ronald Ogawa has a very nice font currently available as beta-test freeware which includes Latin, Cyrillic, Greek and the UCAS. The UCAS are used for writing languages such as Cree, Naskapi, Ojibwe, and Inuktitut. The font is called "Ballymun RO" and is available at: http://nexus.brocku.ca/rogawa/ucas

There are sample UCAS documents available on his server's home page: http://nexus.brocku.ca for Cree and Inuktitut, as well as test pages for Gaelic.


VIETNAMESE - VIỆT
Giống Kiến Lửa Du Nhập (Unicode decimal NCR format)
Giống Kiến Lửa Du Nhập (Unicode UTF-8 format)
http://www.vovisoft.com/ (Unicode decimal NCR format) Trụ Sở Vovisoft has articles on computing in the Vietnamese language, even tutorials for Visual Basic!


YI
http://www.babelstone.co.uk/Yi/index.html Andrew C. West's Yi Pages


YIDDISH
http://www.uyip.org/unicode Understanding Yiddish Information Processing with Unicode


YORÙBÁ
Yorùbá Example and Resource Link. (Unicode decimal NCR format)
Yorùbá Example and Resource Link. (Unicode UTF-8 format)



Multiple Language Resources


AIYONG MULTILINGUAL
http://www.ask.ne.jp/~shumei/aiyong-e.html Site uses many graphics so some pages do not load quickly, but worth the wait. In addition to both free and commercial translations, the site offers basic conversational phrases in 31 languages. (Script and sound!)


ÉCRITURES DU MONDE
http://www.culture.gouv.fr/edm/fr/


THE FOUR ESSENTIAL TRAVEL PHRASES
http://www.travelphrases.info/ Four phrases every traveller should know translated into many languages and dialects.


HOT PEACH PAGES/EARTHWORDS
http://www.hotpeachpages.net/lang/index.html Over 20 different Unicode ranges used, including Greek, Cyrillic, Hebrew, Arabic, Devanagari, Bengali, Gurmukhi, Gujarati, Tamil, Thai, Lao, Georgian, Ethiopic, Unified Canadian Aboriginal Syllabics and CJK ranges.


INTERNATIONAL GLOSSARY OF HYDROLOGY
http://www.cig.ensmp.fr/~hubert/glu/aglo.htm Pierre Hubert is working on a project to provide the International Glossary of Hydrology in HTML/Unicode format. The original book, published by UNESCO and the World Meteorological Organization, contained terms in English, Spanish, French, and Russian. With assistance from diverse colleagues, he has added several languages to the glossary. All of the encoding on the site is Unicode, with the exception of the Hindi pages.


JENNIFER'S LANGUAGE LINKS
http://www.elite.net/~runner/jennifers/index.htm Comprehensive Linguistic Links


LANGUAGES ON THE WEB
http://www.languages-on-the-web.com/ Thousands of links. Well organized. Site does not use fancy graphics or JavaScript, so pages load quickly. You can find links to on-line instruction and on-line dictionaries, as well as references on specific languages and the cultures of their speakers.


TEX TEXIN

Tex Texin's Unicode Example Celebrity List


TITUS
- Unicode Support/Testing with Multilingual Sample Pages.

http://titus.uni-frankfurt.de/unicode/unitest.htm



Unicode Editors


Now you can create Unicode documents for the World Wide Web with Sharmahd Computing's UniPad. Just create a file with UniPad, save it as UTF-8, and give the file the extension .htm. You can then read the document in Internet Explorer 5.0 (with "View"-"Encoding"-"Unicode (UTF-8)" selected). (Naming a file *.htm doesn't make it an HTML file, though! Some mark-up may be required, such as the <BR> tag...)

UniPad is a commercial product; a limited freeware version is also offered.
http://www.unipad.org
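
The same idea (UTF-8 text plus a little mark-up, saved with an .htm extension) can be sketched in a few lines of Python. This is only an illustration and has nothing to do with how UniPad works internally; the function name `build_utf8_page`, the sample text, and the output file name are all made up for the example.

```python
import html
import tempfile
from pathlib import Path
from typing import List


def build_utf8_page(title: str, lines: List[str]) -> str:
    """Wrap plain text lines in minimal HTML mark-up with a UTF-8 header tag."""
    # Escape &, < and > so the text cannot be mistaken for mark-up, then
    # separate the lines with <BR> tags as the paragraph above suggests.
    body = "<BR>\n".join(html.escape(line) for line in lines)
    return (
        '<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">\n'
        "<HTML><HEAD>\n"
        '<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=utf-8">\n'
        f"<TITLE>{html.escape(title)}</TITLE>\n"
        "</HEAD><BODY>\n"
        f"{body}\n"
        "</BODY></HTML>\n"
    )


page = build_utf8_page("Sample page", ["ΕΛΛΗΝΙΚΆ", "日本語"])

# Hypothetical output location; the bytes on disk match the declared charset.
out_file = Path(tempfile.mkdtemp()) / "sample.htm"
out_file.write_bytes(page.encode("utf-8"))

print(out_file.read_bytes().decode("utf-8") == page)
```

With the META tag in place, the browser should not need the encoding to be selected by hand from the menu.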


Andrew C. West's BabelPad is currently freeware. It is for Windows platforms. Features include the ability to use any installed TrueType/OpenType font along with several extremely useful input methods for eastern scripts. The editor also uses the system's complex script shaping engine, which means that complex scripts can be displayed correctly in a plain-text editor. BabelPad includes complete support for Unicode 4.0, including characters beyond the Basic Multilingual Plane. This editor represents a tremendous amount of work and is most welcome.

http://www.babelstone.co.uk/Software/BabelPad.html
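
To make "beyond the Basic Multilingual Plane" concrete, here is a small Python sketch (not related to BabelPad's own code) comparing a Cyrillic letter with a Deseret letter. The helper name `describe` is an assumption for this example. Characters above U+FFFF need four bytes in UTF-8 and a surrogate pair (two 16-bit units) in UTF-16.

```python
def describe(ch: str) -> dict:
    """Summarize how a single character is stored in UTF-8 and UTF-16."""
    cp = ord(ch)
    return {
        "code_point": f"U+{cp:04X}",
        "beyond_bmp": cp > 0xFFFF,                        # outside Plane 0
        "utf8_bytes": len(ch.encode("utf-8")),
        "utf16_units": len(ch.encode("utf-16-be")) // 2,  # 16-bit code units
    }


# U+0411 is a Cyrillic capital letter; U+10414 is a Deseret capital letter.
for ch in ("\u0411", "\U00010414"):
    print(describe(ch))
```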



Unicode-based Fonts


In addition to the Ballymun RO font mentioned above (under UCAS), and the Code2000 font available on my home page...
Herman Miller creates fictional and experimental languages. He has a freeware Unicode-based font called Thryomanes available at his website:

http://www.io.com/~hmiller/

Thryomanes includes all of the extended Latin characters. Try viewing the Vietnamese test page referenced above with Thryomanes.

Thryomanes is currently archived on his "Languages" page along with several other interesting freeware TTFs.
Junicode, a Unicode-based font for medievalists:

http://www.engl.virginia.edu/OE/junicode/junicode.html
Cardo, a Unicode-based font with OpenType tables for Hebrew and Latin!

http://scholarsfonts.net/
WAZU JAPAN's Gallery of Unicode Fonts

An extensive guide to Unicode fonts.

Unicode Resources


For more information about Unicode, please visit "Fonts for the Unicode Character Set", maintained by Nelson H. F. Beebe at the University of Utah.

http://www.math.utah.edu/~beebe/fonts/unicode.html

There is also an abundance of useful information on his home page:

http://www.math.utah.edu/~beebe/

...as well as the Unicode-related bibliography at:

http://www.math.utah.edu/pub/tex/bib/index-table-u.html#unicode

...which he maintains as part of the TeX User Group Bibliography Archive.
Another excellent Unicode resource:

http://www.alanwood.net/unicode/

Please visit Alan Wood's website.

There are Unicode charts with character names as well as clear, instructive information about the background and implementation of Unicode, including tips about using Unicode in several applications.

Unicode Test Pages:
Be sure to follow this link to the SAMPA home page for additional browser/Unicode tests.

http://www.phon.ucl.ac.uk/home/sampa/unicodetest.htm

Unicode Test Pages:

СЛОВО - SLOVO (Christoph Singer)

This site tells you how to make your PC write in Slavic, Cyrillic, and East Central European languages.

Эти страницы содержат информацию о том, как настроить персональный компьютер для работы с русским языком и другими славянскими языками, в том числе старославянским и дореволюционным русским, и где можно найти шрифты и полезные для многоязычной работы программы в Интернете.

http://www.ccss.de/slovo/index.html

Unicode Information in Vietnamese:

This page from Trụ Sở Vovisoft explains Unicode and shows how to enable Vietnamese support in various software applications. The site provides a link to the new version of the Tahoma font, which supports Vietnamese requirements.

Unicode cho chữ Việt

http://www.vovisoft.com/vovisoft/UnicodeChoVN.htm

International Commerce:
Multilingual Websites and Foreign Language E-commerce

Plenty of practical information here including well-written overviews with titles such as: "How to Get Started in Multilingual Web E-commerce"

http://shoppingforbargains.com/xml.htm
...and, be sure to visit their home page.

Unicode Test Pages:
Test pages from Richard Ishida as well as charts
comparing features of various scripts:

http://www.xerox-emea.com/globaldesign/scripts/scriptsamples.htm

Unicode Test Pages:
Test pages and charts from MauveCloud
which also include a handy romaji to kana converter page!

http://www.mauvecloud.net/charsets

Unicode Test Page:
Here is a quick Unicode test page from Robert Parker:

http://ccwf.cc.utexas.edu/~robp/testcz.htm

Unicode Test Page:
Here is the Unicode Test Page from EHC:

Unicode によるハングル•日本語混在のテストページ (Test page mixing Hangul and Japanese using Unicode)

Unicode Test Page:
Test pages from Paul Johnston:

http://lismore.ccl.umist.ac.uk/paulj/unitest.html

Unicode Test Page:
Test page from Kermit, the Terminal Emulation folks:

http://www.columbia.edu/kermit/utf8.html

Unicode Test Page:
Test page from Katolika Lingvolaboristo:

http://pages.hotbot.com/politics/zjw/images/utftest.html

Unicode Test Page:
Test page from ThreeWeb:

http://www.threeweb.ad.jp/logos/mlweb/allutf8.html

Unicode Test Page:
Test page from Maribyrnong Library:

http://library.maribyrnong.vic.gov.au/utf8/

Unicode Test Page:
Test page from Тверской Государственный Университет
( UTF-7 format )

http://homepages.tversu.ru/~susov/unicodetest.htm

Unicode Test Page:
Peter Kleiweg has Unicode charts for fixed width fonts
under 'Manuals'

http://odur.let.rug.nl/~kleiweg/index.html

Unicode Test Page:

http://www.eleves.ens.fr:8080/home/madore/misc/unitest/


Search Engine - The Next Generation


Bjondi International has developed a new web site called Skworm. It promises to be different from anything else available.
Visit their home page and phrase your own search at:
http://www.skworm.com


⇑ My e-mail. ⇑

My name is James Kass. Let me know if you have any suggestions for additions to this page.

My home page