Sample Unicode Test Pages and Script Links
Some of the following links are to other pages at this web site and others
are direct links to other sites. The pages on this web site are (mostly) provided
in pairs. The first link will be for a page encoded in decimal NCR format.
The second link is for pages encoded in UTF-8 format with the UTF-8 tag in
the file header.
(This web page is from my old web site. Many of the external links no longer
work. Updating in process...)
LINKS OF INTEREST:
ARABIC
http://www.ayna.com/help/index.utf8.html
BASSA LANGUAGE / VAH SCRIPT
Visit Varnie Karmo's site to learn about the Bassa people of Liberia and
Sierra Leone. There is currently a freeware beta TTF for the Vah script
available for download. There is now an on-line dictionary in the works!
http://www.ie-inc.com/vkarmo/index.html
BELARUS - БЕЛАРУСЬ
The Belarusian Language: Using Unicode (UTF-8) Encoding by Peter Kasaty.
Site includes charts of the alphabet as well as Belarusian poetry with
English translations and much more.
http://www.belarus-misc.org/unicode.htm. (Unicode UTF-8 format)
BENGALI - বাঙালী
http://lekho.sourceforge.net/banglapage.html (Unicode UTF-8 format)
BOPOMOFO - ㄅㄆㄇㄈ
Bopomofo Test. (Unicode decimal NCR format)
Bopomofo Test. (Unicode UTF-8 format)
CHEROKEE - ᏣᎳᎩ
Tsalagi Test and Resource Link. (Unicode decimal NCR format)
Tsalagi Test and Resource Link. (Unicode UTF-8 format)
CHINESE
The Chinese Radicals. (Unicode UTF-8 format)
CIRTH (Unofficial Encodings)
Cirth.
CONVERSION TOOLS (Encoding Conversion)
http://www.sil.org/nrsi/teckit/
CZECH
Some Handy Czech Phrases. (Unicode decimal NCR format)
Some Handy Czech Phrases. (Unicode UTF-8 format)
DANISH
Birger Langkjers hjemmeside. (Includes some
useful Unicode-related links.)
DARI (Persian, Eastern Farsi, Parsi)
http://www.afghanpedia.com/ (Unicode UTF-8 format)
Learn about the history and culture of the region. This site has UTF-8 format content
in the following languages: Dari (Persian, Farsi, Parsi), Pashto, Ozbeky, Turkmany,
English, and French.
DESERET - 𐐜 𐐔𐐇𐐞𐐊𐐡𐐇𐐓 𐐈𐐢𐐙𐐊𐐒𐐇𐐓
The Deseret Alphabet (Unicode decimal NCR format)
EDEN'S PAGE: SCRIPTS OF ALL OF ASIA
http://www.geocities.com/Athens/Academy/9594/index.html
Please visit this award-winning site. This site was formerly Scripts of all India, and it has
grown and is still growing ! A superb reference rich with readable graphics.
ENGLISH
Does Your Browser Support English? (Unicode UTF-8 format)
ESPERANTO
Esperanto Test and Resource Links. (Unicode decimal NCR format)
Esperanto Test and Resource Links. (Unicode UTF-8 format)
ETHIOPIAN - የኢትዮጵያ ፊደል
"http://news.com.et/" ወደ መነሻው ገጽ (Ethiopian News Headlines) የሣምንቱ ዜናዎች
This link appears to be down as of early September 2006.
ETRUSCAN - 𐌓𐌀𐌔𐌍𐌀 -
and other scripts
Tex Texin's Plane One page (Unicode NCR version)
...based on his Unicode Example Celebrity List
Sorin Paliga's Old Italic keyboard for MAC
GOTHIC
Gothic Test and Resource Links. (Unicode decimal NCR format)
GREEK - ΕΛΛΗΝΙΚΆ
Greek Test and Resource Link. (Unicode decimal NCR format)
Greek Test and Resource Link. (Unicode UTF-8 format)
INDIC SCRIPTS
http://www.tamil.net/people/sivaraj/links.html (Links Related to Indian Language Computing)
INPUT
http://people.w3.org/rishida/scripts/pickers/ Script pickers from Richard Ishida
Unicode Input - Some old on-line java-based
screen keyboards plus links to other resources.
Andrew C. West's freeware
BabelMap program for Windows.
IPA - ɪntəˈnæʃənəl fəˈnɛtɪk
http://www.phon.ucl.ac.uk/home/wells/ipa-unicode.htm
(The International Phonetic Alphabet in Unicode)
JAPANESE - 日本語
Konnichiwa Around the World. (Unicode decimal NCR format)
Konnichiwa Around the World. (Unicode UTF-8 format)
KAYAH LI - ꤊꤢ꤬ꤛꤢ꤭ ꤜꤟꤤ꤬
Kayah Li Test and Resource Link. (Unicode UTF-8 format)
KLALLAM - nəxʷsƛʼáyəm
http://www.elwha.org/language.htm (Graphics)
KLINGON - -
Klingon Test and Resource Link. (Unicode decimal NCR format)
Klingon Test and Resource Link. (Unicode UTF-8 format)
KOREAN - 한글
Korean Test. (Unicode decimal NCR format)
Korean Test. (Unicode UTF-8 format)
http://jshin.net/i18n/korean/hunmin.html (Unicode UTF-8 format) - Test page for Hangul Jamo range by Jungshik Shin.
Making Home Pages in Korean (In Japanese, Shift_JIS)
LAO - ລາວ
Lao Test Page from Tavultesoft. (Unicode UTF-8 format)
Note - this page not available as of 2006/09/05
MĀORI
Māori Test. (Unicode decimal NCR format)
Māori Test. (Unicode UTF-8 format)
MATH
Math Test. (Unicode decimal NCR format)
NAVAJO
Navajo Test. (Unicode UTF-8 format)
NEPALI
http://www.nepali.info/nepali/ (Unicode UTF-8 format)
NUMBERS - 1٢੩四൫Ⅵ௭೮୯
A Table of Numbers. (Unicode decimal NCR format)
A Table of Numbers. (Unicode UTF-8 format)
OGHAM - ᚁᚓᚈᚆ ᚂᚒᚔᚄ ᚅᚔᚑᚅ
Ogham Test and Resource Link. (Unicode decimal NCR format)
Ogham Test and Resource Link. (Unicode UTF-8 format)
OLD PERSIAN CUNEIFORM
Behistun Inscription Portion (Unicode decimal NCR format)
PHAGS-PA
BabelStone : Phags-pa Script
Andrew West has released the first Unicode font for the Phags-pa
script. The above linked page has a link for the font download. This
page (and other pages linked therein) contains a wealth of information
about this fascinating historic script.
PHOENICIAN
A Bequest Unearthed - Phoenicia
A huge site full of interesting information about Phoenicia and its heritage,
including pages about the language, the Phoenician script, and the lore of
this fascinating culture of historic importance.
PHOENICIAN (MOABITE)
The Moabite Stone. (Unicode UTF-8 format)
PUNJABI - ਪੰਜਾਬੀ - (Poetry by ਪ੍ਰੀਤਮ ਸਿਘ ਧੰਜਲ)
http://www.dhanjal.com/ (Graphics)
Punjabi Computing Resource Centre Unicode 4.0 Gurmukhi Unicode font download!
RUNIC - ᚠᚢᚦᚫᚱᚴ
The Björketorp Inscription. (Unicode decimal NCR format)
The Björketorp Inscription. (Unicode UTF-8 format)
RUSSIAN - РУССКИ
Some Phrases In Russian. (Unicode decimal NCR format)
Some Phrases In Russian. (Unicode UTF-8 format)
SOUTHERN TUTCHONE
Southern Tutchone Orthography Example (Unicode decimal NCR format)
Southern Tutchone Orthography Example (Unicode UTF-8 format)
SYRIAC
Sample Syriac Text (Unicode Decimal NCR format)
TAMIL - தமிழ்
Sample Tamil Text (Unicode Decimal NCR format)
Sample Tamil Text (Unicode UTF-8 format)
TENGWAR (Unofficial encodings)
Tengwar Test and Links
THAANA - ަނާތ ިޅުބަގ
Sample Thaana Text and Resource Links. (Unicode UTF-8 format)
THAI - ไทย
Title Page from English-Thai Dictionary. (Unicode Decimal NCR format)
Title Page from English-Thai Dictionary. (Unicode UTF-8 format)
UCAS - as used in ᐅᒥᐅᔭᕐᒃ, ᓄᓇᕕᒃ and other places.
- Unified Canadian Aboriginal Syllabics
Ronald Ogawa has a very nice font currently available as beta-test freeware
which includes Latin, Cyrillic, Greek and the UCAS.
The UCAS are used for writing languages such as Cree, Naskapi, Ojibwe,
and Inuktitut. The font is called "Ballymun RO"
and is available at:
http://nexus.brocku.ca/rogawa/ucas
There are sample UCAS documents available on his server's home page:
http://nexus.brocku.ca for Cree and Inuktitut, as well as test pages for Gaelic.
VIETNAMESE - VIỆT
Giống Kiến Lửa Du Nhập (Unicode decimal NCR format)
Giống Kiến Lửa Du Nhập (Unicode UTF-8 format)
http://www.vovisoft.com/ (Unicode Decimal NCR format)
Trụ Sở Vovisoft has articles on computing in the Vietnamese language, even
tutorials for Visual Basic!
YI
http://www.babelstone.co.uk/Yi/index.html
Andrew C. West's Yi Pages
YIDDISH
http://www.uyip.org/unicode Understanding Yiddish Information Processing with Unicode
YORÙBÁ
Yorùbá Example and Resource Link. (Unicode decimal NCR format)
Yorùbá Example and Resource Link. (Unicode UTF-8 format)
Multiple Language Resources
AIYONG MULTILINGUAL
http://www.ask.ne.jp/~shumei/aiyong-e.html Site uses many graphics
so some pages do not load quickly, but worth the wait. In addition to both free and commercial
translations, the site offers basic conversational phrases in 31 languages. (Script and sound!)
ÉCRITURES DU MONDE
http://www.culture.gouv.fr/edm/fr/
THE FOUR ESSENTIAL TRAVEL PHRASES
http://www.travelphrases.info/
Four phrases every traveller should know translated into many languages and
dialects.
HOT PEACH PAGES/EARTHWORDS
http://www.hotpeachpages.net/lang/index.html
Over 20 different Unicode ranges used, including Greek, Cyrillic, Hebrew, Arabic, Devanagari, Bengali,
Gurmukhi, Gujarati, Tamil, Thai, Lao, Georgian, Ethiopic, Unified Canadian Aboriginal Syllabics and CJK ranges.
INTERNATIONAL GLOSSARY OF HYDROLOGY
http://www.cig.ensmp.fr/~hubert/glu/aglo.htm Pierre Hubert is working on a project to provide the
International Glossary of Hydrology in HTML/Unicode format. The original book,
published by UNESCO and the World Meteorological Organization, contained terms in
English, Spanish, French, and Russian. With assistance from diverse colleagues,
he has added several languages to the glossary. All of the encoding on the site
is Unicode, with the exception of the Hindi pages.
JENNIFER'S LANGUAGE LINKS
http://www.elite.net/~runner/jennifers/index.htm Comprehensive Linguistic Links
LANGUAGES ON THE WEB
http://www.languages-on-the-web.com/ Thousands of links. Well organized. Site does not use fancy graphics or
java scripts so pages load quickly. You can find links to on-line instruction, on-line dictionaries, as well as references on specific languages and the cultures
of the language speakers.
TEX TEXIN
Tex Texin's Unicode Example Celebrity List
TITUS
- Unicode Support/Testing with Multilingual Sample Pages.
http://titus.uni-frankfurt.de/unicode/unitest.htm
Unicode Editors
Now you can create Unicode documents for the World Wide Web with Sharmahd Computing's UniPad.
Just create a file with UniPad, save it as UTF-8, and rename the file with file extension *.htm.
You can then read the document in your Internet Explorer 5.0 (with "View"-"Encoding"-"Unicode (UTF-8)"
selected). (Calling a file *.htm doesn't make the file an HTML file, though! Some mark-up may be
required, such as the <BR> tag...)
UniPad is a commercial product, a limited freeware version is also offered.
http://www.unipad.org
Andrew C. West's BabelPad is currently freeware. It is for Windows platforms.
Features include the ability to use any installed TrueType/OpenType font along
with several extremely useful input methods for eastern scripts. The editor
also uses the system's complex script shaping engine, which means that complex
scripts can be displayed correctly in a plain-text editor. BabelPad includes
complete support for Unicode 4.0, including characters beyond the Basic
Multilingual Plane. This editor represents a tremendous amount of work and is
most welcome.
http://www.babelstone.co.uk/Software/BabelPad.html
Unicode-based Fonts
In addition to the Ballymun RO font mentioned above (under UCAS), and the Code2000
font available on my home page...
Herman Miller creates fictional and experimental languages. He has a freeware Unicode-based
font called Thryomanes available at his website:
http://www.io.com/~hmiller/
Thryomanes includes all of the extended Latin characters. Try viewing the Vietnamese test
page referenced above with Thryomanes.
Thryomanes is currently archived on his "Languages" page along
with several other interesting freeware TTFs.
Junicode, a Unicode-based font for medievalists:
http://www.engl.virginia.edu/OE/junicode/junicode.html
Cardo, a Unicode-based font with OpenType tables for Hebrew
and Latin!
http://scholarsfonts.net/
WAZU JAPAN's
Gallery of Unicode Fonts
An extensive guide to Unicode fonts.
Unicode Resources
For more information about Unicode, please visit "Fonts for the Unicode
Character Set", maintained by Nelson H. F. Beebe at the University of Utah.
http://www.math.utah.edu/~beebe/fonts/unicode.html
There is also an abundance of useful information on his home page:
http://www.math.utah.edu/~beebe/
...as well as the Unicode-related bibliography at:
http://www.math.utah.edu/pub/tex/bib/index-table-u.html#unicode
...which he maintains as part of the TeX User Group Bibliography Archive.
Another excellent Unicode resource:
http://www.alanwood.net/unicode/
Please visit Alan Wood's website.
There are Unicode charts
with character names as well as clear, instructive information
about the background and implementation of Unicode, including
tips about using Unicode in several applications.
Unicode Test Pages:
Be sure to follow this link to the SAMPA home page for additional browser/unicode tests.
http://www.phon.ucl.ac.uk/home/sampa/unicodetest.htm
Unicode Test Pages:
СЛОВО - SLOVO
(Christoph Singer)
This site tells you how to make your PC write in Slavic, Cyrillic, and East Central European languages.
Эти страницы содержат информацию о том, как настроить персональный компьютер
для работы с русским разыком и другими славранскими языками, в том числе
старославянским и дореволюционным русским, и где можно найти шрифты и
полезные для многоязычной работы программы в Интернете.
http://www.ccss.de/slovo/index.html
Unicode Information in Vietnamese:
This page from Trụ Sở Vovisoft explains about Unicode and shows how
to enable Vietnamese support in various software applications. There
is a link provided on the site for the new version of the Tahoma font which
supports Vietnamese requirements.
Unicode cho chữ Việt
http://www.vovisoft.com/vovisoft/UnicodeChoVN.htm
International Commerce:
Multilingual Websites and Foreign Language E-commerce
Plenty of practical information here including well-written overviews with titles
such as: "How to Get Started in Multilingual Web E-commerce"
http://shoppingforbargains.com/xml.htm
...and, be sure to visit their home page.
Unicode Test Pages:
Test pages from Richard Ishida as well as charts
comparing features of various scripts:
http://www.xerox-emea.com/globaldesign/scripts/scriptsamples.htm
Unicode Test Pages:
Test pages and charts from MauveCloud
which also include a handy romaji to kana converter page!
http://www.mauvecloud.net/charsets
Unicode Test Page:
Here is a quick Unicode test page from Robert Parker:
http://ccwf.cc.utexas.edu/~robp/testcz.htm
Unicode Test Page:
Here is the Unicode Test Page from EHC:
Unicode によるハングル•日本語混在のテストページ
Unicode Test Page:
Test pages from Paul Johnston:
http://lismore.ccl.umist.ac.uk/paulj/unitest.html
Unicode Test Page:
Test page from Kermit, the Terminal Emulation folks:
http://www.columbia.edu/kermit/utf8.html
Unicode Test Page:
Test page from Katolika Lingvolaboristo:
http://pages.hotbot.com/politics/zjw/images/utftest.html
Unicode Test Page:
Test page from ThreeWeb:
http://www.threeweb.ad.jp/logos/mlweb/allutf8.html
Unicode Test Page:
Test page from Maribyrnong Library:
http://library.maribyrnong.vic.gov.au/utf8/
Unicode Test Page:
Test page from Тверской Государственный Университет
( UTF-7 format )
http://homepages.tversu.ru/~susov/unicodetest.htm
Unicode Test Page:
Peter Kleiweg has Unicode charts for fixed width fonts
under 'Manuals'
http://odur.let.rug.nl/~kleiweg/index.html
Unicode Test Page:
http://www.eleves.ens.fr:8080/home/madore/misc/unitest/
Search Engine - The Next Generation
Bjondi International has developed a new web site called Skworm. It promises
to be different than anything available.
Visit their home page and phrase your own search at:
http://www.skworm.com
⇑ My e-mail. ⇑
My name is James Kass. Let me know if you
have any suggestions for additions to this page.
My home page