Skip to main content
OCLC Support

Unsupported non-Latin characters

Discover information about which Unicode characters are currently unsupported in WorldShare Record Manager.

Some Unicode characters are not currently supported for use in Record Manager and will cause a validation error to occur. For information about the Telugu and Sinhala scripts specifically related to the Library of Congress scripts expansion project, see Details from OCLC: Library of Congress scripts.

If you need to enter these characters in a bibliographic or authority record in Record Manager:

  1. Enter the name of the character within square brackets, using the Unicode standard if available (e.g., enter [schwa]).
  2. You may also enter the hex values provided into Connexion.

Currently unsupported characters

Telugu - Visarga

Name: TELUGU SIGN VISARGA
Unicode: U+0C04
Hex value: &+0C04;
Character: ఼
Title: తెలుగు-ఉర్దూ ఫ఼ారసీ పదకోశము
Where it appears: In ఖ఼ాన్ (author name) and ఫ఼ారసీ (Fārsī in title)

Telugu - Nukta

Name: TELUGU SIGN NUKTA
Unicode: U+0C3C
Hex value: ఼
Character:
Title: తెలుగు-ఉర్దూ ఫ఼ారసీ పదకోశము
Where it appears: In ఖ఼ాన్ (author name) and ఫ఼ారసీ (Fārsī in title)

Telugu - Nakaara Pollu

Name: TELUGU LETTER NAKAARA POLLU
Unicode:  U+0C5D
Hex value: ౝ
Character:
Title: శ్రీ కృష్దేవమహారాయల ప్రభుత్వము
Where it appears: In కృష్దేవ (archaic spelling of Kṛṣṇadēva)

Telugu - Siddham

Name: TELUGU SIGN SIDDHAM
Unicode: U+0C77
Hex value: ౷
Character:
Title: ౷ సిద్ధిరస్తు
Where it appears: As an invocation at the beginning of inscriptions

Sinhala - Candrabindu

Name: SINHALA SIGN CANDRABINDU
Unicode: U+0D81
Hex value: ඁ
Character: ඁ
Title: සංස්කෘත-සිංහල ශබ්දකෝෂය
Where it appears: At the end of කෝෂය (kōṣayaṁ)

Canadian Syllabics Nattilik Ha

Unicode: U+11AB4 (part of the Supplementary Multilingual Plane)
Hex value: 𑪴
Character𑪴
Usage: Part of the Nattilik dialect of Inktitut