Module talk:Unicode data/testcases
1 of 110 tests failed. (refresh)
test_lookup_name
| Test | Expected | Actual | |
|---|---|---|---|
| U+0000 | <control-0000> | <control-0000> | |
| U+007F | <control-007F> | <control-007F> | |
| U+00C1: Á | LATIN CAPITAL LETTER A WITH ACUTE | LATIN CAPITAL LETTER A WITH ACUTE | |
| U+0300: ◌̀ | COMBINING GRAVE ACCENT | COMBINING GRAVE ACCENT | |
| U+0378 | <reserved-0378> | <reserved-0378> | |
| U+1B44: ◌᭄ | BALINESE ADEG ADEG | BALINESE ADEG ADEG | |
| U+1F71: ά | GREEK SMALL LETTER ALPHA WITH OXIA | GREEK SMALL LETTER ALPHA WITH OXIA | |
| U+3555: 㕕 | CJK UNIFIED IDEOGRAPH-3555 | CJK UNIFIED IDEOGRAPH-3555 | |
| U+AC01: 각 | HANGUL SYLLABLE GAG | HANGUL SYLLABLE GAG | |
| U+D5FF: 헿 | HANGUL SYLLABLE HEH | HANGUL SYLLABLE HEH | |
| U+DC00 | <surrogate-DC00> | <surrogate-DC00> | |
| U+EEEE | <private-use-EEEE> | <private-use-EEEE> | |
| U+FE04: ︄ | VARIATION SELECTOR-5 | VARIATION SELECTOR-5 | |
| U+FDD1 | <noncharacter-FDD1> | <noncharacter-FDD1> | |
| U+FFFD: � | REPLACEMENT CHARACTER | REPLACEMENT CHARACTER | |
| U+FFFF | <noncharacter-FFFF> | <noncharacter-FFFF> | |
| U+1F4A9: 💩 | PILE OF POO | PILE OF POO | |
| U+2ED9D: | CJK UNIFIED IDEOGRAPH-2ED9D | CJK UNIFIED IDEOGRAPH-2ED9D | |
| U+E0000 | <reserved-E0000> | <reserved-E0000> | |
| U+F0F0F | <private-use-F0F0F> | <private-use-F0F0F> | |
| U+10FFFF | <noncharacter-10FFFF> | <noncharacter-10FFFF> |
test_lookup_age
| Test | Expected | Actual |
|---|
test_is_combining
| Test | Expected | Actual | |
|---|---|---|---|
| U+0300: ◌̀ (COMBINING GRAVE ACCENT) | true | true | |
| U+0060: ` (GRAVE ACCENT) | false | false |
test_is_default_ignorable
| Test | Expected | Actual |
|---|
test_lookup_script
| Test | Expected | Actual | |
|---|---|---|---|
| U+0061: a (LATIN SMALL LETTER A) | Latn | Latn | |
| U+002F: / (SOLIDUS) | Zyyy | Zyyy | |
| U+0300: ◌̀ (COMBINING GRAVE ACCENT) | Zinh | Zinh | |
| U+0378 (<reserved-0378>) | Zzzz | Zzzz | |
| U+0398: Θ (GREEK CAPITAL LETTER THETA) | Grek | Grek | |
| U+03E2: Ϣ (COPTIC CAPITAL LETTER SHEI) | Copt | Copt | |
| U+2014: — (EM DASH) | Zyyy | Zyyy |
test_lookup_category
| Test | Expected | Actual | |
|---|---|---|---|
| U+0009 (<control-0009>) | Cc | Cc | |
| U+0020: (SPACE) | Zs | Zs | |
| U+005B: [ (LEFT SQUARE BRACKET) | Ps | Ps | |
| U+005D: ] (RIGHT SQUARE BRACKET) | Pe | Pe | |
| U+005E: ^ (CIRCUMFLEX ACCENT) | Sk | Sk | |
| U+0041: A (LATIN CAPITAL LETTER A) | Lu | Lu | |
| U+00AD (SOFT HYPHEN) | Cf | Cf | |
| U+00BE: ¾ (VULGAR FRACTION THREE QUARTERS) | No | No | |
| U+00AB: « (LEFT-POINTING DOUBLE ANGLE QUOTATION MARK) | Pi | Pi | |
| U+00BB: » (RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK) | Pf | Pf | |
| U+0300: ◌̀ (COMBINING GRAVE ACCENT) | Mn | Mn | |
| U+0488: ҈ (COMBINING CYRILLIC HUNDRED THOUSANDS SIGN) | Me | Me | |
| U+0663: ٣ (ARABIC-INDIC DIGIT THREE) | Nd | Nd | |
| U+5B50: 子 (CJK UNIFIED IDEOGRAPH-5B50) | Lo | Lo | |
| U+1FAE: ᾮ (GREEK CAPITAL LETTER OMEGA WITH PSILI AND PERISPOMENI AND PROSGEGRAMMENI) | Lt | Lt | |
| U+1B44: ◌᭄ (BALINESE ADEG ADEG) | Mc | Mc | |
| U+2208: ∈ (ELEMENT OF) | Sm | Sm | |
| U+203F: ‿ (UNDERTIE) | Pc | Pc | |
| U+21B9: ↹ (LEFTWARDS ARROW TO BAR OVER RIGHTWARDS ARROW TO BAR) | So | So | |
| U+2E17: ⸗ (DOUBLE OBLIQUE HYPHEN) | Pd | Pd | |
| U+2167: Ⅷ (ROMAN NUMERAL EIGHT) | Nl | Nl | |
| U+2028 (LINE SEPARATOR) | Zl | Zl | |
| U+2029 (PARAGRAPH SEPARATOR) | Zp | Zp | |
| U+309E: ゞ (HIRAGANA VOICED ITERATION MARK) | Lm | Lm | |
| U+D800 (<surrogate-D800>) | Cs | Cs | |
| U+FFE1: £ (FULLWIDTH POUND SIGN) | Sc | Sc | |
| U+FFFF (<noncharacter-FFFF>) | Cn | nil | |
| U+100000 (<private-use-100000>) | Co | Co |
test_get_best_script
| Test | Expected | Actual | |
|---|---|---|---|
| %!?́: Zyyy (3), Zinh (1) | nil | nil | |
| ’ʼ“”†‡•‰′‽⁕ • Zyyy (11) |
nil | nil | |
| col·legi: Latn (7), Zyyy (1) | Latn | Latn | |
| HTML character references | |||
| 𐘀: Lina (1) | Lina | Lina | |
| 𐘀 • Zyyy (8), Latn (1) |
Lina | Lina | |
| – • Zyyy (1) |
nil | nil | |
| – • Latn (5), Zyyy (2) |
nil | nil | |
| Halkomelem | |||
| lá:yelhp: Latn (7), Zyyy (1) | Latn | Latn | |
| xʷməθkʷəy̓əm: Latn (10), Grek (1), Zinh (1) | nil | nil | |
| hən̓q̓əmin̓əm̓: Latn (10), Zinh (4) | Latn | Latn | |
| Quotes | |||
| Tant’è amara che poco è più morte; ma per trattar del ben ch’i’ vi trovai, dirò de l’altre cose ch’i’ v’ ho scorte. • Latn (83), Zyyy (32) |
Latn | Latn | |
| Díí Késhmish biyiʼ yáʼąąshdę́ę́ʼ ląʼígóó bee nikʼihojidlíi dooleeł. Niheechʼínáánáháhígíí biyiʼ iłhodeezyéél, iłhózhǫ́, ayóóʼóʼóʼní bee nikʼihojidlíi dooleeł. Tʼáá sahdiigiʼ átʼéego baa hózhǫ́ǫgo nihił hanááhoolzhiizhígí biyiʼ tʼáá ałtsojįʼ iłhózhǫ́ nííʼ dooleeł. • Latn (207), Zyyy (52), Zinh (5) |
Latn | Latn | |
| Μῆνῐν ᾰ̓́ειδε, θεᾱ́, Πηληῐ̈ᾰ́δεω Ᾰ̓χῐλῆος οὐλομένην, ἣ μῡρῐ́᾽ Ᾰ̓χαιοῖς ᾰ̓́λγε᾽ ἔθηκε, πολλᾱ̀ς δ᾽ ῐ̓φθῑ́μους ψῡχᾱ̀ς Ἄῐ̈δῐ προῐ̈́ᾰψεν ἡρώων, αὐτοὺς δὲ ἑλώρῐᾰ τεῦχε κῠ́νεσσιν οἰωνοῖσῐ́ τε πᾶσῐ· Δῐὸς δ᾽ ἐτελείετο βουλή· • Grek (160), Zyyy (36), Zinh (19) |
Grek | Grek | |
| Вот если вы не согласитесь с этим последним тезисом и ответите: «Не так» или «не всегда так», то я, пожалуй, и ободрюсь духом насчет значения героя моего Алексея Федоровича. Ибо не только чудак «не всегда» частность и обособление, а напротив, бывает так, что он-то, пожалуй, и носит в себе иной раз сердцевину целого, а остальные люди его эпохи — все, каким-нибудь наплывным ветром, на время почему-то от него оторвались… • Cyrl (329), Zyyy (93) |
Cyrl | Cyrl | |
| ॐ अग्निमीळे पुरोहितं यज्ञस्य देवमृत्विजम् । होतारं रत्नधातमम् ॥१॥ अग्निः पूर्वेभिरृषिभिरीड्यो नूतनैरुत । स देवाँ एह वक्षति ॥२॥ अग्निना रयिमश्नवत् पोषमेव दिवेदिवे । यशसं वीरवत्तमम् ॥३॥ अग्ने यं यज्ञमध्वरं विश्वतः परिभूरसि । स इद्देवेषु गच्छति ॥४॥ अग्निर्होता कविक्रतुः सत्यश्चित्रश्रवस्तमः । देवो देवेभिरा गमत् ॥५॥ यदङ्ग दाशुषे त्वमग्ने भद्रं करिष्यसि । तवेत् तत् सत्यमङ्गिरः ॥६॥ उप त्वाग्ने दिवेदिवे दोषावस्तर्धिया वयम् । नमो भरन्त एमसि ॥७॥ राजन्तमध्वराणां गोपामृतस्य दीदिविम् । वर्धमानं स्वे दमे ॥८॥ स नः पितेव सूनवेऽग्ने सूपायनो भव । सचस्वा नः स्वस्तये ॥९॥ • Deva (449), Zyyy (109) |
Deva | Deva | |
test_is_Latin
| Test | Expected | Actual | |
|---|---|---|---|
| %!?́: Zyyy (3), Zinh (1) | false | false | |
| ’ʼ“”†‡•‰′‽⁕ • Zyyy (11) |
false | false | |
| col·legi: Latn (7), Zyyy (1) | true | true | |
| HTML character references | |||
| 𐘀: Lina (1) | false | false | |
| 𐘀 • Zyyy (8), Latn (1) |
false | false | |
| – • Zyyy (1) |
false | false | |
| – • Latn (5), Zyyy (2) |
false | false | |
| Halkomelem | |||
| lá:yelhp: Latn (7), Zyyy (1) | true | true | |
| xʷməθkʷəy̓əm: Latn (10), Grek (1), Zinh (1) | false | false | |
| hən̓q̓əmin̓əm̓: Latn (10), Zinh (4) | true | true | |
| Quotes | |||
| Tant’è amara che poco è più morte; ma per trattar del ben ch’i’ vi trovai, dirò de l’altre cose ch’i’ v’ ho scorte. • Latn (83), Zyyy (32) |
true | true | |
| Díí Késhmish biyiʼ yáʼąąshdę́ę́ʼ ląʼígóó bee nikʼihojidlíi dooleeł. Niheechʼínáánáháhígíí biyiʼ iłhodeezyéél, iłhózhǫ́, ayóóʼóʼóʼní bee nikʼihojidlíi dooleeł. Tʼáá sahdiigiʼ átʼéego baa hózhǫ́ǫgo nihił hanááhoolzhiizhígí biyiʼ tʼáá ałtsojįʼ iłhózhǫ́ nííʼ dooleeł. • Latn (207), Zyyy (52), Zinh (5) |
true | true | |
| Μῆνῐν ᾰ̓́ειδε, θεᾱ́, Πηληῐ̈ᾰ́δεω Ᾰ̓χῐλῆος οὐλομένην, ἣ μῡρῐ́᾽ Ᾰ̓χαιοῖς ᾰ̓́λγε᾽ ἔθηκε, πολλᾱ̀ς δ᾽ ῐ̓φθῑ́μους ψῡχᾱ̀ς Ἄῐ̈δῐ προῐ̈́ᾰψεν ἡρώων, αὐτοὺς δὲ ἑλώρῐᾰ τεῦχε κῠ́νεσσιν οἰωνοῖσῐ́ τε πᾶσῐ· Δῐὸς δ᾽ ἐτελείετο βουλή· • Grek (160), Zyyy (36), Zinh (19) |
false | false | |
| Вот если вы не согласитесь с этим последним тезисом и ответите: «Не так» или «не всегда так», то я, пожалуй, и ободрюсь духом насчет значения героя моего Алексея Федоровича. Ибо не только чудак «не всегда» частность и обособление, а напротив, бывает так, что он-то, пожалуй, и носит в себе иной раз сердцевину целого, а остальные люди его эпохи — все, каким-нибудь наплывным ветром, на время почему-то от него оторвались… • Cyrl (329), Zyyy (93) |
false | false | |
| ॐ अग्निमीळे पुरोहितं यज्ञस्य देवमृत्विजम् । होतारं रत्नधातमम् ॥१॥ अग्निः पूर्वेभिरृषिभिरीड्यो नूतनैरुत । स देवाँ एह वक्षति ॥२॥ अग्निना रयिमश्नवत् पोषमेव दिवेदिवे । यशसं वीरवत्तमम् ॥३॥ अग्ने यं यज्ञमध्वरं विश्वतः परिभूरसि । स इद्देवेषु गच्छति ॥४॥ अग्निर्होता कविक्रतुः सत्यश्चित्रश्रवस्तमः । देवो देवेभिरा गमत् ॥५॥ यदङ्ग दाशुषे त्वमग्ने भद्रं करिष्यसि । तवेत् तत् सत्यमङ्गिरः ॥६॥ उप त्वाग्ने दिवेदिवे दोषावस्तर्धिया वयम् । नमो भरन्त एमसि ॥७॥ राजन्तमध्वराणां गोपामृतस्य दीदिविम् । वर्धमानं स्वे दमे ॥८॥ स नः पितेव सूनवेऽग्ने सूपायनो भव । सचस्वा नः स्वस्तये ॥९॥ • Deva (449), Zyyy (109) |
false | false | |
test_lookup_block
| Test | Expected | Actual | |
|---|---|---|---|
| U+0064: d | Basic Latin | Basic Latin | |
| U+030B: ◌̋ | Combining Diacritical Marks | Combining Diacritical Marks | |
| U+03A3: Σ | Greek and Coptic | Greek and Coptic | |
| U+0411: Б | Cyrillic | Cyrillic | |
| U+10E6: ღ | Georgian | Georgian | |
| U+3175: ㅵ | Hangul Compatibility Jamo | Hangul Compatibility Jamo | |
| U+AC01: 각 | Hangul Syllables | Hangul Syllables | |
| U+4E0A: 上 | CJK Unified Ideographs | CJK Unified Ideographs | |
| U+1F608: 😈 | Emoticons | Emoticons | |
| U+30000: 𰀀 | CJK Unified Ideographs Extension G | CJK Unified Ideographs Extension G | |
| U+10FFFF | Supplementary Private Use Area-B | Supplementary Private Use Area-B |
test_is_rtl
| Test | Expected | Actual | |
|---|---|---|---|
| أبو عبد الله محمد بن عبد الله اللواتي الطنجي بن بطوطة | true | true | |
| أدب القاضي Adab al-qādī | false | false | |
| ܛܘܼܒܲܝܗܘܿܢ ܠܐܲܝܠܹܝܢ ܕܲܕ݂ܟܹܝܢ ܒܠܸܒ̇ܗܘܿܢ܄ ܕܗܸܢ݂ܘܿܢ ܢܸܚܙܘܿܢ ܠܐܲܠܵܗܵܐ܂ | true | true | |
| בְּרֵאשִׁית, בָּרָא אֱלֹהִים, אֵת הַשָּׁמַיִם, וְאֵת הָאָרֶץ. | true | true | |
| 𞤀𞤣𞤤𞤢𞤥 𞤆𞤵𞤤𞤢𞤪 | true | true | |
| 𐬬𐬍𐬯𐬞𐬈 ⸱ 𐬰𐬍 ⸱ 𐬀𐬥𐬌𐬌𐬉 ⸱ 𐬨𐬀𐬜𐬃𐬢𐬵𐬋 ⸱ 𐬀𐬉𐬴𐬨𐬀 ⸱ 𐬵𐬀𐬗𐬌𐬧𐬙𐬈 ⸱ 𐬑𐬭𐬎𐬎𐬍𐬨 ⸱ 𐬛𐬭𐬎𐬎𐬋 ⸱ 𐬁𐬀𐬝 ⸱ 𐬵𐬋 ⸱ 𐬫𐬋 ⸱ 𐬵𐬀𐬊𐬨𐬀𐬵𐬈 ⸱ 𐬨𐬀𐬜𐬋 ⸱ 𐬀𐬴𐬀 ⸱ 𐬵𐬀𐬗𐬀𐬌𐬙𐬈 | true | true | |
| ދިވެހި | true | true | |
| 𐤀𐤓𐤍𐤟𐤆𐤐𐤏𐤋𐤟𐤀𐤕𐤁𐤏𐤋𐤟𐤁𐤍𐤀𐤇𐤓𐤌𐤟𐤌𐤋𐤊𐤂𐤁𐤋𐤟𐤋𐤀𐤇𐤓𐤌𐤟𐤀𐤁𐤄 | true | true | |
| ࡌࡀࡍࡃࡀ ࡖࡄࡉࡉࡀ | true | true | |
| ࠄࠟࠓࠂࠝࠓࠜࠉࠆࠜࠉࠌ | true | true | |
| %$!^& | false | false |
Content Disclaimer
Informasi ini disarikan dari Wikipedia dan disajikan kembali untuk tujuan edukasi. Konten tersedia di bawah lisensi CC BY-SA 3.0. Kami tidak bertanggung jawab atas ketidakakuratan data yang bersumber dari kontribusi publik tersebut.
- The information displayed on this website is sourced in part or in whole from Wikipedia and has been adapted for the purpose of restating it. We strive to provide accurate and relevant information, however:
- There is no guarantee of absolute accuracy. Wikipedia is an open, collaborative project that can be edited by anyone, so information is subject to change.
- It is not intended to constitute professional advice. The content displayed is for informational and educational purposes only. For important decisions (e.g., medical, legal, or financial), please consult a professional.
- Content copyright. Wikipedia is licensed under the Creative Commons Attribution-ShareAlike License (CC BY-SA). This means that content may be reused with appropriate attribution and shared under a similar license.
- Responsible use. Any risk arising from the use of information from this website is entirely the responsibility of the user.