stri_trans_list: List Available Text Transforms and Transliterators
Description
Returns a list of available text transform identifiers. Each of them may be used in stri_trans_general
tasks.
Usage
stri_trans_list()
Value
Returns a character vector.
References
General Transforms – ICU User Guide, https://unicode-org.github.io/icu/userguide/transforms/general/
See Also
The official online manual of stringi at https://stringi.gagolewski.com/
Gagolewski M., stringi: Fast and portable character string processing in R, Journal of Statistical Software 103(2), 2022, 1-59, doi:10.18637/jss.v103.i02
Other transform: stri_trans_char()
, stri_trans_general()
, stri_trans_nfc()
, stri_trans_tolower()
Examples
stri_trans_list()
## [1] "Accents-Any" "am_FONIPA-am"
## [3] "am-am_FONIPA" "am-am_Latn/BGN"
## [5] "am-ar" "am-chr"
## [7] "am-fa" "Amharic-Latin/BGN"
## [9] "Any-Accents" "Any-am"
## [11] "Any-am_FONIPA" "Any-am_Latn/BGN"
## [13] "Any-Any" "Any-ar"
## [15] "Any-ar_Latn/BGN" "Any-Arab"
## [17] "Any-Arabic" "Any-Armenian"
## [19] "Any-Armn" "Any-az/BGN"
## [21] "Any-be_Latn/BGN" "Any-Beng"
## [23] "Any-Bengali" "Any-bg_Latn/BGN"
## [25] "Any-blt_FONIPA" "Any-Bopo"
## [27] "Any-Bopomofo" "Any-CanadianAboriginal"
## [29] "Any-Cans" "Any-ch_FONIPA"
## [31] "Any-chr" "Any-chr_FONIPA"
## [33] "Any-cs_FONIPA" "Any-cy_FONIPA"
## [35] "Any-Cyrillic" "Any-Cyrl"
## [37] "Any-Deva" "Any-Devanagari"
## [39] "Any-dsb_FONIPA" "Any-dv_Latn/BGN"
## [41] "Any-el_Latn/BGN" "Any-eo_FONIPA"
## [43] "Any-es_419_FONIPA" "Any-es_FONIPA"
## [45] "Any-Ethi" "Any-Ethiopic"
## [47] "Any-fa" "Any-fa_FONIPA"
## [49] "Any-fa_Latn/BGN" "Any-FCC"
## [51] "Any-FCD" "Any-Geor"
## [53] "Any-Georgian" "Any-Greek"
## [55] "Any-Greek/UNGEGN" "Any-Grek"
## [57] "Any-Grek/UNGEGN" "Any-Gujarati"
## [59] "Any-Gujr" "Any-Gurmukhi"
## [61] "Any-Guru" "Any-ha_NE"
## [63] "Any-Hang" "Any-Hangul"
## [65] "Any-Hans" "Any-Hant"
## [67] "Any-he_Latn/BGN" "Any-Hebr"
## [69] "Any-Hebrew" "Any-Hex"
## [71] "Any-Hex/C" "Any-Hex/Java"
## [73] "Any-Hex/Perl" "Any-Hex/Unicode"
## [75] "Any-Hex/XML" "Any-Hex/XML10"
## [77] "Any-Hira" "Any-Hiragana"
## [79] "Any-hy_AREVMDA_FONIPA" "Any-hy_FONIPA"
## [81] "Any-hy_Latn/BGN" "Any-ia_FONIPA"
## [83] "Any-Jamo" "Any-ka_Latn/BGN"
## [85] "Any-ka_Latn/BGN_1981" "Any-Kana"
## [87] "Any-Kannada" "Any-Katakana"
## [89] "Any-kk_FONIPA" "Any-kk_Latn/BGN"
## [91] "Any-Knda" "Any-ky_FONIPA"
## [93] "Any-ky_Latn/BGN" "Any-la_FONIPA"
## [95] "Any-Latin" "Any-Latn"
## [97] "Any-Lower" "Any-Malayalam"
## [99] "Any-mk_Latn/BGN" "Any-Mlym"
## [101] "Any-mn_Latn/BGN" "Any-mn_Latn/MNS"
## [103] "Any-my" "Any-my_FONIPA"
## [105] "Any-my_Latn" "Any-Name"
## [107] "Any-NFC" "Any-NFD"
## [109] "Any-NFKC" "Any-NFKD"
## [111] "Any-Null" "Any-nv_FONIPA"
## [113] "Any-Oriya" "Any-Orya"
## [115] "Any-pl_FONIPA" "Any-ps_Latn/BGN"
## [117] "Any-Publishing" "Any-Remove"
## [119] "Any-rm_FONIPA_SURSILV" "Any-ro_FONIPA"
## [121] "Any-ru" "Any-ru_Latn/BGN"
## [123] "Any-sat_FONIPA" "Any-si_FONIPA"
## [125] "Any-si_Latn" "Any-sk_FONIPA"
## [127] "Any-sr_Latn/BGN" "Any-Syrc"
## [129] "Any-Syriac" "Any-ta_FONIPA"
## [131] "Any-Tamil" "Any-Taml"
## [133] "Any-Telu" "Any-Telugu"
## [135] "Any-Thaa" "Any-Thaana"
## [137] "Any-Thai" "Any-Title"
## [139] "Any-tk/BGN" "Any-ug_FONIPA"
## [141] "Any-uk_Latn/BGN" "Any-und_FONIPA"
## [143] "Any-und_FONXSAMP" "Any-Upper"
## [145] "Any-ur" "Any-uz_Cyrl"
## [147] "Any-uz_Latn" "Any-uz/BGN"
## [149] "Any-vec_FONIPA" "Any-xh_FONIPA"
## [151] "Any-yo_BJ" "Any-zh"
## [153] "Any-zu_FONIPA" "ar-ar_Latn/BGN"
## [155] "Arab-Latn" "Arabic-Latin"
## [157] "Arabic-Latin/BGN" "Armenian-Latin"
## [159] "Armenian-Latin/BGN" "Armn-Latn"
## [161] "ASCII-Latin" "az_Cyrl-az/BGN"
## [163] "az-Lower" "az-Title"
## [165] "az-Upper" "Azerbaijani-Latin/BGN"
## [167] "be-be_Latn/BGN" "Belarusian-Latin/BGN"
## [169] "Beng-Arab" "Beng-Deva"
## [171] "Beng-Gujr" "Beng-Guru"
## [173] "Beng-Knda" "Beng-Latn"
## [175] "Beng-Mlym" "Beng-Orya"
## [177] "Beng-Taml" "Beng-Telu"
## [179] "Beng-ur" "Bengali-Arabic"
## [181] "Bengali-Devanagari" "Bengali-Gujarati"
## [183] "Bengali-Gurmukhi" "Bengali-Kannada"
## [185] "Bengali-Latin" "Bengali-Malayalam"
## [187] "Bengali-Oriya" "Bengali-Tamil"
## [189] "Bengali-Telugu" "bg-bg_Latn/BGN"
## [191] "blt-blt_FONIPA" "Bopo-Latn"
## [193] "Bopomofo-Latin" "Bulgarian-Latin/BGN"
## [195] "Burmese-Latin" "CanadianAboriginal-Latin"
## [197] "Cans-Latn" "ch-am"
## [199] "ch-ar" "ch-ch_FONIPA"
## [201] "ch-chr" "ch-fa"
## [203] "chr-chr_FONIPA" "cs_FONIPA-ja"
## [205] "cs_FONIPA-ko" "cs-am"
## [207] "cs-ar" "cs-chr"
## [209] "cs-cs_FONIPA" "cs-fa"
## [211] "cs-ja" "cs-ko"
## [213] "cy-cy_FONIPA" "Cyrillic-Latin"
## [215] "Cyrl-Latn" "de-ASCII"
## [217] "Deva-Arab" "Deva-Beng"
## [219] "Deva-Gujr" "Deva-Guru"
## [221] "Deva-Knda" "Deva-Latn"
## [223] "Deva-Mlym" "Deva-Orya"
## [225] "Deva-Taml" "Deva-Telu"
## [227] "Deva-ur" "Devanagari-Arabic"
## [229] "Devanagari-Bengali" "Devanagari-Gujarati"
## [231] "Devanagari-Gurmukhi" "Devanagari-Kannada"
## [233] "Devanagari-Latin" "Devanagari-Malayalam"
## [235] "Devanagari-Oriya" "Devanagari-Tamil"
## [237] "Devanagari-Telugu" "Digit-Tone"
## [239] "dsb-dsb_FONIPA" "dv-dv_Latn/BGN"
## [241] "el-el_Latn/BGN" "el-Lower"
## [243] "el-Title" "el-Upper"
## [245] "eo-am" "eo-ar"
## [247] "eo-chr" "eo-eo_FONIPA"
## [249] "eo-fa" "es_419-am"
## [251] "es_419-ar" "es_419-chr"
## [253] "es_419-fa" "es_419-ja"
## [255] "es_419-zh" "es_FONIPA-am"
## [257] "es_FONIPA-es_419_FONIPA" "es_FONIPA-ja"
## [259] "es_FONIPA-zh" "es-am"
## [261] "es-ar" "es-chr"
## [263] "es-es_FONIPA" "es-fa"
## [265] "es-ja" "es-zh"
## [267] "Ethi-Latn" "Ethiopic-Latin"
## [269] "fa-fa_FONIPA" "fa-fa_Latn/BGN"
## [271] "Fullwidth-Halfwidth" "Geor-Latn"
## [273] "Georgian-Latin" "Georgian-Latin/BGN"
## [275] "Greek-Latin" "Greek-Latin/BGN"
## [277] "Greek-Latin/UNGEGN" "Grek-Latn"
## [279] "Grek-Latn/UNGEGN" "Gujarati-Arabic"
## [281] "Gujarati-Bengali" "Gujarati-Devanagari"
## [283] "Gujarati-Gurmukhi" "Gujarati-Kannada"
## [285] "Gujarati-Latin" "Gujarati-Malayalam"
## [287] "Gujarati-Oriya" "Gujarati-Tamil"
## [289] "Gujarati-Telugu" "Gujr-Arab"
## [291] "Gujr-Beng" "Gujr-Deva"
## [293] "Gujr-Guru" "Gujr-Knda"
## [295] "Gujr-Latn" "Gujr-Mlym"
## [297] "Gujr-Orya" "Gujr-Taml"
## [299] "Gujr-Telu" "Gujr-ur"
## [301] "Gurmukhi-Arabic" "Gurmukhi-Bengali"
## [303] "Gurmukhi-Devanagari" "Gurmukhi-Gujarati"
## [305] "Gurmukhi-Kannada" "Gurmukhi-Latin"
## [307] "Gurmukhi-Malayalam" "Gurmukhi-Oriya"
## [309] "Gurmukhi-Tamil" "Gurmukhi-Telugu"
## [311] "Guru-Arab" "Guru-Beng"
## [313] "Guru-Deva" "Guru-Gujr"
## [315] "Guru-Knda" "Guru-Latn"
## [317] "Guru-Mlym" "Guru-Orya"
## [319] "Guru-Taml" "Guru-Telu"
## [321] "Guru-ur" "ha-ha_NE"
## [323] "Halfwidth-Fullwidth" "Han-Latin"
## [325] "Han-Latin/Names" "Hang-Latn"
## [327] "Hangul-Latin" "Hani-Latn"
## [329] "Hans-Hant" "Hant-Hans"
## [331] "he-he_Latn/BGN" "Hebr-Latn"
## [333] "Hebrew-Latin" "Hebrew-Latin/BGN"
## [335] "Hex-Any" "Hex-Any/C"
## [337] "Hex-Any/Java" "Hex-Any/Perl"
## [339] "Hex-Any/Unicode" "Hex-Any/XML"
## [341] "Hex-Any/XML10" "Hira-Kana"
## [343] "Hira-Latn" "Hiragana-Katakana"
## [345] "Hiragana-Latin" "hy_AREVMDA-am"
## [347] "hy_AREVMDA-ar" "hy_AREVMDA-chr"
## [349] "hy_AREVMDA-fa" "hy_AREVMDA-hy_AREVMDA_FONIPA"
## [351] "hy-am" "hy-ar"
## [353] "hy-chr" "hy-fa"
## [355] "hy-hy_FONIPA" "hy-hy_Latn/BGN"
## [357] "ia-am" "ia-ar"
## [359] "ia-chr" "ia-fa"
## [361] "ia-ia_FONIPA" "IPA-XSampa"
## [363] "it-am" "it-ja"
## [365] "ja_Hrkt-ja_Latn/BGN" "ja_Latn-ko"
## [367] "ja_Latn-ru" "Jamo-Latin"
## [369] "Jamo-Latn" "ka-ka_Latn/BGN"
## [371] "ka-ka_Latn/BGN_1981" "Kana-Hira"
## [373] "Kana-Latn" "Kannada-Arabic"
## [375] "Kannada-Bengali" "Kannada-Devanagari"
## [377] "Kannada-Gujarati" "Kannada-Gurmukhi"
## [379] "Kannada-Latin" "Kannada-Malayalam"
## [381] "Kannada-Oriya" "Kannada-Tamil"
## [383] "Kannada-Telugu" "Katakana-Hiragana"
## [385] "Katakana-Latin" "Katakana-Latin/BGN"
## [387] "Kazakh-Latin/BGN" "Kirghiz-Latin/BGN"
## [389] "kk-am" "kk-ar"
## [391] "kk-chr" "kk-fa"
## [393] "kk-kk_FONIPA" "kk-kk_Latn/BGN"
## [395] "Knda-Arab" "Knda-Beng"
## [397] "Knda-Deva" "Knda-Gujr"
## [399] "Knda-Guru" "Knda-Latn"
## [401] "Knda-Mlym" "Knda-Orya"
## [403] "Knda-Taml" "Knda-Telu"
## [405] "Knda-ur" "ko-ko_Latn/BGN"
## [407] "Korean-Latin/BGN" "ky-am"
## [409] "ky-ar" "ky-chr"
## [411] "ky-fa" "ky-ky_FONIPA"
## [413] "ky-ky_Latn/BGN" "la-la_FONIPA"
## [415] "Latin-Arabic" "Latin-Armenian"
## [417] "Latin-ASCII" "Latin-Bengali"
## [419] "Latin-Bopomofo" "Latin-CanadianAboriginal"
## [421] "Latin-Cyrillic" "Latin-Devanagari"
## [423] "Latin-Ethiopic" "Latin-Georgian"
## [425] "Latin-Greek" "Latin-Greek/UNGEGN"
## [427] "Latin-Gujarati" "Latin-Gurmukhi"
## [429] "Latin-Hangul" "Latin-Hebrew"
## [431] "Latin-Hiragana" "Latin-Jamo"
## [433] "Latin-Kannada" "Latin-Katakana"
## [435] "Latin-Malayalam" "Latin-NumericPinyin"
## [437] "Latin-Oriya" "Latin-Russian/BGN"
## [439] "Latin-Syriac" "Latin-Tamil"
## [441] "Latin-Telugu" "Latin-Thaana"
## [443] "Latin-Thai" "Latn-Arab"
## [445] "Latn-Armn" "Latn-Beng"
## [447] "Latn-Bopo" "Latn-Cans"
## [449] "Latn-Cyrl" "Latn-Deva"
## [451] "Latn-Ethi" "Latn-Geor"
## [453] "Latn-Grek" "Latn-Grek/UNGEGN"
## [455] "Latn-Gujr" "Latn-Guru"
## [457] "Latn-Hang" "Latn-Hebr"
## [459] "Latn-Hira" "Latn-Jamo"
## [461] "Latn-Kana" "Latn-Knda"
## [463] "Latn-Mlym" "Latn-Orya"
## [465] "Latn-Syrc" "Latn-Taml"
## [467] "Latn-Telu" "Latn-Thaa"
## [469] "Latn-Thai" "lt-Lower"
## [471] "lt-Title" "lt-Upper"
## [473] "Macedonian-Latin/BGN" "Malayalam-Arabic"
## [475] "Malayalam-Bengali" "Malayalam-Devanagari"
## [477] "Malayalam-Gujarati" "Malayalam-Gurmukhi"
## [479] "Malayalam-Kannada" "Malayalam-Latin"
## [481] "Malayalam-Oriya" "Malayalam-Tamil"
## [483] "Malayalam-Telugu" "Maldivian-Latin/BGN"
## [485] "mk-mk_Latn/BGN" "Mlym-Arab"
## [487] "Mlym-Beng" "Mlym-Deva"
## [489] "Mlym-Gujr" "Mlym-Guru"
## [491] "Mlym-Knda" "Mlym-Latn"
## [493] "Mlym-Orya" "Mlym-Taml"
## [495] "Mlym-Telu" "Mlym-ur"
## [497] "mn-mn_Latn/BGN" "mn-mn_Latn/MNS"
## [499] "Mongolian-Latin/BGN" "my-am"
## [501] "my-ar" "my-chr"
## [503] "my-fa" "my-my_FONIPA"
## [505] "my-my_Latn" "my-Zawgyi"
## [507] "Myanmar-Latin" "Name-Any"
## [509] "nl-Title" "NumericPinyin-Latin"
## [511] "NumericPinyin-Pinyin" "nv-nv_FONIPA"
## [513] "Oriya-Arabic" "Oriya-Bengali"
## [515] "Oriya-Devanagari" "Oriya-Gujarati"
## [517] "Oriya-Gurmukhi" "Oriya-Kannada"
## [519] "Oriya-Latin" "Oriya-Malayalam"
## [521] "Oriya-Tamil" "Oriya-Telugu"
## [523] "Orya-Arab" "Orya-Beng"
## [525] "Orya-Deva" "Orya-Gujr"
## [527] "Orya-Guru" "Orya-Knda"
## [529] "Orya-Latn" "Orya-Mlym"
## [531] "Orya-Taml" "Orya-Telu"
## [533] "Orya-ur" "Pashto-Latin/BGN"
## [535] "Persian-Latin/BGN" "Pinyin-NumericPinyin"
## [537] "pl_FONIPA-ja" "pl-am"
## [539] "pl-ar" "pl-chr"
## [541] "pl-fa" "pl-ja"
## [543] "pl-pl_FONIPA" "ps-ps_Latn/BGN"
## [545] "Publishing-Any" "rm_SURSILV-am"
## [547] "rm_SURSILV-ar" "rm_SURSILV-chr"
## [549] "rm_SURSILV-fa" "rm_SURSILV-rm_FONIPA_SURSILV"
## [551] "ro_FONIPA-ja" "ro-am"
## [553] "ro-ar" "ro-chr"
## [555] "ro-fa" "ro-ja"
## [557] "ro-ro_FONIPA" "ru_Latn-ru/BGN"
## [559] "ru-ja" "ru-ru_Latn/BGN"
## [561] "ru-zh" "Russian-Latin/BGN"
## [563] "sat_Olck-sat_FONIPA" "sat-am"
## [565] "sat-ar" "sat-chr"
## [567] "sat-fa" "Serbian-Latin/BGN"
## [569] "si-am" "si-ar"
## [571] "si-chr" "si-fa"
## [573] "si-si_FONIPA" "si-si_Latn"
## [575] "Simplified-Traditional" "sk_FONIPA-ja"
## [577] "sk-am" "sk-ar"
## [579] "sk-chr" "sk-fa"
## [581] "sk-ja" "sk-sk_FONIPA"
## [583] "sr-sr_Latn/BGN" "Syrc-Latn"
## [585] "Syriac-Latin" "ta-ta_FONIPA"
## [587] "Tamil-Arabic" "Tamil-Bengali"
## [589] "Tamil-Devanagari" "Tamil-Gujarati"
## [591] "Tamil-Gurmukhi" "Tamil-Kannada"
## [593] "Tamil-Latin" "Tamil-Malayalam"
## [595] "Tamil-Oriya" "Tamil-Telugu"
## [597] "Taml-Arab" "Taml-Beng"
## [599] "Taml-Deva" "Taml-Gujr"
## [601] "Taml-Guru" "Taml-Knda"
## [603] "Taml-Latn" "Taml-Mlym"
## [605] "Taml-Orya" "Taml-Telu"
## [607] "Taml-ur" "Telu-Arab"
## [609] "Telu-Beng" "Telu-Deva"
## [611] "Telu-Gujr" "Telu-Guru"
## [613] "Telu-Knda" "Telu-Latn"
## [615] "Telu-Mlym" "Telu-Orya"
## [617] "Telu-Taml" "Telu-ur"
## [619] "Telugu-Arabic" "Telugu-Bengali"
## [621] "Telugu-Devanagari" "Telugu-Gujarati"
## [623] "Telugu-Gurmukhi" "Telugu-Kannada"
## [625] "Telugu-Latin" "Telugu-Malayalam"
## [627] "Telugu-Oriya" "Telugu-Tamil"
## [629] "Thaa-Latn" "Thaana-Latin"
## [631] "Thai-Latin" "Thai-Latn"
## [633] "tk_Cyrl-tk/BGN" "tlh-am"
## [635] "tlh-ar" "tlh-chr"
## [637] "tlh-fa" "tlh-tlh_FONIPA"
## [639] "Tone-Digit" "tr-Lower"
## [641] "tr-Title" "tr-Upper"
## [643] "Traditional-Simplified" "Turkmen-Latin/BGN"
## [645] "ug-ug_FONIPA" "uk-uk_Latn/BGN"
## [647] "Ukrainian-Latin/BGN" "und_FONIPA-ar"
## [649] "und_FONIPA-chr" "und_FONIPA-fa"
## [651] "und_FONIPA-und_FONXSAMP" "und_FONXSAMP-und_FONIPA"
## [653] "uz_Cyrl-uz_Latn" "uz_Cyrl-uz/BGN"
## [655] "uz_Latn-uz_Cyrl" "Uzbek-Latin/BGN"
## [657] "vec-vec_FONIPA" "xh-am"
## [659] "xh-ar" "xh-chr"
## [661] "xh-fa" "xh-xh_FONIPA"
## [663] "XSampa-IPA" "yo-yo_BJ"
## [665] "Zawgyi-my" "zh_Latn_PINYIN-ru"
## [667] "zu-am" "zu-ar"
## [669] "zu-chr" "zu-fa"
## [671] "zu-zu_FONIPA"