stri_enc_fromutf32: Convert From UTF-32¶
Description¶
This function converts integer vectors, representing sequences of UTF-32 code points, to UTF-8 strings.
Usage¶
stri_enc_fromutf32(vec)
Arguments¶
|
a list of integer vectors (or objects coercible to such vectors) or |
Details¶
UTF-32 is a 32-bit encoding where each Unicode code point corresponds to exactly one integer value.
This function is a vectorized version of intToUtf8
. As usual in stringi, it returns character strings in UTF-8. See stri_enc_toutf32
for a dual operation.
If an ill-defined code point is given, a warning is generated and the corresponding string is set to NA
. Note that 0
s are not allowed in vec
, as they are used internally to mark the end of a string (in the C API).
See also stri_encode
for decoding arbitrary byte sequences from any given encoding.
Value¶
Returns a character vector (in UTF-8). NULL
s in the input list are converted to NA_character_
.
See Also¶
The official online manual of stringi at https://stringi.gagolewski.com/
Gagolewski M., stringi: Fast and portable character string processing in R, Journal of Statistical Software 103(2), 2022, 1-59, doi:10.18637/jss.v103.i02
Other encoding_conversion: about_encoding
, stri_enc_toascii()
, stri_enc_tonative()
, stri_enc_toutf32()
, stri_enc_toutf8()
, stri_encode()