16 releases (stable)

2.0.0-beta1 Nov 23, 2024
1.5.1 Jun 24, 2024
1.5.0 May 28, 2024
1.4.0 Nov 16, 2023
0.4.0 Nov 2, 2021

#119 in Internationalization (i18n)

Download history 64132/week @ 2024-08-22 65037/week @ 2024-08-29 76603/week @ 2024-09-05 75502/week @ 2024-09-12 105405/week @ 2024-09-19 120607/week @ 2024-09-26 129391/week @ 2024-10-03 116847/week @ 2024-10-10 108881/week @ 2024-10-17 99197/week @ 2024-10-24 388071/week @ 2024-10-31 787109/week @ 2024-11-07 883882/week @ 2024-11-14 914644/week @ 2024-11-21 978306/week @ 2024-11-28 894138/week @ 2024-12-05

3,819,729 downloads per month
Used in 28,363 crates (24 directly)

Unicode-3.0

2MB
30K SLoC

icu_properties crates.io

Definitions of Unicode Properties and APIs for retrieving property data in an appropriate data structure.

This module is published as its own crate (icu_properties) and as part of the icu crate. See the latter for more details on the ICU4X project.

APIs that return a CodePointSetData exist for binary properties and certain enumerated properties.

APIs that return a CodePointMapData exist for certain enumerated properties.

Examples

Property data as CodePointSetDatas

use icu::properties::{CodePointSetData, CodePointMapData};
use icu::properties::props::{GeneralCategory, Emoji};

// A binary property as a `CodePointSetData`

assert!(CodePointSetData::new::<Emoji>().contains('🎃')); // U+1F383 JACK-O-LANTERN
assert!(!CodePointSetData::new::<Emoji>().contains('')); // U+6728

// An individual enumerated property value as a `CodePointSetData`

let line_sep_data = CodePointMapData::<GeneralCategory>::new()
    .get_set_for_value(GeneralCategory::LineSeparator);
let line_sep = line_sep_data.as_borrowed();

assert!(line_sep.contains('\u{2028}'));
assert!(!line_sep.contains('\u{2029}'));

Property data as CodePointMapDatas

use icu::properties::CodePointMapData;
use icu::properties::props::Script;

assert_eq!(CodePointMapData::<Script>::new().get('🎃'), Script::Common); // U+1F383 JACK-O-LANTERN
assert_eq!(CodePointMapData::<Script>::new().get(''), Script::Han); // U+6728

More Information

For more information on development, authorship, contributing etc. please visit ICU4X home page.

Dependencies

~0.7–1.4MB
~28K SLoC