#set-operations #immutability #set #map #persistent #functional

no-std immutable-chunkmap

A fast immutable map and set with batch insert and update methods, COW operations, and big O efficient implementations of set and merge operations

31 releases (13 stable)

2.0.6 Oct 21, 2024
2.0.5 May 20, 2024
2.0.4 Feb 4, 2024
2.0.2 Oct 23, 2023
0.1.2 Dec 26, 2017

#98 in Data structures

Download history 20805/week @ 2024-09-06 21239/week @ 2024-09-13 25077/week @ 2024-09-20 29973/week @ 2024-09-27 28487/week @ 2024-10-04 31075/week @ 2024-10-11 35244/week @ 2024-10-18 36795/week @ 2024-10-25 41518/week @ 2024-11-01 33025/week @ 2024-11-08 31040/week @ 2024-11-15 31367/week @ 2024-11-22 36783/week @ 2024-11-29 34340/week @ 2024-12-06 35247/week @ 2024-12-13 25348/week @ 2024-12-20

137,617 downloads per month
Used in 456 crates (5 directly)

Apache-2.0 OR MIT

150KB
4K SLoC

immutable chunk map

A cache efficient immutable map with lookup performance close to BTreeMap and reasonably good insertion performance. Optional copy-on-write (COW) mutable operations bring modification performance within 2x of BTreeMap in the best case, while still offering the snapshotting and big O efficient set operations of a persistent data structure.
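To make the difference between persistent and copy-on-write updates concrete, here is a minimal sketch built only on the Rust standard library. It is not this crate's implementation or API: `SnapMap` and `snapshot_demo` are hypothetical names invented for illustration, and the toy copies the whole underlying `BTreeMap` whenever the data is shared, whereas a real persistent structure would typically share the unchanged parts.

```rust
use std::collections::BTreeMap;
use std::sync::Arc;

/// Toy snapshot-able map (hypothetical; NOT immutable-chunkmap's code).
#[derive(Clone, Debug, Default)]
pub struct SnapMap {
    inner: Arc<BTreeMap<u64, String>>,
}

impl SnapMap {
    pub fn new() -> Self {
        Self::default()
    }

    /// Persistent insert: `self` is left untouched, a new map is returned.
    pub fn insert(&self, k: u64, v: String) -> Self {
        let mut next = self.clone();
        next.insert_cow(k, v);
        next
    }

    /// Copy-on-write insert: mutates in place, and copies the data first
    /// only if another handle (a snapshot) still shares it.
    pub fn insert_cow(&mut self, k: u64, v: String) -> Option<String> {
        Arc::make_mut(&mut self.inner).insert(k, v)
    }

    pub fn get(&self, k: &u64) -> Option<&String> {
        self.inner.get(k)
    }

    pub fn len(&self) -> usize {
        self.inner.len()
    }
}

/// Takes a snapshot, keeps mutating the live map, and returns
/// (snapshot length, live length).
pub fn snapshot_demo() -> (usize, usize) {
    let mut live = SnapMap::new();
    live.insert_cow(1, "one".to_string()); // unshared: mutated in place
    let snapshot = live.clone(); // cheap: only bumps a reference count
    live.insert_cow(2, "two".to_string()); // shared: copies before writing
    (snapshot.len(), live.len())
}
```

The snapshot keeps seeing the old contents while the live map moves on, which is the property the COW operations described above are meant to preserve.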

A graph of lookup performance of various data structures using usize keys. Full test data in the bench/charts directory. Tests performed on an Intel Core i7 8550U under Linux with a locked frequency of 1.8 GHz.

  • OCaml: core map (from the Jane Street core library), an AVL tree with distinct leaf nodes and a relaxed balance constraint.
  • Chunkmap: this library
  • Chunkmap COW: this library using only COW operations
  • BTreeMap: from the Rust standard library
  • HashMap: from the Rust standard library

[Graph: lookup performance of the data structures listed above, using usize keys]

Chunkmap is very close to BTreeMap for random accesses using keys without hashing. If you don't need ordered data, use a HashMap.


Insertion performance, while not as good as most mutable data structures, is not awful when using COW mode exclusively. When you have many updates to do at once, you can go even faster by using insert_many. In some cases, e.g. building a map from scratch from sorted inputs, this can be faster than even a HashMap. The case below is more typical: adding 10% of a data set to the map.

[Graph: insertion performance when adding 10% of a data set to the map]

A note about the COW bar on this graph: it represents using only mutable COW operations on the map. It is perfectly possible to use an actual insert_many call instead of mutable COW operations if that is faster in your application, which, as you can see, depends on the size of the map.
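One general reason batch updates can beat one-at-a-time inserts is that a sorted batch can be combined with already sorted data in a single linear pass. The sketch below shows that idea on plain sorted slices using only the standard library. It is an illustration under that assumption, not the crate's insert_many algorithm, and `merge_batch` is a hypothetical helper name.

```rust
use std::cmp::Ordering;

/// Merge a sorted batch of updates into sorted base entries in one pass.
/// Both inputs must be sorted by key with no duplicate keys; when a key
/// appears in both, the batch value wins.
pub fn merge_batch(
    base: &[(u32, &'static str)],
    batch: &[(u32, &'static str)],
) -> Vec<(u32, &'static str)> {
    let mut out = Vec::with_capacity(base.len() + batch.len());
    let (mut i, mut j) = (0, 0);
    while i < base.len() && j < batch.len() {
        match base[i].0.cmp(&batch[j].0) {
            Ordering::Less => {
                out.push(base[i]);
                i += 1;
            }
            Ordering::Greater => {
                out.push(batch[j]);
                j += 1;
            }
            Ordering::Equal => {
                // Same key on both sides: keep the updated value.
                out.push(batch[j]);
                i += 1;
                j += 1;
            }
        }
    }
    // At most one of these tails is non-empty.
    out.extend_from_slice(&base[i..]);
    out.extend_from_slice(&batch[j..]);
    out
}
```

Each element is visited once, so the cost is proportional to the combined length rather than to the batch size times the cost of an individual insert.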

License

This project is dual licensed under either the MIT license or the Apache License 2.0, at your discretion.

Dependencies

~63–490KB
~10K SLoC