4.14 Hash Tables
Hash Tables in The Racket Guide introduces hash tables.
A hash table (or simply hash) maps each of its keys to a single value. For a given hash table, keys are equivalent via equal?, eqv?, or eq?, and keys are retained either strongly, weakly (see Weak Boxes), or like ephemerons. A hash table is also either mutable or immutable. Immutable hash tables support effectively constant-time access and update, just like mutable hash tables; the constant on immutable operations is usually larger, but the functional nature of immutable hash tables can pay off in certain algorithms. Use immutable? to check whether a hash table is immutable.
Immutable hash tables actually provide O(log N) access and update. Since N is limited by the address space so that log N is limited to less than 30 or 62 (depending on the platform), log N can be treated reasonably as a constant.
For equal?-based hashing, the built-in hash functions on strings, pairs, lists, vectors, prefab or transparent structures, etc., take time proportional to the size of the value. The hash code for a compound data structure, such as a list or vector, depends on hashing each item of the container, but the depth of such recursive hashing is limited (to avoid potential problems with cyclic data). For a non-list pair, both car and cdr hashing is treated as a deeper hash, but the cdr of a list is treated as having the same hashing depth as the list.
A hash table can be used as a two-valued sequence (see Sequences). The keys and values of the hash table serve as elements of the sequence (i.e., each element is a key and its associated value). If a mapping is added to or removed from the hash table during iteration, then an iteration step may fail with exn:fail:contract, or the iteration may skip or duplicate keys and values. See also in-hash, in-hash-keys, in-hash-values, and in-hash-pairs.
Two hash tables cannot be equal? unless they have the same mutability, use the same key-comparison procedure (equal?, eqv?, or eq?), both hold keys strongly, weakly, or like ephemerons. Empty immutable hash tables are eq? when they are equal?.
Changed in version 7.2.0.9 of package base: Made empty immutable hash tables eq? when they are equal?.
Caveats concerning concurrent modification: A mutable hash table can be manipulated with hash-ref, hash-set!, and hash-remove! concurrently by multiple threads, and the operations are protected by a table-specific semaphore as needed. Several caveats apply, however:
If a thread is terminated while applying hash-ref, hash-ref-key, hash-set!, hash-remove!, hash-ref!, hash-update!, or hash-clear! to a hash table that uses equal? or eqv? key comparisons, all current and future operations on the hash table may block indefinitely.
The hash-map, hash-for-each, and hash-clear! procedures do not use the table’s semaphore to guard the traversal as a whole (if a traversal is needed, in the case of hash-clear!). Changes by one thread to a hash table can affect the keys and values seen by another thread part-way through its traversal of the same hash table.
The hash-update! and hash-ref! functions use a table’s semaphore independently for the hash-ref and hash-set! parts of their functionality, which means that the update as a whole is not “atomic.”
Adding a mutable hash table as a key in itself is trouble on the grounds that the key is being mutated (see the caveat below), but it is also a kind of concurrent use of the hash table: computing a hash table’s hash code may require waiting on the table’s semaphore, but the semaphore is already held for modifying the hash table, so the hash-table addition can block indefinitely.
Caveat concerning mutable keys: If a key in an equal?-based hash table is mutated (e.g., a key string is modified with string-set!), then the hash table’s behavior for insertion and lookup operations becomes unpredictable.
A literal or printed hash table starts with #hash, #hasheqv, or #hasheq. See Reading Hash Tables for information on reading hash tables and Printing Hash Tables for information on printing hash tables.
procedure
(hash-equal? hash) → boolean?
hash : hash?
procedure
(hash-strong? hash) → boolean?
hash : hash?
Added in version 8.0.0.10 of package base.
procedure
(hash-weak? hash) → boolean?
hash : hash?
procedure
(hash-ephemeron? hash) → boolean?
hash : hash?
Added in version 8.0.0.10 of package base.
procedure
(hash key val ... ...) → (and/c hash? hash-equal? immutable?)
key : any/c val : any/c
procedure
(hasheq key val ... ...) → (and/c hash? hash-eq? immutable?)
key : any/c val : any/c
procedure
(hasheqv key val ... ...) → (and/c hash? hash-eqv? immutable?)
key : any/c val : any/c
The hash procedure creates a table where keys are compared with equal?, hasheq procedure creates a table where keys are compared with eq?, and hasheqv procedure creates a table where keys are compared with eqv?.
The key to val mappings are added to the table in the order that they appear in the argument list, so later mappings can hide earlier mappings if the keys are equal.
procedure
(make-hash [assocs]) → (and/c hash? hash-equal?)
assocs : (listof pair?) = null
procedure
(make-hasheqv [assocs]) → (and/c hash? hash-eqv?)
assocs : (listof pair?) = null
procedure
(make-hasheq [assocs]) → (and/c hash? hash-eq?)
assocs : (listof pair?) = null
The make-hash procedure creates a table where keys are compared with equal?, make-hasheq procedure creates a table where keys are compared with eq?, and make-hasheqv procedure creates a table where keys are compared with eqv?.
The table is initialized with the content of assocs. In each element of assocs, the car is a key, and the cdr is the corresponding value. The mappings are added to the table in the order that they appear in assocs, so later mappings can hide earlier mappings.
See also make-custom-hash.
procedure
(make-weak-hash [assocs]) → (and/c hash? hash-equal? hash-weak?)
assocs : (listof pair?) = null
procedure
(make-weak-hasheqv [assocs]) → (and/c hash? hash-eqv? hash-weak?)
assocs : (listof pair?) = null
procedure
(make-weak-hasheq [assocs]) → (and/c hash? hash-eq? hash-weak?)
assocs : (listof pair?) = null
Beware that values in a weak hash table are retained normally. If a value in the table refers back to its key, then the table will retain the value and therefore the key; the mapping will never be removed from the table even if the key becomes otherwise inaccessible. To avoid that problem, use an ephemeron hash table as created by make-ephemeron-hash, make-ephemeron-hasheqv, or make-ephemeron-hasheq. For values that do not refer to keys, there is a modest extra cost to using an ephemeron hash table instead of a weak hash table, but prefer an ephemeron hash table when in doubt.
procedure
(make-ephemeron-hash [assocs])
→ (and/c hash? hash-equal? hash-ephemeron?) assocs : (listof pair?) = null
procedure
(make-ephemeron-hasheqv [assocs])
→ (and/c hash? hash-eqv? hash-ephemeron?) assocs : (listof pair?) = null
procedure
(make-ephemeron-hasheq [assocs])
→ (and/c hash? hash-eq? hash-ephemeron?) assocs : (listof pair?) = null
Using an ephemeron hash table is like using a weak hash table and mapping each key to a ephemeron that pairs the key and value. An advantage of an ephemeron hash table is that the value need not be extracted with ephemeron-value from the result of functions like hash-ref. An ephemeron hash table might also be represented more compactly than a weak hash table with explicit ephemeron values.
Added in version 8.0.0.10 of package base.
procedure
(make-immutable-hash [assocs])
→ (and/c hash? hash-equal? immutable?) assocs : (listof pair?) = null
procedure
(make-immutable-hasheqv [assocs])
→ (and/c hash? hash-eqv? immutable?) assocs : (listof pair?) = null
procedure
(make-immutable-hasheq [assocs])
→ (and/c hash? hash-eq? immutable?) assocs : (listof pair?) = null
See also the caveats concerning concurrent modification and the caveat concerning mutable keys above.
procedure
(hash-set*! hash key v ... ...) → void?
hash : (and/c hash? (not/c immutable?)) key : any/c v : any/c
See also the caveats concerning concurrent modification and the caveat concerning mutable keys above.
procedure
(hash-set hash key v) → (and/c hash? immutable?)
hash : (and/c hash? immutable?) key : any/c v : any/c
See also the caveat concerning mutable keys above.
procedure
(hash-set* hash key v ... ...) → (and/c hash? immutable?)
hash : (and/c hash? immutable?) key : any/c v : any/c
See also the caveat concerning mutable keys above.
procedure
hash : hash? key : any/c
failure-result : failure-result/c =
(lambda () (raise (make-exn:fail:contract ....)))
If failure-result is a procedure, it is called (through a tail call) with no arguments to produce the result.
Otherwise, failure-result is returned as the result.
See also the caveats concerning concurrent modification and the caveat concerning mutable keys above.
procedure
(hash-ref-key hash key [failure-result]) → any
hash : hash? key : any/c
failure-result : failure-result/c =
(lambda () (raise (make-exn:fail:contract ....)))
If hash is not an impersonator, then the returned key, assuming it is found, will be eq?-equivalent to the one actually retained by hash:
> (define original-key "hello") > (define key-copy (string-copy original-key)) > (equal? original-key key-copy) #t
> (eq? original-key key-copy) #f
> (define table (make-hash)) > (hash-set! table original-key 'value) > (eq? (hash-ref-key table "hello") original-key) #t
> (eq? (hash-ref-key table "hello") key-copy) #f
If a mutable hash is updated multiple times using keys that are not eq?-equivalent but are equivalent according to the hash’s key-comparison procedure, the hash retains the first one:
> (define original-key "hello") > (define key-copy (string-copy original-key)) > (define table (make-hash)) > (hash-set! table original-key 'one) > (hash-set! table key-copy 'two) > (eq? (hash-ref-key table "hello") original-key) #t
> (eq? (hash-ref-key table "hello") key-copy) #f
> (define original-key "hello") > (define key-copy (string-copy original-key)) > (define table0 (hash)) > (define table1 (hash-set table0 original-key 'one)) > (define table2 (hash-set table1 key-copy 'two)) > (eq? (hash-ref-key table2 "hello") original-key) #f
> (eq? (hash-ref-key table2 "hello") key-copy) #t
If hash is an impersonator, then the returned key will be determined as described in the documentation to impersonate-hash.
See also the caveats concerning concurrent modification and the caveat concerning mutable keys above.
Added in version 7.4.0.3 of package base.
procedure
hash : hash? key : any/c to-set : failure-result/c
See also the caveats concerning concurrent modification and the caveat concerning mutable keys above.
procedure
(hash-has-key? hash key) → boolean?
hash : hash? key : any/c
procedure
(hash-update! hash key updater [ failure-result]) → void? hash : (and/c hash? (not/c immutable?)) key : any/c updater : (any/c . -> . any/c)
failure-result : failure-result/c =
(lambda () (raise (make-exn:fail:contract ....)))
(define h (make-hash)) (hash-set! h 'a 5)
> (hash-update! h 'a add1) > h '#hash((a . 6))
The optional failure-result argument is used when no mapping exists for key already, in the same manner as in hash-ref.
(define h (make-hash))
> (hash-update! h 'b add1) hash-update!: no value found for key: 'b
> (hash-update! h 'b add1 0) > h '#hash((b . 1))
See also the caveats concerning concurrent modification and the caveat concerning mutable keys above.
procedure
(hash-update hash key updater [failure-result])
→ (and/c hash? immutable?) hash : (and/c hash? immutable?) key : any/c updater : (any/c . -> . any/c)
failure-result : failure-result/c =
(lambda () (raise (make-exn:fail:contract ....)))
(define h (hash 'a 5))
> (hash-update h 'a add1) '#hash((a . 6))
The optional failure-result argument is used when no mapping exists for key already, in the same manner as in hash-ref.
(define h (hash))
> (hash-update h 'b add1) hash-update: no value found for key: 'b
> (hash-update h 'b add1 0) '#hash((b . 1))
See also the caveat concerning mutable keys above.
procedure
(hash-remove! hash key) → void?
hash : (and/c hash? (not/c immutable?)) key : any/c
See also the caveats concerning concurrent modification and the caveat concerning mutable keys above.
procedure
(hash-remove hash key) → (and/c hash? immutable?)
hash : (and/c hash? immutable?) key : any/c
See also the caveat concerning mutable keys above.
procedure
(hash-clear! hash) → void?
hash : (and/c hash? (not/c immutable?))
If hash is not an impersonator, then all mappings are removed in constant time. If hash is an impersonator, then each key is removed one-by-one using hash-remove!.
See also the caveats concerning concurrent modification and the caveat concerning mutable keys above.
procedure
(hash-clear hash) → (and/c hash? immutable?)
hash : (and/c hash? immutable?)
If hash is not a chaperone, then clearing is equivalent to creating a new hash table, and the operation is performed in constant time. If hash is a chaperone, then each key is removed one-by-one using hash-remove.
procedure
(hash-copy-clear hash) → hash?
hash : hash?
If a hash table is extended with new keys (either through proc or by another thread) while a hash-map or hash-for-each traversal is in process, arbitrary key–value pairs can be dropped or duplicated in the traversal. Key mappings can be deleted or remapped (by any thread) with no adverse affects; the change does not affect a traversal if the key has been seen already, otherwise the traversal skips a deleted key or uses the remapped key’s new value.
See also the caveats concerning concurrent modification above.
If try-order? is true, then the order of keys and values
passed to proc is normalized under certain
circumstances—
booleans sorted #f before #t;
characters sorted by char<?;
real numbers sorted by <;
symbols sorted with uninterned symbols before unreadable symbols before interned symbols, then sorted by symbol<?;
byte strings sorted by bytes<?;
null;
#<void>; and
eof.
Changed in version 6.3 of package base: Added the try-order? argument.
Changed in version 7.1.0.7: Added guarantees for try-order?.
procedure
(hash-values hash) → (listof any/c)
hash : hash?
procedure
(hash-keys-subset? hash1 hash2) → boolean?
hash1 : hash? hash2 : hash?
Using hash-keys-subset? on immutable hash tables can be much faster than iterating through the keys of hash1 to make sure that each is in hash2.
Added in version 6.5.0.8 of package base.
procedure
(hash-for-each hash proc [try-order?]) → void?
hash : hash? proc : (any/c any/c . -> . any) try-order? : any/c = #f
Changed in version 6.3 of package base: Added the try-order? argument.
Changed in version 7.1.0.7: Added guarantees for try-order?.
procedure
(hash-count hash) → exact-nonnegative-integer?
hash : hash?
For the CS implementation of Racket, the result is always computed in constant time and atomically. For the BC implementation of Racket, the result is computed in constant time and atomically only if hash does not retain keys weakly or like an ephemeron, otherwise, a traversal is required to count the keys.
procedure
(hash-empty? hash) → boolean?
hash : hash?
procedure
(hash-iterate-first hash)
→ (or/c #f exact-nonnegative-integer?) hash : hash?
For a mutable hash, this index is guaranteed to refer to the first item only as long as no items are added to or removed from hash. More generally, an index is guaranteed to be a valid hash index for a given hash table only as long it comes from hash-iterate-first or hash-iterate-next, and only as long as the hash table is not modified. In the case of a hash table with weakly held keys or keys held like ephemerons, the hash table can be implicitly modified by the garbage collector (see Garbage Collection) when it discovers that the key is not reachable.
procedure
(hash-iterate-next hash pos)
→ (or/c #f exact-nonnegative-integer?) hash : hash? pos : exact-nonnegative-integer?
If pos is not a valid hash index of hash, then the result may be #f or it may be the next later index that remains valid. The latter result is guaranteed if a hash table has been modified only by the removal of keys.
Changed in version 7.0.0.10 of package base: Handle an invalid index by returning #f instead of raising exn:fail:contract.
procedure
(hash-iterate-key hash pos) → any/c
hash : hash? pos : exact-nonnegative-integer?
procedure
(hash-iterate-key hash pos bad-index-v) → any/c
hash : hash? pos : exact-nonnegative-integer? bad-index-v : any/c
If pos is not a valid hash index for hash, the result is bad-index-v if provided, otherwise the exn:fail:contract exception is raised.
Changed in version 7.0.0.10 of package base: Added the optional bad-index-v argument.
procedure
(hash-iterate-value hash pos) → any
hash : hash? pos : exact-nonnegative-integer?
procedure
(hash-iterate-value hash pos bad-index-v) → any
hash : hash? pos : exact-nonnegative-integer? bad-index-v : any/c
If pos is not a valid hash index for hash, the result is bad-index-v if provided, otherwise the exn:fail:contract exception is raised.
Changed in version 7.0.0.10 of package base: Added the optional bad-index-v argument.
procedure
(hash-iterate-pair hash pos) → (cons any/c any/c)
hash : hash? pos : exact-nonnegative-integer?
procedure
(hash-iterate-pair hash pos bad-index-v) → (cons any/c any/c)
hash : hash? pos : exact-nonnegative-integer? bad-index-v : any/c
If pos is not a valid hash index for hash, the result is (cons bad-index-v bad-index-v) if bad-index-v is provided, otherwise the exn:fail:contract exception is raised.
Added in version 6.4.0.5 of package base.
Changed in version 7.0.0.10: Added the optional bad-index-v argument.
procedure
(hash-iterate-key+value hash pos) →
any/c any/c hash : hash? pos : exact-nonnegative-integer?
procedure
(hash-iterate-key+value hash pos bad-index-v) →
any/c any/c hash : hash? pos : exact-nonnegative-integer? bad-index-v : any/c
If pos is not a valid hash index for hash, the result is (values bad-index-v bad-index-v) if bad-index-v is provided, otherwise the exn:fail:contract exception is raised.
Added in version 6.4.0.5 of package base.
Changed in version 7.0.0.10: Added the optional bad-index-v argument.
4.14.1 Additional Hash Table Functions
(require racket/hash) | package: base |
procedure
(hash-union h0 h ... [ #:combine combine #:combine/key combine/key]) → (and/c hash? immutable?) h0 : (and/c hash? immutable?) h : hash?
combine : (-> any/c any/c any/c) = (lambda _ (error 'hash-union ....))
combine/key : (-> any/c any/c any/c any/c) = (lambda (k a b) (combine a b))
> (hash-union (make-immutable-hash '([1 . one])) (make-immutable-hash '([2 . two])) (make-immutable-hash '([3 . three]))) '#hash((1 . one) (2 . two) (3 . three))
> (hash-union (make-immutable-hash '([1 one uno] [2 two dos])) (make-immutable-hash '([1 eins un] [2 zwei deux])) #:combine/key (lambda (k v1 v2) (append v1 v2))) '#hash((1 . (one uno eins un)) (2 . (two dos zwei deux)))
procedure
(hash-union! h0 h ... [ #:combine combine #:combine/key combine/key]) → void? h0 : (and/c hash? (not/c immutable?)) h : hash?
combine : (-> any/c any/c any/c) = (lambda _ (error 'hash-union ....))
combine/key : (-> any/c any/c any/c any/c) = (lambda (k a b) (combine a b))
> (define h (make-hash)) > h '#hash()
> (hash-union! h (make-immutable-hash '([1 one uno] [2 two dos]))) > h '#hash((1 . (one uno)) (2 . (two dos)))
> (hash-union! h (make-immutable-hash '([1 eins un] [2 zwei deux])) #:combine/key (lambda (k v1 v2) (append v1 v2))) > h '#hash((1 . (one uno eins un)) (2 . (two dos zwei deux)))
procedure
(hash-intersect h0 h ... [ #:combine combine #:combine/key combine/key]) → (and/c hash? immutable?) h0 : (and/c hash? immutable?) h : hash?
combine : (-> any/c any/c any/c) = (lambda _ (error 'hash-intersect ...))
combine/key : (-> any/c any/c any/c any/c) = (lambda (k a b) (combine a b))
> (hash-intersect (make-immutable-hash '((a . 1) (b . 2) (c . 3))) (make-immutable-hash '((a . 4) (b . 5))) #:combine +) '#hash((a . 5) (b . 7))
> (hash-intersect (make-immutable-hash '((a . 1) (b . 2) (c . 3))) (make-immutable-hash '((a . 4) (b . 5))) #:combine/key (lambda (k v1 v2) (if (eq? k 'a) (+ v1 v2) (- v1 v2)))) '#hash((a . 5) (b . -3))
Added in version 7.9.0.1 of package base.