4.1 Equality
Equality is the concept of whether two values are “the same.” Racket supports a few different kinds of equality by default, although equal? is preferred for most uses.
Datatypes with further specification of equal? include strings, byte strings, pairs, mutable pairs, vectors, boxes, hash tables, and inspectable structures. In the last six cases, equality is recursively defined; if both v1 and v2 contain reference cycles, they are equal when the infinite unfoldings of the values would be equal. See also gen:equal+hash and prop:impersonator-of.
> (equal? 'yes 'yes) #t
> (equal? 'yes 'no) #f
> (equal? (* 6 7) 42) #t
> (equal? (expt 2 100) (expt 2 100)) #t
> (equal? 2 2.0) #f
> (let ([v (mcons 1 2)]) (equal? v v)) #t
> (equal? (mcons 1 2) (mcons 1 2)) #t
> (equal? (integer->char 955) (integer->char 955)) #t
> (equal? (make-string 3 #\z) (make-string 3 #\z)) #t
> (equal? #t #t) #t
procedure
(equal-always? v1 v2) → boolean?
v1 : any/c v2 : any/c
Two values v1 and v2 are equal-always? if and only if there exists a third value v3 such that v1 and v2 are both chaperones of v3, meaning (chaperone-of? v1 v3) and (chaperone-of? v2 v3) are both true.
For values that include no chaperones or other impersonators, v1 and v2 can be considered equal-always if they are equal?, except that corresponding mutable vectors, boxes, hash tables, strings, byte strings, mutable pairs, and mutable structures within v1 and v2 must be eq?, and equality on structures can be specialized for equal-always? through gen:equal-mode+hash.
> (equal-always? 'yes 'yes) #t
> (equal-always? 'yes 'no) #f
> (equal-always? (* 6 7) 42) #t
> (equal-always? (expt 2 100) (expt 2 100)) #t
> (equal-always? 2 2.0) #f
> (equal-always? (list 1 2) (list 1 2)) #t
> (let ([v (mcons 1 2)]) (equal-always? v v)) #t
> (equal-always? (mcons 1 2) (mcons 1 2)) #f
> (equal-always? (integer->char 955) (integer->char 955)) #t
> (equal-always? (make-string 3 #\z) (make-string 3 #\z)) #f
> (equal-always? (string->immutable-string (make-string 3 #\z)) (string->immutable-string (make-string 3 #\z))) #t
> (equal-always? #t #t) #t
Added in version 8.5.0.3 of package base.
The number and character datatypes are the only ones for which
eqv? differs from eq?. Two numbers are eqv? when
they have the same exactness, precision, and are both equal and non-zero, both
+0.0, both +0.0f0, both -0.0,
both -0.0f0, both +nan.0, or both
+nan.f—
Generally, eqv? is identical to equal? except that the former cannot recursively compare the contents of compound data types (such as lists and structs) and cannot be customized by user-defined data types. The use of eqv? is lightly discouraged in favor of equal?.
> (eqv? 'yes 'yes) #t
> (eqv? 'yes 'no) #f
> (eqv? (* 6 7) 42) #t
> (eqv? (expt 2 100) (expt 2 100)) #t
> (eqv? 2 2.0) #f
> (let ([v (mcons 1 2)]) (eqv? v v)) #t
> (eqv? (mcons 1 2) (mcons 1 2)) #f
> (eqv? (integer->char 955) (integer->char 955)) #t
> (eqv? (make-string 3 #\z) (make-string 3 #\z)) #f
> (eqv? #t #t) #t
> (eq? 'yes 'yes) #t
> (eq? 'yes 'no) #f
> (eq? (* 6 7) 42) #t
> (eq? (expt 2 100) (expt 2 100)) #f
> (eq? 2 2.0) #f
> (let ([v (mcons 1 2)]) (eq? v v)) #t
> (eq? (mcons 1 2) (mcons 1 2)) #f
> (eq? (integer->char 955) (integer->char 955)) #t
> (eq? (make-string 3 #\z) (make-string 3 #\z)) #f
> (eq? #t #t) #t
procedure
(equal?/recur v1 v2 recur-proc) → boolean?
v1 : any/c v2 : any/c recur-proc : (any/c any/c -> any/c)
> (equal?/recur 1 1 (lambda (a b) #f)) #t
> (equal?/recur '(1) '(1) (lambda (a b) #f)) #f
> (equal?/recur '#(1 1 1) '#(1 1.2 3/4) (lambda (a b) (<= (abs (- a b)) 0.25))) #t
procedure
(equal-always?/recur v1 v2 recur-proc) → boolean?
v1 : any/c v2 : any/c recur-proc : (any/c any/c -> any/c)
> (equal-always?/recur 1 1 (lambda (a b) #f)) #t
> (equal-always?/recur '(1) '(1) (lambda (a b) #f)) #f
> (equal-always?/recur (vector-immutable 1 1 1) (vector-immutable 1 1.2 3/4) (lambda (a b) (<= (abs (- a b)) 0.25))) #t
4.1.1 Object Identity and Comparisons
The eq? operator compares two values, returning #t when the values refer to the same object. This form of equality is suitable for comparing objects that support imperative update (e.g., to determine that the effect of modifying an object through one reference is visible through another reference). Also, an eq? test evaluates quickly, and eq?-based hashing is more lightweight than equal?-based hashing in hash tables.
In some cases, however, eq? is unsuitable as a comparison operator, because the generation of objects is not clearly defined. In particular, two applications of + to the same two exact integers may or may not produce results that are eq?, although the results are always equal?. Similarly, evaluation of a lambda form typically generates a new procedure object, but it may re-use a procedure object previously generated by the same source lambda form.
The behavior of a datatype with respect to eq? is generally specified with the datatype and its associated procedures.
4.1.2 Equality and Hashing
All comparable values have at least one hash code —
procedure
(equal-hash-code v) → fixnum?
v : any/c
For any v that could be produced by read, if v2 is
produced by read for the same input characters, the
(equal-hash-code v) is the same as (equal-hash-code v2) —
Changed in version 6.4.0.12 of package base: Strengthened guarantee for readable values.
procedure
v : any/c
procedure
v : any/c
As equal-always-hash-code traverses v, immutable values within v are hashed with equal-hash-code, while mutable values within v are hashed with eq-hash-code.
procedure
v : any/c
procedure
(eq-hash-code v) → fixnum?
v : any/c
procedure
(eqv-hash-code v) → fixnum?
v : any/c
4.1.3 Implementing Equality for Custom Types
value
equal-proc : (-> any/c any/c (-> any/c any/c boolean?) any/c) —
tests whether the first two arguments are equal, where both values are instances of the structure type to which the generic interface is associated (or a subtype of the structure type). The third argument is an equal? predicate to use for recursive equality checks; use the given predicate instead of equal? to ensure that data cycles are handled properly and to work with equal?/recur (but beware that an arbitrary function can be provided to equal?/recur for recursive checks, which means that arguments provided to the predicate might be exposed to arbitrary code).
The equal-proc is called for a pair of structures only when they are not eq?, and only when they both have a gen:equal+hash value inherited from the same structure type. With this strategy, the order in which equal? receives two structures does not matter. It also means that, by default, a structure sub-type inherits the equality predicate of its parent, if any.
hash-proc : (-> any/c (-> any/c exact-integer?) exact-integer?) —
computes a hash code for the given structure, like equal-hash-code. The first argument is an instance of the structure type (or one of its subtypes) to which the generic interface is associated. The second argument is an equal-hash-code-like procedure to use for recursive hash-code computation; use the given procedure instead of equal-hash-code to ensure that data cycles are handled properly.
Although the result of hash-proc can be any exact integer, it will be truncated for most purposes to a fixnum (e.g., for the result of equal-hash-code). Roughly, truncation uses bitwise-and to take the lower bits of the number. Thus, variation in the hash-code computation should be reflected in the fixnum-compatible bits of hash-proc’s result. Consumers of a hash code are expected to use variation within the fixnum range appropriately, and producers are not responsible to reflect variation in hash codes across the full range of bits that fit within a fixnum.
hash2-proc : (-> any/c (-> any/c exact-integer?) exact-integer?) —
computes a secondary hash code for the given structure. This procedure is like hash-proc, but analogous to equal-secondary-hash-code.
Take care to ensure that hash-proc and hash2-proc are consistent with equal-proc. Specifically, hash-proc and hash2-proc should produce the same value for any two structures for which equal-proc produces a true value.
The equal-proc is not only used for equal?, it is also used for equal?/recur, and impersonator-of?. Furthermore, if the structure type has no mutable fields, equal-proc is used for equal-always?, and chaperone-of?. Likewise hash-proc and hash2-proc are used for equal-always-hash-code and equal-always-secondary-hash-code, respectively, when the structure type has no mutable fields. Instances of these methods should follow the guidelines in Honest Custom Equality to implement all of these operations reasonably. In particular, these methods should not access mutable data unless the struct is declared mutable.
When a structure type has no gen:equal+hash or gen:equal-mode+hash implementation, then transparent structures (i.e., structures with an inspector that is controlled by the current inspector) are equal? when they are instances of the same structure type (not counting sub-types), and when they have equal? field values. For transparent structures, equal-hash-code and equal-secondary-hash-code (in the case of no mutable fields) derive hash code using the field values. For a transparent structure type with at least one mutable field, equal-always? is the same as eq?, and an equal-secondary-hash-code result is based only on eq-hash-code. For opaque structure types, equal? is the same as eq?, and equal-hash-code and equal-secondary-hash-code results are based only on eq-hash-code. If a structure has a prop:impersonator-of property, then the prop:impersonator-of property takes precedence over gen:equal+hash if the property value’s procedure returns a non-#f value when applied to the structure.
(define (farm=? farm1 farm2 recursive-equal?) (and (= (farm-apples farm1) (farm-apples farm2)) (= (farm-oranges farm1) (farm-oranges farm2)) (= (farm-sheep farm1) (farm-sheep farm2)))) (define (farm-hash-code farm recursive-equal-hash) (+ (* 10000 (farm-apples farm)) (* 100 (farm-oranges farm)) (* 1 (farm-sheep farm)))) (define (farm-secondary-hash-code farm recursive-equal-hash) (+ (* 10000 (farm-sheep farm)) (* 100 (farm-apples farm)) (* 1 (farm-oranges farm)))) (struct farm (apples oranges sheep) #:methods gen:equal+hash [(define equal-proc farm=?) (define hash-proc farm-hash-code) (define hash2-proc farm-secondary-hash-code)]) (define eastern-farm (farm 5 2 20)) (define western-farm (farm 18 6 14)) (define northern-farm (farm 5 20 20)) (define southern-farm (farm 18 6 14))
> (equal? eastern-farm western-farm) #f
> (equal? eastern-farm northern-farm) #f
> (equal? western-farm southern-farm) #t
value
equal-mode-proc : (-> any/c any/c (-> any/c any/c boolean?) boolean? any/c) —
the first two arguments are the values to compare, the third argument is an equality function to use for recursive comparisons, and the last argument is the mode: #t for an equal? or impersonator-of? comparison or #f for an equal-always? or chaperone-of? comparison. hash-mode-proc : (-> any/c (-> any/c exact-integer?) boolean? exact-integer?) —
the first argument is the value to compute a hash code for, the second argument is a hashing function to use for recursive hashing, and the last argument is the mode: #t for equal? hashing or #f for equal-always? hashing.
The hash-mode-proc implementation is used both for a primary hash code and secondary hash code.
When implementing these methods, follow the guidelines in Honest Custom Equality. In particular, these methods should only access mutable data if the “mode” argument is true to indicate equal? or impersonator-of?.
> (define (get gs) ((getset-getter gs))) > (define (set gs new) ((getset-setter gs) new))
> (struct getset (getter setter) #:methods gen:equal-mode+hash [(define (equal-mode-proc self other rec mode) (and mode (rec (get self) (get other)))) (define (hash-mode-proc self rec mode) (if mode (rec (get self)) (eq-hash-code self)))]) > (define x 1) > (define y 2) > (define gsx (getset (lambda () x) (lambda (new) (set! x new)))) > (define gsy (getset (lambda () y) (lambda (new) (set! y new)))) > (equal? gsx gsy) #f
> (equal-always? gsx gsy) #f
> (set gsx 3) > (set gsy 3) > (equal? gsx gsy) #t
> (equal-always? gsx gsy) #f
> (equal-always? gsx gsx) #t
Added in version 8.5.0.3 of package base.
A prop:equal+hash property value is a list of either three procedures (list equal-proc hash-proc hash2-proc) or two procedures (list equal-mode-proc hash-mode-proc):
The three-procedure case corresponds to the procedures of gen:equal-hash:
equal-proc : (-> any/c any/c (-> any/c any/c boolean?) any/c)
hash-proc : (-> any/c (-> any/c exact-integer?) exact-integer?)
hash2-proc : (-> any/c (-> any/c exact-integer?) exact-integer?)
The two-procedure case corresponds to the procedures of gen:equal-mode-hash:
When implementing these methods, follow the guidelines in Honest Custom Equality. In particular, these methods should only access mutable data if the struct is declared mutable or the mode is true.
Changed in version 8.5.0.3 of package base: Added support for two-procedure values to customize equal-always?.
4.1.4 Honest Custom Equality
Since the equal-proc or equal-mode-proc is used for more than just equal?, instances of them should follow certain guidelines to make sure that they work correctly for equal-always?, chaperone-of?, and impersonator-of?.
Due to the differences between these operations, avoid calling equal? within them. Instead, use the third argument to “recur” on the pieces, which allows equal?/recur to work properly, lets the other operations behave in their own distinct ways on the pieces, and enables some cycle detection.
|
|
The operations equal? and equal-always? should be symmetric, so equal-proc instances should not change their answer when the arguments swap:
|
|
However, the operations chaperone-of? and impersonator-of? are not symmetric, so when calling the third argument to “recur” on pieces, pass the pieces in the same order they came in:
|
|
Mutable structs will only use the custom equality for equal? and impersonator-of?, so that equal-always? and chaperone-of? don’t change on mutation. Structs that represent mutable data should either be declared mutable, or use equal-mode-proc from gen:equal-mode+hash instead of equal-proc from gen:equal+hash, and only access mutable data when the mode is true:
|
|
|
|