In Common Lisp, replacing "get-setf-method" with "get-setf-expansion" is not always working

Question

The sortf function from Chapter 12 of the book "On Lisp" originally used get-setf-method, which is currently unavailable in a recent SBCL by default. Therefore, get-setf-expansion was used instead of get-setf-method in a modified version of sortf. However, the modified sortf does not work, because get-setf-expansion returns the same symbol with the fixed name #:NEW1 rather than a symbol created from gensym.

The problem of the same symbol name results in different macro expansions between two versions of sortf. For example, when executing

(macroexpand-1 '(sortf > x (aref ar (incf i)) (car lst)))

the output from the original sortf is

;; Extracted from "On Lisp":
;; https://github.com/showgood/onlisp/blob/fc91b63b1854afb2596d4f27b3e87202214c01c8/onlisp.org?plain=1#L9046

(let* ((#:g1 x)
       (#:g4 ar)
       (#:g3 (incf i))
       (#:g2 (aref #:g4 #:g3))
       (#:g6 lst)
       (#:g5 (car #:g6)))
  (unless (> #:g1 #:g2)
    (rotatef #:g1 #:g2))
  (unless (> #:g1 #:g5)
    (rotatef #:g1 #:g5))
  (unless (> #:g2 #:g5)
    (rotatef #:g2 #:g5))
  (setq x #:g1)
  (system:set-aref #:g2 #:g4 #:g3)
  (system:set-car #:g6 #:g5))

and the output from the modified sortf is

;; Executed with "SBCL 2.1.11.debian"

(LET* ((#:NEW1 X)
       (#:AR636 AR)
       (#:G637 (INCF I))
       (#:NEW1 (AREF #:AR636 #:G637))
       (#:LST638 LST)
       (#:NEW1 (CAR #:LST638)))
  (UNLESS (> #:NEW1 #:NEW1) (ROTATEF #:NEW1 #:NEW1))
  (UNLESS (> #:NEW1 #:NEW1) (ROTATEF #:NEW1 #:NEW1))
  (UNLESS (> #:NEW1 #:NEW1) (ROTATEF #:NEW1 #:NEW1))
  (SETQ X #:NEW1)
  (FUNCALL #'(SETF AREF) #:NEW1 #:AR636 #:G637)
  (SB-KERNEL:%RPLACA #:LST638 #:NEW1))

where the symbol #:NEW1 is repeatedly used.

I'm wondering if there is a better replacement for get-setf-method than get-setf-expansion.

UPDATE

I have tested the sortf that uses get-setf-expansion again, and it works well in most cases, although the result of macroexpand-1 is weird. One exceptional case is

(let ((i 0)
      (x 10)
      (ar #(10 20 30))
      (lst  '(25 30 35))         ; This raise error
      ;; (lst  (list 25 30 35))  ; This make code works
      )
  (sortf > x (aref ar (incf i)) (car lst))
  (list x (aref ar 1) (car lst)))

whose output is (25 20 25) rather than (25 20 10). I think it's not a problem from get-setf-expansion.

Summary of Replies

The #:NEW1 symbols that are generated by get-setf-expansion are different uninterned symbols. Therefore, get-setf-method can be replaced with get-setf-expansion in a macro such as sortf, and the modified macro will work in the same way. To distinguish uninterned symbols of the same name, cl:*print-circle* can be set as t.

In addition, modifying a literal, such as '(25 30 35), with an operation like setf can results in weird output. In particular, the result of modifying a literal can vary depending on the level of compiler optimization of SBCL. In another compiler, modifying a literal can raise compile or runtime error.

Answer 1 · 2025-10-30 08:53:22Z

The following function replaces the 3rd output from get-setf-expansion with newly generated gensym symbols:

(defun get-unique-setf-expansion (place &optional environment)
  "Calls GET-SETF-EXPANSION but replaces all store-vars with fresh gensyms."

  (multiple-value-bind (vars vals store-vars writer-form reader-form)
      (get-setf-expansion place environment)
    
    ;; 1. Generate new unique symbols for the store-vars list
    (let ((new-store-vars (loop for var in store-vars collect (gensym (concatenate 'string (symbol-name var) "-")))))
      
      ;; 2. Create an association list to map the old symbols to the new ones
      (let ((substitution-list (mapcar #'list* store-vars new-store-vars)))
        
        ;; 3. Substitute the old symbols with the new ones in the writer form
        ;;    (Reader-form and other lists usually don't contain store-vars)
        (let ((new-writer-form (sublis substitution-list writer-form)))
          
          ;; Return the new expansion
          (values vars vals new-store-vars new-writer-form reader-form))))))

When get-unique-setf-expansion substitutes for get-unique-method or get-unique-expansion, the expanded expression of the new sortf is similar with that of the original sortf:

;; Executed with "SBCL 2.1.11.debian"

(LET* ((#:NEW1-714 X)
       (#:AR715 AR)
       (#:G716 (INCF I))
       (#:NEW1-717 (AREF #:AR715 #:G716))
       (#:LST718 LST)
       (#:NEW1-719 (CAR #:LST718)))
  (UNLESS (> #:NEW1-714 #:NEW1-717) (ROTATEF #:NEW1-714 #:NEW1-717))
  (UNLESS (> #:NEW1-714 #:NEW1-719) (ROTATEF #:NEW1-714 #:NEW1-719))
  (UNLESS (> #:NEW1-717 #:NEW1-719) (ROTATEF #:NEW1-717 #:NEW1-719))
  (SETQ X #:NEW1-714)
  (FUNCALL #'(SETF AREF) #:NEW1-717 #:AR715 #:G716)
  (SB-KERNEL:%RPLACA #:LST718 #:NEW1-719))

Answer 2 · 2025-10-30 09:55:40Z

Shawn

• Oct 30 at 9:55

It looks like get-setf-method was removed from the CL spec in 1991 (in Issue SETF-METHOD-VS-SETF-METHOD). So it's no wonder SBCL doesn't have it.

(I feel like this should have been a regular question, not the new advice thing)

Answer 3 · 2025-10-30 10:08:38Z

dhnam

• Oct 30 at 10:08

I wrongly chose the type of question as "General Advice/Other", as @Shawn commented. I tried to change the type, but it cannot be changed.

Answer 4 · 2025-10-30 10:15:31Z

Shawn

• Oct 30 at 10:15

Also, I bet the first one is working because the two instances of #:NEW1 are different uninterned symbols that just happen to print the same, but are treated distinctly when the code is executed. (eq '#:NEW1 '#:NEW1) is nil, after all.

Answer 5 · 2025-10-30 10:29:15Z

dhnam

• Oct 30 at 10:29

@Shawn Thank you for explaining why the code works with uninterned symbols, which I was not familiar with.

Answer 6 · 2025-10-30 10:52:14Z

dhnam

• Oct 30 at 10:52

Another reason that causes my misunderstanding was the use of a literal list. In the updated part of the question, the literal list '(25 30 35) is used.

Similarly, when I execute the following code in SBCL

(let ((lst '(0 1 2)))
  (setf (car lst) 10)
  (values
   (car lst)
   lst))

The output is

0
(10 1 2)

Answer 7 · 2025-10-30 11:16:26Z

Shawn

• Oct 30 at 11:16

Modifying a literal list (or other object) is undefined, so weird stuff can happen if you try, yeah.

Answer 8 · 2025-10-30 11:45:18Z

dhnam

• modified Oct 30 at 21:49

I guess that the result of modifying a literal depends on a compiler, as @Shawn commented that modifying a literal is undefined. In SBCL, if I executes (declare (optimize (debug 3))) in advance, the output is (values 10 (10 1 2)). So, high optimization made the inconsistent output (values 0 (10 1 2)). In addition, I should've carefully check the warning message about a literal by SBCL.

Answer 9 · 2025-10-30 16:03:22Z

The literal list issue is a simple compiler optimization. It assumes that the list won't be modified, so (car lst) is replaced with the original car of the list.

It's a little surprising that the result wasn't 0 (0 1 2) since it could have optimized away the reference to lst itself. But since local variable references are cheap (maybe no more expensive than referencing the literal itself), it might not bother with this.

Answer 10 · 2025-10-30 21:54:35Z

dhnam

• Oct 30 at 21:54

@Barmar, Thank you for the explanation about compiler optimization. As you said, calling the car function sounds more expensive than referencing a literal or a variable.

Answer 11 · 2025-10-31 01:07:59Z

The representation #:NEW denotes an uninterned symbol.

Just because you see the same #:NEW in two different instances of a macro expansion doesn't meant that there is any issue.

First of all, we don't know whether it is actually the same symbol, because, look:

[1]> (make-symbol "ABC")
#:ABC
[2]> (make-symbol "ABC")
#:ABC
[3]> (eq '#:ABC '#:ABC)
NIL

The #: notation denotes a symbol with no home package, which is usually an uninterned symbol. (I have to say usually because Ron Garrett may be reading).

Secondly, even if a macro re-uses the same uninterned symbol for multiple expansions, that does not mean there is a hygiene problem. If only that macro knows about that symbol (it cannot clash with anything outside the macro) and if nested expansions of the macro do not clash with each other because of that symbol, then it is cool. For instance we can write a repeat macro like this:

(defmacro repeat (count-form &rest body-forms)
  `(let ((#1=#:count ,count-form))
     (do ((#2=#:dummy 0 (1+ #2#)))
         ((>= #2# #1#))
       ,@body-forms)))

This works fine nested:

[2]> (let ((count 0)) (repeat 3 (repeat 4 (print (incf count)))))
1
2
3
4
5
6
7
8
9
10
11
12
NIL

The expansion appears to repeat symbols:

[3]> (macroexpand '(repeat x a b c))
(LET ((#:COUNT X)) (DO ((#:DUMMY 0 (1+ #:DUMMY))) ((>= #:DUMMY #:COUNT)) A B C)) ;
T
[4]> (macroexpand '(repeat y e f g))
(LET ((#:COUNT Y)) (DO ((#:DUMMY 0 (1+ #:DUMMY))) ((>= #:DUMMY #:COUNT)) E F G)) ;
T

And in fact they are repeated. The #:COUNT and #:DUMMY in both expansions are the same symbols. But, so what? The symbols are not known outside of the macro because they were created when the macro was read. The read notation #: produces a fresh symbol every time (as the eq expression above shows).

The macro uses these symbols in such a way that they are lexically scoped, so when the macro is nested with itself, the reuse makes no difference.

More Lisp macros should arguably be using symbols like this and not wasting resources allocating new gensyms on every call.

I didn't even bother using local variables in the macro to refer to these symbols; I used the circle notation to refer to them in multiple places.

#1=#:COUNT means the same thing as #:COUNT but #:COUNT is associated with the integer label 1. Later in the same expression, wherever we write #1#, it will reproduce the #:COUNT object.

Instead of the circle notation, we can bind those uninterned symbols to local variables and insert the local variables, exactly like we do with gensyms:

(defmacro repeat (count-form &rest body-forms)
   (let ((count '#:count) ;; usually we use (gensym)
         (dummy '#:dummy))
  `(let ((,count ,count-form))
     (do ((,dummy 0 (1+ ,dummy)))
         ((>= ,dummy ,count))
       ,@body-forms))))

Needless to say, if we write some other macro which also uses #:count, that #:count will be a different symbol. It's only the same symbol across multiple expansions of repeat, and we know that not to be a problem.

There is a risk to this techique and it occurs when a macro refers to itself in its own expansion. Suppose repeat had some reason to use (repeat ...) inside the expansion, and suppose that repeat inserted some of the variables into that syntax: (repeat ... ,dummy). Now it is in trouble because the dummy symbol is lexically intercepted inside the inner repeat. For that case, we need gensym. gensyms ensure correctness of even nestings that are instigated by the macro itself, rather than external invocation nestings.

Answer 12 · 2025-10-31 03:11:38Z

@Kaz, Thank you for introducing the #: notation and the circle notation. As you said, gensym can be omitted in many macros, without creating many different uninterned symbols that share the same names.

In contrast, in the sortf macro, different #:NEW1 symbols should be used, and get-setf-expansion already makes a new uninterned #:NEW1 whenever it's used. In sortf, the number of #:NEW1 symbols is proportional to the number of arguments that are passed to sortf. Therefore, sortf cannot use the fixed number of symbols that can be generated by the #: notation.

Answer 13 · 2025-10-31 10:08:28Z

Rainer Joswig

• Oct 31 at 10:08

use cl:*print-circle* set to t to see identity (-> which printed objects are the same) of objects in the list.

Answer 14 · 2025-10-31 10:16:31Z

Rainer Joswig

• Oct 31 at 10:16

You are using an 4 year old version of SBCL. The current version of SBCL is 2.5.10.

Answer 15 · 2025-10-31 10:27:47Z

In addition, modifying a literal, such as '(25 30 35), with an operation like setf can be problematic. In particular, the result of modifying a literal can vary depending on the level of compiler optimization.

There are possibly several reasons why one should not modify a literal constant. Effects of compiler optimization is only one.

(defun foo ()
  (let ((a '(1 2 3)))
    (setf (first a) 10)))

In above code one tries to modify a literal list, which is a part of the code. There is no compiler optimization involved -> one is modifying code.

This can have unwanted observable effects or may even lead to an error (runtime or compile-time). Imagine a Common Lisp compiler which places code into non-writeable memory -> then the runtime (or even the CPU) might detect that attempt to change code. This gets more relevant on architectures, where code is protected against runtime changes. In addition a compiler might detect at compile time the part which attempts to modify code and will report that.

Answer 16 · 2025-10-31 12:09:36Z

@rainer-joswig, Thank you for the tip of cl:*print-circle* and the explanation about the problem of modifying a literal. With cl:*print-circle* that is set as t, macroexpand-1 explicitly shows differences between uninterned symbols of the same name. In addition, modifying a literal can be a problem even if compiler optimization is not applied, as a literal is defined as a fixed value which should not be updated while running a program.

Collectives™ on Stack Overflow

In Common Lisp, replacing "get-setf-method" with "get-setf-expansion" is not always working

16 Replies 16

Your Reply

Collectives™ on Stack Overflow

In Common Lisp, replacing "get-setf-method" with "get-setf-expansion" is not always working

16 Replies 16

Your Reply

Sign up or log in

Post as a guest