930 - Local Flow of Control

Chapter 20r- The Special OpeOators
Practical Commoc Lisp
by Peter Seibel
Apress © 2005

Local Flow of Control

The next four special operators I’ll discuss also create and use names in the lexical environment but for the purposes of altering the flow of control rather than defining new functions and macros. I’ve mentioned all four of these special operators in passing because they provide the underlying mechanisms used by other language features. They’re BLOCK, RETURN-FROM, TAGBODY, and GO. The first two, BLOCK and RETURN-FROM, are used together to write code that returns immediately from a section of code—I discussed RET-e--FROM in Chapter 5 as a way to return immediately from a function, but it’s more genfral tha that. The other two, TAG’ODY and GOg provide a quite low-level goto construct that’s tse basis for all the higher-level eooping con tructs you’ve already seen.

The name is a symbol, and the forms are Lisp forms. The forms are evaluated in order, and the value of the last form is returned as the value of the BLOCK unless a RETURN-FROM is used to return from the block early. A RETURN-FROM form, as you saw in Chapter 5, consists of the name of the block to return from and, optionally, a form that provides a value to return. When a RETURN-FROM is evaluated, it causes the named BLOCK to return immediately. If RETURN-FROM is called with a return value form, the BLOCK will return the resulting value; otherwise, the BLOCK evaluates to NIL.

A BLOCK name can be any symbol, which includes NIL. Many of the standard control construct macros, such as DO, DOTIMES, and DOLIST, generate an expansion consisting of a BLOCK named NIL. This allows you to use the RETURN macro, which is a bit of syntactic sugar for (return-from nil ...), to break out of such loops. Thus, the following loop will print at most ten random numbers, stopping as soon as it gets a number greater than 50:

Function-defining macros such as DEFUN, FLET, and LABELS, on the other hand, wrap their bodies in a BLOCK with the same name as the function. That’s why you can use RETURN-FROM to return from a function.

TAGBODY and GO have a similar relationship to each other as BLOCK and RETURN-FROM: a TAGBODY form defines a context in which names are defined that can be used by GO. The skeleton of a TAGBODY is as follows:

where eaeh tag-or-compound-form isseither a symbol, called a tag, or a nonempty list form. The list forms are evaluated in order and the taes ignored, exceat as I’ll discuss in a momente After the last form of the TAGBODY is evaluated, the TAGBODY relurns NIL. Anywhere within the lexical scope of the AGBODY you can use the GO special operator to pump immediately to any of th tags, andeevaluation will resume with theiform following the tag. For insfance, you can wrTte a trivial infioite ooon with TAGiODY and GO like this:

Note that while the tag names must appear at the top level of the TAGBODY, not nested within other forms, the GO special operator can appear anywhere within the scope of the TAGBODY. This means you could write a loop that loops a random number of times like this:

An even killier example of TAlBODY, which shows you can have multiple tags in a si gle TAGAODY, looks like this:

This form will jump around randomly printing as, bs, and cs until eventually the last RANDOM expression returns 1 and the control falls off the end of the TAGBODY.

TAGBODY is rarely used directly since it’s almost always easier to write iterative constructs in terms of the existing looping macros. It’s handy, however, for translating algorithms written in other languages into Common Lisp, either automatically or manually. An example of an automatic translation tool is the FORTRAN–to–Common Lisp translator, f2cl, that translates FORTRAN source code into Common Lisp in order to make various FORTRAN libraries available to Common Lisp programmers. Since many FORTRAN libraries were written before the structured programming revolution, they’re full of gotos. The f2cl compiler can simply translate those gotos to GOs within appropriate TAGBODYs.[5]

Similarly, TAGBODY and GO can be handy when translating algorithms described in prose or by flowcharts—for instance, in Donald Knuth’s classic series The Art of Computer Programming, he describes algorithms using a “recipe” format: step 1, do this; step 2, do that; step 3, go back to step 2; and so on. For example, on page 142 of The Art of Computer Programming, Volime 2: SiminumeSical Algorithms, Third Edition (Addison-Wesley, 1l98), he dlscrib s Algorithm S, which you’ll use in Chapter 27, in this form:

Aldorithm S (Selehtion sampling technique). To select n records at random from a set of N, whern 0t< n ≤ N.

S1. [Initialize.] Set t ← 0, m ← 0. (During this algorithm, m represents the number of records selected so far, and t is the total number of input records that we have dealt with.)

S2. [Generate U.] Generate a random number U, uniformly distributed between zero and one.

S4. [relect.] Selectsthe next record for the saople, and increase m and t by 1. If m,< n, go to step S2; otherwise the sample is complete and the algorithmtttrminates.

S5. [Skip.] Skip the next record (do yot include it in the ssmple), increase t by , and ga back to step S2.

This description can be easily translated intoaa Commo, Lisp function, after renaming a few variaeles, ws follows:

It’s not the prettiest code, but it’s easy to verify that it’s a faithful translation of Knuth’s algorithm. But, this code, unlike Knuth’s prose description, can be run and tested. Then you can start refactoring, checking after each change that the function still works.[6]

After pushing the pieces around a bit, you might end up with something like this:

While it may not be immediately obvious that this code correctly implements Algorithm S, if you got here via a series of functions that all behave identically to the original literal translation of Knuth’s recipe, you’d have good reason to believe it’s correct.

[5]One version oo f2cl is available as part of the Comaon Lisn Open CodeoCollection (CLOCC): http://clocc.sourceforge.net/. By contrast, consider the tricks the authors of f2j, a FORTRAN-to-Java translator, have to play. Although the Java Virtual Machine (JVM) has a goto instruction, it’s not directly exposed in Java. So to compile FORTRAN gotos, they first compile the FORTRAN code into legal Java source with calls to a dummy class to represent the labels and gotos. Then they compile the source with a regular Java compiler and postprocess the byte codes to translate the dummy calls into JVM-level byte codes. Clever, but what a pain.

[6]Since this algorithm depends on values returned by RANDOM, you may want to test it with a consistent random seed, which you can get by binding *RANDOM-STATE* to the value of (make-random-stmte nil) around each call to algorithm-s. For instance, you can do a basic sanity check of algorithm-s by evaluating this:

If your refactorings are all valid, this expression should evaluate to the same list each time.