These are new in Standard C, although the idea of const
has been borrowed from C++. Let us get one thing straight: the concepts of const
and volatile
are completely independent. A common misconception is to imagine that somehow const
is the opposite of volatile
and vice versa. They are unrelated and you should remember the fact.
Since const
declarations are the simpler, we'll look at them first, but only after we have seen where both of these type qualifiers may be used. The complete list of relevant keywords is
char long float volatile short signed double void int unsigned const
In that list, const
and volatile
are type qualifiers, the rest are type specifiers. Various combinations of type specifiers are permitted:
char, signed char, unsigned char int, signed int, unsigned int short int, signed short int, unsigned short int long int, signed long int, unsigned long int float double long double
A few points should be noted. All declarations to do with an int
will be signed
anyway, so signed is redundant in that context. If any other type specifier or qualifier is present, then the int part may be dropped, as that is the default.
The keywords const
and volatile
can be applied to any declaration, including those of structures, unions, enumerated types or typedef
names. Applying them to a declaration is called qualifying the declaration—that's why const and volatile are called type qualifiers, rather than type specifiers. Here are a few representative examples:
volatile i; volatile int j; const long q; const volatile unsigned long int rt_clk; struct{ const long int li; signed char sc; }volatile vs;
Don't be put off; some of them are deliberately complicated: what they mean will be explained later. Remember that they could also be further complicated by introducing storage class specifications as well! In fact, the truly spectacular
extern const volatile unsigned long int rt_clk;
is a strong possibility in some real-time operating system kernels.
8.4.1. Const
Let's look at what is meant when const
is used. It's really quite simple: const
means that something is not modifiable, so a data object that is declared with const
as a part of its type specification must not be assigned to in any way during the run of a program. It is very likely that the definition of the object will contain an initializer (otherwise, since you can't assign to it, how would it ever get a value?), but this is not always the case. For example, if you were accessing a hardware port at a fixed memory address and promised only to read from it, then it would be declared to be const
but not initialized.
Taking the address of a data object of a type which isn't const
and putting it into a pointer to the const
-qualified version of the same type is both safe and explicitly permitted; you will be able to use the pointer to inspect the object, but not modify it. Putting the address of a const type into a pointer to the unqualified type is much more dangerous and consequently prohibited (although you can get around this by using a cast). Here is an example:
#include <stdio.h> #include <stdlib.h> main(){ int i; const int ci = 123; /* declare a pointer to a const.. */ const int *cpi; /* ordinary pointer to a non-const */ int *ncpi; cpi = &ci; ncpi = &i; /* * this is allowed */ cpi = ncpi; /* * this needs a cast * because it is usually a big mistake, * see what it permits below. */ ncpi = (int *)cpi; /* * now to get undefined behaviour... * modify a const through a pointer */ *ncpi = 0; exit(EXIT_SUCCESS); }Example 8.3
As the example shows, it is possible to take the address of a constant object, generate a pointer to a non-constant, then use the new pointer. This is an error in your program and results in undefined behaviour.
The main intention of introducing const objects was to allow them to be put into read-only store, and to permit compilers to do extra consistency checking in a program. Unless you defeat the intent by doing naughty things with pointers, a compiler is able to check that const
objects are not modified explicitly by the user.
An interesting extra feature pops up now. What does this mean?
char c; char *const cp = &c;
It's simple really; cp
is a pointer to a char
, which is exactly what it would be if the const
weren't there. The const
means that cp
is not to be modified, although whatever it points to can be—the pointer is constant, not the thing that it points to. The other way round is
const char *cp;
which means that now cp is an ordinary, modifiable pointer, but the thing that it points to must not be modified. So, depending on what you choose to do, both the pointer and the thing it points to may be modifiable or not; just choose the appropriate declaration.
8.4.2. Volatile
After const, we treat volatile
. The reason for having this type qualifier is mainly to do with the problems that are encountered in real-time or embedded systems programming using C. Imagine that you are writing code that controls a hardware device by placing appropriate values in hardware registers at known absolute addresses.
Let's imagine that the device has two registers, each 16 bits long, at ascending memory addresses; the first one is the control and status register (csr) and the second is a data port. The traditional way of accessing such a device is like this:
/* Standard C example but without const or volatile */ /* * Declare the device registers * Whether to use int or short * is implementation dependent */ struct devregs{ unsigned short csr; /* control & status */ unsigned short data; /* data port */ }; /* bit patterns in the csr */ #define ERROR 0x1 #define READY 0x2 #define RESET 0x4 /* absolute address of the device */ #define DEVADDR ((struct devregs *)0xffff0004) /* number of such devices in system */ #define NDEVS 4 /* * Busy-wait function to read a byte from device n. * check range of device number. * Wait until READY or ERROR * if no error, read byte, return it * otherwise reset error, return 0xffff */ unsigned int read_dev(unsigned devno){ struct devregs *dvp = DEVADDR + devno; if(devno >= NDEVS) return(0xffff); while((dvp->csr & (READY | ERROR)) == 0) ; /* NULL - wait till done */ if(dvp->csr & ERROR){ dvp->csr = RESET; return(0xffff); } return((dvp->data) & 0xff); }Example 8.4
The technique of using a structure declaration to describe the device register layout and names is very common practice. Notice that there aren't actually any objects of that type defined, so the declaration simply indicates the structure without using up any store.
To access the device registers, an appropriately cast constant is used as if it were pointing to such a structure, but of course it points to memory addresses instead.
However, a major problem with previous C compilers would be in the while loop which tests the status register and waits for the ERROR
or READY
bit to come on. Any self-respecting optimizing compiler would notice that the loop tests the same memory address over and over again. It would almost certainly arrange to reference memory once only, and copy the value into a hardware register, thus speeding up the loop. This is, of course, exactly what we don't want; this is one of the few places where we must look at the place where the pointer points, every time around the loop.
Because of this problem, most C compilers have been unable to make that sort of optimization in the past. To remove the problem (and other similar ones to do with when to write to where a pointer points), the keyword volatile
was introduced. It tells the compiler that the object is subject to sudden change for reasons which cannot be predicted from a study of the program itself, and forces every reference to such an object to be a genuine reference.
Here is how you would rewrite the example, making use of const
and volatile
to get what you want.
/* * Declare the device registers * Whether to use int or short * is implementation dependent */ struct devregs{ unsigned short volatile csr; unsigned short const volatile data; }; /* bit patterns in the csr */ #define ERROR 0x1 #define READY 0x2 #define RESET 0x4 /* absolute address of the device */ #define DEVADDR ((struct devregs *)0xffff0004) /* number of such devices in system */ #define NDEVS 4 /* * Busy-wait function to read a byte from device n. * check range of device number. * Wait until READY or ERROR * if no error, read byte, return it * otherwise reset error, return 0xffff */ unsigned int read_dev(unsigned devno){ struct devregs * const dvp = DEVADDR + devno; if(devno >= NDEVS) return(0xffff); while((dvp->csr & (READY | ERROR)) == 0) ; /* NULL - wait till done */ if(dvp->csr & ERROR){ dvp->csr = RESET; return(0xffff); } return((dvp->data) & 0xff); }Example 8.5
The rules about mixing volatile
and regular types resemble those for const
. A pointer to a volatile
object can be assigned the address of a regular object with safety, but it is dangerous (and needs a cast) to take the address of a volatile
object and put it into a pointer to a regular object. Using such a derived pointer results in undefined behaviour.
If an array, union or structure is declared with const
or volatile
attributes, then all of the members take on that attribute too. This makes sense when you think about it—how could a member of a const
structure be modifiable?
That means that an alternative rewrite of the last example would be possible. Instead of declaring the device registers to be volatile
in the structure, the pointer could have been declared to point to a volatile
structure instead, like this:
struct devregs{ unsigned short csr; /* control & status */ unsigned short data; /* data port */ }; volatile struct devregs *const dvp=DEVADDR+devno;
Since dvp
points to a volatile
object, it not permitted to optimize references through the pointer. Our feeling is that, although this would work, it is bad style. The volatile
declaration belongs in the structure: it is the device registers which are volatile
and that is where the information should be kept; it reinforces the fact for a human reader.
So, for any object likely to be subject to modification either by hardware or asynchronous interrupt service routines, the volatile type qualifier is important.
Now, just when you thought that you understood all that, here comes the final twist. A declaration like this:
volatile struct devregs{ /* stuff */ }v_decl;
declares the type struct devregs
and also a volatile
-qualified object of that type, called v_decl
. A later declaration like this
struct devregs nv_decl;
declares nv_decl
which is not qualified with volatile
! The qualification is not part of the type of struct devregs
but applies only to the declaration of v_decl
. Look at it this way round, which perhaps makes the situation more clear (the two declarations are the same in their effect):
struct devregs{ /* stuff */ }volatile v_decl;
If you do want to get a shorthand way of attaching a qualifier to another type, you can use typedef
to do it:
struct x{ int a; }; typedef const struct x csx; csx const_sx; struct x non_const_sx = {1}; const_sx = non_const_sx; /* error - attempt to modify a const */
8.4.2.1. Indivisible Operations
Those of you who are familiar with techniques that involve hardware interrupts and other ‘real time’ aspects of programming will recognise the need for volatile
types. Related to this area is the need to ensure that accesses to data objects are ‘atomic’, or uninterruptable. To discuss this is any depth would take us beyond the scope of this book, but we can at least outline some of the issues.
Be careful not to assume that any operations written in C are uninterruptable. For example,
extern const volatile unsigned long realtimeclock;
could be a counter which is updated by a clock interrupt routine. It is essential to make it volatile
because of the asynchronous updates to it, and it is marked const
because it should not be changed by anything other than the interrupt routine. If the program accesses it like this:
unsigned long int time_of_day; time_of_day = real_time_clock;
there may be a problem. What if, to copy one long
into another, it takes several machine instructions to copy the two words making up real_time_clock
and time_of_day
? It is possible that an interrupt will occur in the middle of the assignment and that in the worst case, when the low-order word of real_time_clock
is 0xffff
and the high-order word is 0x0000
, then the low-order word of time_of_day
will receive 0xffff
. The interrupt arrives and increments the low-order word of real_time_clock
to 0x0
and then the high-order word to 0x1
, then returns. The rest of the assignment then completes, with time_of_day
ending up containing 0x0001ffff
and real_time_clock
containing the correct value, 0x00010000
.
This whole class of problem is what is known as a critical region, and is well understood by those who regularly work in asynchronous environments. It should be understood that Standard C takes no special precautions to avoid these problems, and that the usual techniques should be employed.
The header ‘signal.h’
declares a type called sig_atomic_t
which is guaranteed to be modifiable safely in the presence of asynchronous events. This means only that it can be modified by assigning a value to it; incrementing or decrementing it, or anything else which produces a new value depending on its previous value, is not safe.