What is the difference between integrity constraints and referential integrity?
Good question. Referential integrity, also known as relational integrity, means that if a table contains a foreign key column, then every value in that column (except NULL, if it is allowed) will be found in the primary key of the table that it is related to, or references (the syntax for declaring the foreign key uses the keyword REFERENCES).
For example, let's say you have a table of users and a table of images, where each image belongs to a user:
users id name 23 Tom 25 Dick 31 Harry images id user path 187 23 /images/emo.jpg 188 31 /images/kitten.jpg 189 24 /images/manamana.jpg 190 25 /images/phlogiston.jpg
Can you spot the referential integrity problem?
Sometimes it's difficult to convince experienced programmers that they should declare their tables with foreign keys. "Why couldn't we," they ask, "just do a lookup on the user to ensure it exists, and then just insert the image?" The answer to that question, of course, is that (a) not all updates to the database are going to happen through the application (updates can also occur via bulk loads, SQL windows, command lines, etc.), and (b) why do programmers feel they need to reinvent the wheel? The database will look it up for you if the foreign key has been declared.
Referential integrity is only one type of database integrity, although arguably it might be the most important. Integrity constraints are any constraints that ensure integrity in databases. The various types of integrity constraints are:
- NOT NULL
- ensures that a column cannot have a NULL value.
- UNIQUE KEY
- ensures that each value in the key occurs at most once (note that unique keys can consist of more than one column).
- PRIMARY KEY
- same as unique key, except a primary key cannot be NULL. Primary key integrity is also called Entity Integrity.
- FOREIGN KEY
- values (except NULL, if allowed) must be found in referenced primary or unique key.
- provides a condition which is enforced on every row of the table. For example, CHECK (age >= 18) ensures that you cannot insert or update a row with an age less than 18. Check constraints are often used for Domain Integrity, to ensure that a column will have only certain specified values.
- a condition enforced across tables (and therefore declared separately from CREATE TABLE statements). Few DBMS systems support assertions.
This was first published in August 2007