Computer Security in the Real World

 

 

Butler W. Lampson[1]

Microsoft

 

Note: This web page was converted automatically from a Word original. There may be problems with the formatting and the pictures. To see the intended form, follow one of the links below.

Citation. Proc. Annual Computer Security Applications Conference, 2000.

Links: Abstract, Postscript, Acrobat, Web page, Word.

Email: blampson@microsoft.com. This paper is at http://www.research.microsoft.com/lampson.

 

 


Abstract

After thirty years of work on computer security, why are almost all the systems in service today extremely vulnerable to attack? The main reason is that security is expensive to set up and a nuisance to run, so people judge from experience how little of it they can get away with. Since there’s been little damage, people decide that they don’t need much security. In addition, setting it up is so complicated that it’s hardly ever done right. While we await a catastrophe, simpler setup is the most important step toward better security.

In a distributed system with no central management like the Internet, security requires a clear story about who is trusted for each step in establishing it, and why. The basic tool for telling this story is the “speaks for” relation between principals that describes how authority is delegated, that is, who trusts whom. The idea is simple, and it explains what’s going on in any system I know. The many different ways of encoding this relation often make it hard to see the underlying order.

 

 

1           Introduction

People have been working on computer system security for at least 30 years. During this time there have been many intellectual successes. Notable among them are the subject/object access matrix model [11], access control lists [17], multilevel security using information flow [6, 13] and the star-property [3], public key cryptography [14], and cryptographic protocols [1]. In spite of these successes, it seems fair to say that in an absolute sense, the security of the hundreds of millions of deployed computer systems is terrible: a determined and competent attacker could destroy most of the information on almost any of these systems, or steal it from any system that is connected to a network. Even worse, the attacker could do this to millions of systems at once.

On the other hand, not much harm is actually being done by attacks on these insecure systems. Once or twice a year an email virus such as “I love you” infects a million or two machines, and newspapers print extravagant estimates of the damage it does, but these are minor annoyances. There is no accurate data about the cost of failures in computer security. On the one hand, most of them are never made public for fear of embarrassment. On the other, when a public incident does occur, the security experts and vendors of antivirus software that talk to the media have every incentive to greatly exaggerate its costs. But money talks. Many vendors of security have learned to their regret that although people complain about inadequate security, they won’t spend much money, sacrifice many features, or put up with much inconvenience in order to improve it. This strongly suggests that bad security is not really costing them much.

Of course, computer security is not just about computer systems. Like any security, it is only as strong as its weakest link, and the links include the people and the physical security of the system. Very often the easiest way to break into a system is to bribe an insider. This short paper, however, is limited to computer systems.

What do we want from secure computer systems? Here is a reasonable goal:

Computers are as secure as real world systems, and people believe it.

Most real world systems are not very secure by the absolute standard suggested above. It’s easy to break into someone’s house. In fact, in many places people don’t even bother to lock their houses, although in Manhattan they may use two or three locks on the front door. It’s fairly easy to steal something from a store. You need very little technology to forge a credit card, and it’s quite safe to use a forged card at least a few times.

Why do people live with such poor security in real world systems? The reason is that security is not about perfect defenses against determined attackers. Instead, it’s about

value,

locks, and

punishment.

The bad guy balances the value of what he gains against the risk of punishment, which is the cost of punishment times the probability of getting punished. The main thing that makes real world systems sufficiently secure is that bad guys who do break in are caught and punished often enough to make a life of crime unattractive. The purpose of locks is not to provide absolute security, but to prevent casual intrusion by raising the threshold for a break-in.

Well, what’s wrong with perfect defenses? The answer is simple: they cost too much. There is a good way to protect personal belongings against determined attackers: put them in a safe deposit box. After 100 years of experience, banks have learned how to use steel and concrete, time locks, alarms, and multiple keys to make these boxes quite secure. But they are both expensive and inconvenient. As a result, people use them only for things that are seldom needed and either expensive or hard to replace.

Practical security balances the cost of protection and the risk of loss, which is the cost of recovering from a loss times its probability. Usually the probability is fairly small (because the risk of punishment is high enough), and therefore the risk of loss is also small. When the risk is less than the cost of recovering, it’s better to accept it as a cost of doing business (or a cost of daily living) than to pay for better security. People and credit card companies make these decisions every day.

With computers, on the other hand, security is only a matter of software, which is cheap to manufacture, never wears out, and can’t be attacked with drills or explosives. This makes it easy to drift into thinking that computer security can be perfect, or nearly so. The fact that work on computer security has been dominated by the needs of national security has made this problem worse. In this context the stakes are much higher and there are no police or courts available to punish attackers, so it’s more important not to make mistakes. Furthermore, computer security has been regarded as an offshoot of communication security, which is based on cryptography. Since cryptography can be nearly perfect, it’s natural to think that computer security can be as well.

What’s wrong with this reasoning? It ignores two critical facts:

·                             Secure systems are complicated, hence imperfect.

·                             Security gets in the way of other things you want.

Software is complicated, and it’s essentially impossible to make it perfect. Even worse, security has to be set up by establishing user accounts and passwords, access control lists on resources, and trust relationships between organizations. In a world of legacy hardware and software, networked computers, mobile code, and constantly changing relationships between organizations, setup gets complicated. And it’s easy to think up scenarios in which you want precise control over who can do what. Features put in to address such scenarios make setup even more complicated.

Security gets in the way of other things you want. For software developers, security interferes with features and with time to market. This leads to such things as a widely used protocol for secure TCP/IP connections that use the same key for every session as long as the user’s password stays the same [20], or an endless stream of buffer-overrun errors in privileged programs, each one making it possible for an attacker to take control of the system.

For users and administrators, security interferes with getting work done conveniently, or in some cases at all. This is more important, since there are lot more users than developers. Security setup also takes time, and it contributes nothing to useful output. Furthermore, if the setup is too permissive no one will notice unless there’s an audit or an attack. This leads to such things as users whose password is their first name, or a large company in which more than half of the installed database servers have a blank administrator password [9], or public access to databases of credit card numbers  [22, 23], or e-mail clients that run attachments containing arbitrary code with the user’s privileges [4].

Furthermore, the Internet has made computer security much more difficult than it used to be. In the good old days, a computer system had a few dozen users at most, all members of the same organization. It ran programs written in-house or by a few vendors. Information was moved from one computer to another by carrying tapes or disks.

Today half a billion people all over the world are on the Internet, including you. This poses a big new set of problems.

·                             Attack from anywhere: Any one on the Internet can take a poke at your system.

·                             Sharing with anyone: On the other hand, you may want to communicate or share information with any other Internet user.

·                             Automated infection: Your system, if compromised, can spread the harm to many others in a few seconds.

·                             Hostile code: Code from many different sources runs on your system, usually without your knowledge if it comes from a Web page. The code might be hostile, but you can’t just isolate it, because you want it to work for you.

·                             Hostile environment: A mobile device like a laptop may find itself in a hostile environment that attacks its physical security.

·                             Hostile hosts: If you own information (music or movies, for example), it gets downloaded to your customers’ systems, which may be hostile and try to steal it.

1.1         Real security?

The end result should not be surprising. We don’t have “real” security that guarantees to stop bad things from happening, and the main reason is that people don’t buy it. They don’t buy it because the danger is small, and because security is a pain.

·                             Since the danger is small, people prefer to buy features. A secure system has fewer features because it has to be implemented correctly. This means that it takes more time to build, so naturally it lacks the latest features.

·                             Security is a pain because it stops you from doing things, and you have to do work to authenticate yourself and to set it up.

A secondary reason we don’t have “real” security is that systems are complicated, and therefore both the code and the setup have bugs that an attacker can exploit. This is the reason that gets all the attention, but it is not the heart of the problem.

Will things get better? Certainly if there are some major security catastrophes, buyers will change their priorities and systems will become more secure. Short of that, the best we can do is to drastically simplify the parts of systems that have to do with security:

·                             Users need to have at most three categories for authorization: me, my group or company, and the world.

·                             Administrators need to write policies that control security settings in a uniform way, since they can’t deal effectively with lots of individual cases.

·                             Everyone needs a uniform way to do end-to-end authentication and authorization across the entire Internet.

Since people would rather have features than security, most of these things are unlikely to happen.

 On the other hand, don’t forget that in the real world security depends more on police than on locks, so detecting attacks, recovering from them, and punishing the bad guys are more important than prevention.

Section 2.3 discusses these points in more detail. For a fuller account, see Bruce Schneier’s recent book [19].

1.2         Outline

The next section gives an overview of computer security, highlighting matters that are important in practice. Section 3 explains how to do Internet-wide end-to-end authentication and authorization.

2           Overview of computer security

Like any computer system, a secure system can be studied under three headings:

Specification:

What is it supposed to do?

Implementation:

How does it do it?

Correctness:

Does it really work?

In security they are called policy, mechanism, and assurance, since it’s customary to give new names to familiar concepts. Thus we have the correspondence:

Specification

Policy

Implementation

Mechanism

Correctness

Assurance

Assurance is especially important for security because the system must withstand malicious attacks, not just ordinary use. Deployed systems with many happy users often have thousands of bugs. This happens because the system enters very few of its possible states during ordinary use. Attackers, of course, try to drive the system into states that they can exploit, and since there are so many bugs, this is usually quite easy.

This section briefly describes the standard ways of thinking about policy and mechanism. It then discusses assurance in more detail, since this is where security failures occur.

2.1         Policy: Specifying security

Organizations and people that use computers can describe their needs for information security under four major headings [15]:

·                             Secrecy: controlling who gets to read information.

·                             Integrity: controlling how information changes or resources are used.

·                             Availability: providing prompt access to information and resources.

·                             Accountability: knowing who has had access to information or resources.

They are usually trying to protect some resource against danger from an attacker. The resource is usually either information or money. The most important dangers are:

Vandalism or sabotage that

 

—damages information

—disrupts service

integrity

availability

Theft

 

—of money

integrity

—of information

secrecy

Loss of privacy

secrecy

Each user of computers must decide what security means to them. A description of the user’s needs for security is called a security policy.

Most policies include elements from all four categories, but the emphasis varies widely. Policies for computer systems are usually derived from policies for security of systems that don’t involve computers. The military is most concerned with secrecy, ordinary businesses with integrity and accountability, telephone companies with availability. Obviously integrity is also important for national security: an intruder should not be able to change the sailing orders for a carrier, and certainly not to cause the firing of a missile or the arming of a nuclear weapon. And secrecy is important in commercial applications: financial and personnel information must not be disclosed to outsiders. Nonetheless, the difference in emphasis remains [5].

A security policy has both a positive and negative aspect. It might say, “Company confidential information should be accessible only to properly authorized employees”. This means two things: properly authorized employees should have access to the information, and other people should not have access. When people talk about security, the emphasis is usually on the negative aspect: keeping out the bad guy. In practice, however, the positive aspect gets more attention, since too little access keeps people from getting their work done, which draws attention immediately, but too much access goes undetected until there’s a security audit or an obvious attack,[2] which hardly ever happens. This distinction between talk and practice is pervasive in security.

This paper deals mostly with integrity, treating secrecy as a dual problem. It has little to say about availability, which is a matter of keeping systems from crashing and allocating resources both fairly and cheaply. Most attacks on availability work by overloading systems that do too much work in deciding whether to accept a request.

2.2         Mechanism: Implementing security

Of course, one man’s policy is another man’s mechanism. The informal access policy in the previous paragraph must be elaborated considerably before it can be enforced by a computer system. Both the set of confidential information and the set of properly authorized employees must be described precisely. We can view these descriptions as more detailed policy, or as implementation of the informal policy.

In fact, the implementation of security has two parts: the code and the setup or configuration. The code is the programs in the trusted computing base. The setup is all the data that controls the operations of these programs: access control lists, group memberships, user passwords or encryption keys, etc.

The job of a security implementation is to defend against vulnerabilities. These take three main forms:

1)       Bad (buggy or hostile) programs.

2)       Bad (careless or hostile) agents, either programs or people, giving bad instructions to good but gullible programs.

3)       Bad agents tapping or spoofing communications.

Case (2) can be cascaded through several levels of gullible agents. Clearly agents that might get instructions from bad agents must be prudent, or even paranoid, rather than gullible.

Broadly speaking, there are four defensive strategies:

1)                   Keep everybody out. This is complete isolation. It provides the best security, but it keeps you from using information or services from others, and from providing them to others. This is impractical for all but a few applications.

2)                   Keep the bad guys out. It’s all right for programs inside this defense to be gullible. Code signing and firewalls do this.

3)                   Let the bad guys in, but keep them from doing damage. Sandboxing does this, whether the traditional kind provided by an operating system process, or the modern kind in a Java virtual machine. Sandboxing typically involves access control on resources to define the holes in the sandbox. Programs accessible from the sandbox must be paranoid; it’s hard to get this right.

4)                   Catch the bad guys and prosecute them. Auditing and police do this.

The well-known access control model provides the framework for these strategies. In this model, a guard[3] controls the access of requests for service to valued resources, which are usually encapsulated in objects.

The guard’s job is to decide whether the source of the request, called a principal, is allowed to do the operation on the object. To decide, it uses two kinds of information: authentication information from the left, which identifies the principal who made the request, and authorization information from the right, which says who is allowed to do what to the object. As we shall see in section 3, there are many ways to make this division.

The reason for separating the guard from the object is to keep it simple. If security is mixed up with the rest of the object’s implementation, it’s much harder to be confident that it’s right. The price paid for this is that in general the decisions must be made based only on the principal, the method of the object, and perhaps the parameters. For instance, if you want a file system to enforce quotas only for novice users, there are only two ways to do it within this model:

1)                   Have separate methods for writing with quotas and without, and don’t authorize novice users to write without quotas.

2)                   Have a separate quota object that the file system calls on the user’s behalf.

Of course security still depends on the object to implement its methods correctly. For instance, if a file’s read method changes its data, or the write method fails to debit the quota, or either one touches data in other files, the system is insecure in spite of the guard.

Another model is sometimes used when secrecy in the face of bad programs is a primary concern: the information flow control model [6, 13]. This is roughly a dual of the access control model, in which the guard decides whether information can flow to a principal.

In either model, there are three basic mechanisms for implementing security. Together, they form the gold standard for security:

·                             Authenticating principals, answering the question “Who said that?” or “Who is getting that information?”. Usually principals are people, but they may also be groups, machines, or programs.

·                             Authorizing access, answering the question “Who can do which operations on this object?”.

·                             Auditing the decisions of the guard, so that later it’s possible to figure out what happened and why.

2.3         Assurance: Making security work

The unavoidable price of reliability is simplicity.  (Hoare)

What does it mean to make security work? The answer is based on the idea of a trusted computing base (TCB), the collection of hardware, software, and setup information on which the security of a system depends. Some examples may help to clarify this idea.

·                             If the security policy for the machines on a LAN is just that they can access the Web but no other Internet services, and no inward access is allowed, then the TCB is just the firewall (hardware, software, and setup) that allows outgoing port 80 TCP connections, but no other traffic.[4] If the policy also says that no software downloaded from the Internet should run, then the TCB adds the browser code and setup that disables Java and other software downloads.[5]

·                             If the security policy for a Unix system is that users can read system directories, and read and write their home directories, then the TCB is roughly the hardware, the Unix kernel, and any program that can write a system directory (including any that runs as superuser). This is quite a lot of software. It also includes /etc/passwd and the permissions on system and home directories.

The idea of a TCB is closely related to the end-to-end principle [18]—just as reliability depends only on the ends, security depends only on the TCB. In both cases, performance and availability isn’t guaranteed.

In general, it’s not easy to figure out what is in the TCB for a given security policy. Even writing the specs for the components is hard, as the examples may suggest.

For security to work perfectly, the specs for all the TCB components must be strong enough to enforce the policy, and each component has to satisfy its spec. This level of assurance has seldom been attempted. Essentially always, people settle for something much weaker and accept that both the specs and the implementation will be flawed. Either way, it should be clear that a smaller TCB is better.

A good way to make defects in the TCB less harmful is to use defense in depth, redundant mechanisms for security. For example, a system might include:

·                             Network level security, using a firewall.

·                             Operating system security, using sandboxing to isolate programs. This can be done by a base OS like Windows 2000 or Unix, or by a higher-level OS like a Java VM.

·                             Application level security that checks authorization directly.

The idea is that it will be hard for an attacker to simultaneously exploit flaws in all the levels. Defense in depth offers no guarantees, but it does seem to help in practice.

Most discussions of assurance focus on the software (and occasionally the hardware), as I have done so far. But the other important component of the TCB is all the setup or configuration information, the knobs and switches that tell the software what to do. In most systems deployed today there is a lot of this information, as anyone who has run one will know. It includes:

1)                   What software is installed with system privileges, and perhaps what software is installed that will run with the user’s privileges. “Software” includes not just binaries, but anything executable, such as shell scripts or macros.

2)                   The database of users, passwords (or other authentication data), privileges, and group memberships. Often services like SQL servers have their own user database.

3)                   Network information such as lists of trusted machines.

4)                   The access controls on all the system resources: files, services (especially those that respond to requests from the network), devices, etc.

5)                   Doubtless many other things that I haven’t thought of.

Although setup is much simpler than code, it is still complicated, it is usually done by less skilled people, and while code is written once, setup is different for every installation. So we should expect that it’s usually wrong, and many studies confirm this expectation. The problem is made worse by the fact that setup must be based on the documentation for the software, which is usually voluminous, obscure, and incomplete at best.[6] See [2] for an eye-opening description of these effects in the context of financial cryptosystems, [16] for an account of them in the military, and [19] for many other examples.

The only solution to this problem is to make security setup much simpler, both for administrators and for users. It’s not practical to do this by changing the base operating system, both because changes there are hard to carry out, and because some customers will insist on the fine-grained control it provides. Instead, take advantage of this fine-grained control by using it as a “machine language”. Define a simple model for security with a small number of settings, and then compile these into the innumerable knobs and switches of the base system.

What form should this model take?

Users need a very simple story, with about three levels of security: me, my group or company, and the world, with progressively less authority. Browsers classify the network in this way today. The corresponding private, shared, and public data should be in three parts of the file system: my documents, shared documents, and public documents. This combines the security of data with where it is stored, just as the physical world does with its public bulletin boards, private houses, locked file cabinets, and safe deposit boxes. It’s familiar, there’s less to set up, and it’s obvious what the security of each item is.

Everything else should be handled by security policies that vendors or administrators provide. In particular, policies should classify all programs as trusted or untrusted based on how they are signed, unless the user overrides them explicitly. Untrusted programs can be rejected or sandboxed; if they are sandboxed, they need to run in a completely separate world, with separate global state such as user and temporary folders, history, web caches, etc. There should be no communication with the trusted world except when the user explicitly copies something by hand. This is a bit inconvenient, but anything else is bound to be unsafe.

Administrators still need a fairly simple story, but they need even more the ability to handle many users and systems in a uniform way, since they can’t deal effectively with lots of individual cases. The way to do this is to let them define so-called security policies[7], rules for security settings that are applied automatically to groups of machines. These should say things like:

·                             Each user has read/write access to their home folder on a server, and no one else has this access.

·                             A user is normally a member of one workgroup, which has access to group home folders on all its members’ machines and on the server.

·                             System folders must contain sets of files that form a vendor-approved release.

·                             All executable programs must be signed by a trusted authority.

 These policies should usually be small variations on templates provided and tested by vendors, since it’s too hard for most administrators to invent them from scratch. It should be easy to turn off backward compatibility with old applications and network nodes, since administrators can’t deal with the security issues it causes.

Some customers will insist on special cases. This means that useful exception reporting is essential. It should be easy to report all the variations from standard practice in a system, especially variations in the software on a machine, and all changes from a previous set of exceptions. The reports should be concise, since long ones are sure to be ignored.

To make the policies manageable, administrators need to define groups of users (sometimes called “roles”) and of resources, and then state the policies concisely in terms of these groups. Ideally, groups of resources follow the file system structure, but there need to be other ways to define them to take account of the baroque conventions in existing networks, OS’s and applications.

The implementation of policies is usually by compiling them into existing security settings. This means that existing resource managers don’t have to change, and it also allows for both powerful high-level policies and efficient enforcement, just as compilers allow both powerful programming languages and efficient execution.

Developers need a type-safe language like Java; this will eliminate a lot of bugs. Unfortunately, most of the bugs that hurt security are in system software that talks to the network, and it will be a while before system code is written that way.

They also need a development process that takes security seriously, valuing designs that make assurance easier, getting them reviewed by security professionals, and refusing to ship code with serious security flaws.

3           End-to-end access control

Any problem in computer science can be solved with another level of indirection.        (Wheeler)

Secure distributed systems need a way to handle authentication and authorization uniformly throughout the Internet. In this section we first explain how security is done locally today, and then describe the principles that underlie a uniform end-to-end scheme.

3.1         Local access control

Most existing systems do authentication and authorization locally. They have a local database for user authentication (usually by passwords) and group membership, and a local database of authorization information, usually in the form of an access control list (ACL) on each resource.

In these systems access control works like this:

·                             It’s assumed that the channel on which the user communicates with the system is secure, that is, only the user and the system can send or receive messages on that channel.

·                             The system has a local database of user names and passwords (or hashes of passwords). This also records which users are members of which groups. Usually it stores an internal security identifier (SID) for each user and group as well.

·                             The user authenticates the channel by sending a password response (some function of the password and a challenge) to the system. This is called “logging in”. After verifying the response from the local database, the system creates a process for the user, attaches it to the channel, and assigns the user and group SIDs to it as its identity. If this process creates others, they get the same SIDs, or perhaps a subset.

·                             An executable file (program image) can also have an identity (called setuid in Unix). This means that if a process is started up running this program, it gets the program’s identity as well as the caller’s, and it can switch between the two identities. This switching is sometimes called “impersonation”.

·                             Each resource object has an ACL that is a list of SIDs along with the access each one is permitted. When a process calls a method of the object, a guard (usually the OS kernel) checks that one of the SIDs in the process’ identity is on the ACL with the right access permission. An object can read its caller’s identity and do its own access checking if it wants to.

Operating systems like Unix and Windows 2000 do security this way; they either rely on physical security or luck to secure the channel to the user, or use an encrypted channel protocol like PPTP. The databases for both authentication (user names, passwords, SIDs, and groups) and authorization (ACLs) are strictly local.

You might think that security on the web is more global or distributed, but in fact web servers work the same way. They usually use SSL to secure the user channel; this also authenticates the server’s DNS name, but users hardly ever pay any attention. Authorization is primitive, since usually the only protected resources are the entire service and the private data that it keeps for each user. Each server farm has a separate local user database.

There is a slight extension of this strictly local scheme, in which each system belongs to a “domain” and the authentication database is stored centrally on a domain controller. The basic idea is very simple; the following description omits many details about how the bits are routed and how the secure messages are formatted.

Each system in the domain has a secure channel to the controller, implemented by a shared key that encrypts messages between the two; this key is set up when the system joins the domain. To log in a user the login system, instead of doing the work itself, does an RPC to the controller, passing in the user’s password response. The controller does exactly what the login system did by itself in the earlier scheme, and returns the user’s identity. It also returns a token that the login system can use later to reauthenticate the user to the controller. SIDs are no longer local to the individual system but span the domain.

Kerberos, Windows NT, and Passport all work this way. In Kerberos the reauthentication token is confusingly called a “ticket-granting ticket”.

To authenticate the user to another system in the domain, the login system can ask the controller to forward the authentication to the target system (or it can forward the password response, an old and deprecated method). In Kerberos the domain controller does this by sending a message called a “ticket” to the target on the secure channel between them.[8]

Authentication to another domain works the same way, except that there is another level of indirection through the target domain’s controller. A shared key between the two domains secures this channel. The secure communication is thus login system to login controller to target controller to target. A further extension organizes the domains in a tree and uses this scheme repeatedly, once for each domain on the path through the common ancestor.

Unless the domains trust each other completely, each one should have its own space of SIDs and should only be trusted to assign its own SIDs to a user. Otherwise any domain can assign any SID to any user, so that the Microsoft subsidiary in Russia can authenticate someone as Bill Gates. Unfortunately Windows 2000 inter-domain se­curity today omits this precaution. See section 3.5 for more on this point.

3.2         Distributed access control

A distributed system may involve systems (and people) that belong to different organizations and are managed differently. To do access control cleanly in such a system (as opposed to the strictly local systems of the last section) we need a way to treat uniformly all the items of infor­mation that contribute to the decision to grant or deny access. Consider the following example:

Alice at Intel is part of a team working on a joint Intel-Microsoft project called Atom. She logs in, using a smart card to authenticate herself, and connects using SSL to a project web page called Spectra at Microsoft.  The web page grants her access because:

1)                   The request comes over an SSL connection secured with a session key KSSL.

2)                   To authenticate the SSL connection, Alice’s smart card uses her key KAlice to sign a response to a challenge from the Microsoft server.[9]

3)                   Intel certifies that KAlice is the key for Alice@In­tel. com.

4)                   Microsoft’s group database says that Alice@Intel. com is in the Atom group.

5)       The ACL on the Spectra page says that Atom has read/write access.

For brevity, we drop the .com from now on.

To avoid the need for the smart card to re-authenticate Alice to Microsoft, the card can authenticate a temporary key Ktemp on her login system, and that key can authenticate the login connection. (2) is then replaced by:

2a)    Alice’s smart card uses her key KAlice to sign a certificate for the temporary key Ktemp owned by the login system.

2b)    Alice’s login system authenticates the SSL connection by using Ktemp to sign a response to a challenge from the Microsoft server.

From this example we can see that many different kinds of information contribute to the access control decision:

Authenticated session keys

User passwords or public keys

Delegations from one system to another

Group memberships

ACL entries.

We want to do a number of things with this information:

·                             Keep track of how secure channels are authenticated, whether by passwords, smart cards, or systems.

·                             Make it secure for Microsoft to accept Intel’s authentication of Alice.

·                             Handle delegation of authority to a system, for example, Alice’s login system.

·                             Handle authorization via ACLs like the one on the Spectra page.

·                             Record the reasons for an access control decision so that it can be audited later.

3.3         Chains of responsibility

What is the common element in all the steps of the example and all the different kinds of information? From the example we can see that there is a chain of responsibility running from the request at one end to the Spectra resource at the other. A link of this chain has the form

“Principal P speaks for principal Q about subjects T.”

For example, KSSL speaks for KAlice about everything, and Atom@Microsoft speaks for Spectra about read and write.

The idea of “speaks for” is that if P says something about T, then Q says it too. Put another way, Q takes responsibility for anything that P says about T. A third way, P is a more powerful principal than Q (at least with respect to T) since P’s statements are taken at least as seriously as Q’s, and perhaps more seriously.

The notion of principal is very general, encompassing any entity that we can imagine making statements. In the example, secure channels, people, systems, groups, and resource objects are all principals. We can think of a principal as in some sense equivalent to the set of statements that it ever makes. A stronger principal makes more statements.

The idea of “about subjects T” is that T is some way of describing a set of things that P (and therefore Q) might say. You can think of T as a pattern or predicate that characterizes this set of statements. In the example, T is “all subjects” except for step (5), where it is “read and write requests”. It’s up to the guard of the object that gets the request to figure out whether the request is in T, so the interpretation of T’s encoding can be local to the object. SPKI [8] develops this idea in some detail.

We can write this P T( Q for short, or P Þ Q if T is “all subjects”. With this notation the chain for the example is:

KSSL Þ Ktemp Þ KAlice Þ Alice@Intel Þ
Atom@Microsoft r/w( Spectra

The picture below shows how the chain of responsibility is related to the various principals. Note that the “speaks for” arrows are quite independent of the flow of bytes.

The remainder of this section explains some of the details of establishing the chain of responsibility. Things can get a bit complicated; don’t lose sight of the simple idea. For more details see [12, 21, 8, 10].

3.4         Evidence for the links

How do we establish a link in the chain, that is, a fact P Þ Q? Someone, either the object’s guard or a later auditor, needs to see evidence for the link; we call this entity the “verifier”. The evidence has the form “principal says delegation”, where a delegation is a statement of the form P T( Q.[10] The principal is taking responsibility for the delegation. So we need to answer three questions:

Why do we trust the principal for this delegation?

How do we know who says the delegation?

Why is the principal willing to say it?

Why trust? The answer to the first question is always the same: We trust Q for P Þ Q, that is, we believe it if Q says it. When Q says P T( Q, Q is delegating its authority for T to P, because on the strength of this statement anything that P says about T will be taken as something that Q says. We believe the delegation on the grounds that Q, as a responsible adult or the computer equivalent, should be allowed to delegate its authority.

There are also some delegations that we trust unconditionally, because they are instances of general rules called “axioms”. There are no axioms for delegation from basic principals like encryption keys. We discuss compound principals like Alice@Intel later.

Who says? The second question is: How do we know that Q says P T( Q? The answer depends on how Q is doing the saying.

·                             If Q is a key, then “Q says X” means that Q cryptographically signs X, and this is something that a program can easily verify. This case applies for Ktemp Þ KAlice. If KAlice signs it, the verifier believes that KAlice says it, and therefore trusts it by the delegation rule above.

·                             If Q is the verifier itself, then P T( Q is probably just an entry in a local database; this case applies for an ACL entry like Atom Þ Spectra. The verifier believes its own local data.

These are the only ways that the verifier can directly know who said something: receive it on a secure channel or store it locally.

To verify that any other principal says something, the verifier needs some reasoning about “speaks for”. For a key binding like KAlice Þ Alice@Intel, the verifier needs a secure channel to some principal that can speak for Alice@Intel. As we shall see later, Intel delegate( Alice@Intel. So it’s enough for the verifier to see KAlice Þ Alice@Intel on a secure channel from Intel. Where does this channel come from?

The simplest way is for the verifier to simply know KIntel Þ Intel, that is, to have it wired in. Then encryption by KIntel forms the secure channel. Recall that in our example the verifier is a Microsoft web server. If Microsoft and Intel establish a direct relationship, Microsoft will know Intel’s public key KIntel, that is, know KIntel Þ Intel.

Of course, we don’t want to install KIntel Þ Intel explicitly on every Microsoft server. Instead, we install it in some Microsoft-wide directory. All the other servers have secure channels to the directory (for instance, they know the directory’s public key KMSDir) and trust it unconditionally to authenticate principals outside Microsoft. Only KMSDir and the delegation

KMSDir Þ * except *.Microsoft.com

need to be installed in each server.

The remaining case in the example is the group membership Alice@Intel Þ Atom@Microsoft. Just as Intel delegate( Alice@Intel, so Microsoft