This is part 2 of a series on the Random Oracle Model. See “Part 1: An introduction” for the previous post.
In a previous post I promised to explain what the Random Oracle Model is, and — more importantly — why you should care about it. I think I made a good start, but I realize that I haven’t actually answered either of these questions.
Instead what I did is talk a lot about hash functions. I described what an ideal hash function would look like: namely, a random function. Then I spent most of the post explaining why random functions were totally impractical (recap: too big to store, too slow to compute).
At the end of it all, I left us with the following conclusions: random functions would certainly make great hash functions, but unfortunately we can’t use them in real life. Still, if we can’t get a security proof any other way, it might just be useful to pretend that a hash function can behave like a random function. Of course, when we actually deploy the scheme we’ll have to use a real hash function like SHA or RIPEMD — definitely not random functions — and it’s not clear what our proof would mean in that case.
Still, this isn’t totally useless. If we can accomplish such a security proof, we can at least rule out the “obvious” attacks (based on the kind of glitches that real scheme designers tend to make). Any specific attack on the scheme would essentially have to have something to do with the properties of the hash function. This is still a good heuristic.
As you can see, this has gotten somewhat technical, and it’s going to get a bit more technical through to the end of the post. If you’ve gotten enough background to get the gist of the thing, feel free to skip on to the next post in this series, in which I’ll talk about where this model is used (hint all modern RSA signing and encryption!), and what it means for cryptography in practice.
Brass tacks
A security proof typically considers the interaction of two or more parties, one of whom we refer to as the adversary. In a typical proof, most of the players are “honest”, meaning they’ll operate according to a defined cryptographic protocol. The adversary, on the other hand, can do anything she wants. Her goal is to break the scheme.
Usually we spell out a very specific game that the parties will play. For an encryption scheme, the rules of this game allow the adversary to do things that she might be able to do in a “real world” environment. For example: she might be able to obtain the encryption of chosen plaintexts and/or the decryption of chosen ciphertexts. In most cases she’ll intercept a legitimate ciphertext transmitted by an honest encryptor. Typically her goal is to learn (something about) the underlying plaintext.
It’s standard to assume that the parties will all know the scheme or protocol, including how to compute the hash function if one is used. This is a common-sense assumption, sometimes referred to as Kerckhoffs’s Principle. Moreover, we generally assume that the adversary is limited in some way — she doesn’t have unlimited computational power or time, which means she can’t just brute force the decryption key.
A Random Oracle security proof is just like this, but we make one further tweak. Even though everyone is allowed to compute the hash function, they can’t do it by themselves. Hashing is done by a new magical party called the “oracle”. If any party needs to hash something, they ship the message (over an imaginary secure channel) to the oracle, who computes the hash and sends it back to them.
Why the oracle? Well, since the oracle is “outside” of the parties, they no longer need to carry around a description of the hash function, and they don’t need to do the work of computing it. This is good because — as I mentioned in the previous post — the full description of a random function is ridiculously large, and consequently takes an insane of amount of time to compute. By sticking it into the oracle, we can handwave away that ugliness.
We’ll place two restrictions on the random oracle. First, it must implement a genuine random function (sampled at the beginning of the game), with the same input/output profile as a realistic hash function we might use (e.g., 256-bit output strings if we’re eventually going to implement with SHA256).
Secondly, the responses given by the oracle must be consistent. In other words, just as hashing “Peter Piper” through SHA1 will always produce 1c6244929dc8216f5c1024eebefb5cb73c378431,* regardless of who does it, sending a given string through the random oracle will produce the same result, regardless of which party asks for it.
Lazy Afternoon
- At the beginning of the game the oracle’s table is empty.
- Whenever some party asks the oracle to hash a message, the oracle first checks to see if that input value is already stored in the table.
- If not, it generates a random output string, stores the input message and the new output string in its table, and returns the output string to the caller.
- If the oracle does find the input value in the table, it returns output value that’s already stored there.
Back to Delphia
1. Pick a random secret key (say 128 bits long) and share it with both the sender and receiver. Set IV to 1. 2. Cut a plaintext message into equal-length blocks M1, …, MN, where every block is exactly as long as the output of the hash function. 3. Using || to indicate concatenation and ⊕ for exclusive-OR, encrypt the message as follows: C1 = H(key || IV) ⊕ M1 C2 = H(key || IV+1) ⊕ M2 … CN = H(key || IV+N) ⊕ MN 4. Output (IV, C1, C2, ..., CN) as the ciphertext. 5. Make sure to set IV = (IV+N+1) before we encrypt the next message.
A proof sketch!
Some honest party — the encryptor — is going to encrypt a message. The adversary will intercept the ciphertext. The adversary “wins” if he can learn any information about the underlying plaintext besides its length.
Recall that we’re in the magical Random Oracle Model. This is just like the real world, except whenever any party computes the hash function H(), s/he’s actually issuing a call out to the oracle, who will do all the work of hashing using a random function. Everyone can call the oracle, even the adversary.
Following the scheme description above, the encryptor first chooses a random key (step 1). He divides his plaintext up into blocks (step 2). For each block i = 1 to N, he queries the oracle to obtain H(key || i). Finally, he XORs the received hashes with the blocks of the plaintext (step 3), and sends the ciphertext (C1, …, CN) out such that the adversary intercepts it (step 4).
- Internally, the oracle starts with an empty table.
- Each time the oracle receives a new query of the form “key || i” from the encryptor, it produces a new, perfectly random string for the output value. It stores both input and output in its table, and sends the output back to the encryptor.
- The encryptor forms the ciphertext blocks (C1, …, CN) by XORing each message block with one of these random strings.
- The encryptor never asks for the same input string twice, because the IV is always incremented.And finally, we make the most important observation:
- XORing a message with a secret, perfectly random string is a very effective way to encrypt it. In fact, it’s a One Time Pad. As long as the adversary does not know the random string, the resulting ciphertext is itself randomly-distributed, and thus reveals no information about the underlying message (other than its length).
The Final Analysis
Part (b) is a little harder. Is there any way the Adversary could learn the oracle’s output for H(key || i) for even one value of i from 1 to N? The oracle does not make its table public, and the adversary can’t “listen in” on the hash calls that the encryptor made.
Thinking about this a little harder, you’ll realize that there’s exactly one way that the adversary could learn one of those strings: she can query the oracle too. Specifically, all she has to do is query the oracle on “key || i” for any 1 <= i <= N and she’ll get back one of the strings that the encryptor used to encrypt the ciphertext. If the adversary learns one of these strings, she can easily recover (at least part of) the plaintext, and our argument doesn’t work anymore.
There’s one obvious problem with this, at least from the adversary’s point of view: she doesn’t know the key.
So in order to query on “key || i”, the adversary would have to guess the key. Every time she submits a guess “guess || i” to the oracle she’ll either be wrong — in which case she gets back a useless random string. Or she’ll be right, in which case our scheme won’t be secure anymore.
But fortunately for us, the key is random, and quite big (128 bits). This tells us something about the adversary’s probability of success.
Specifically: if she makes exactly one guess, her probability of guessing the right key is quite small: one in the number of possible keys. For our specific example, her success probability would be 1/(2^128), which is astronomically small. Or course, she could try guessing more than once; if makes, say q guesses, her success probability rises to q/(2^128).
The basic assumption we started with is that the adversary is somehow limited in her computational power. This means she can only perform a reasonable number of hashes. In a formal security proof this is where we might invoke polynomial vs. exponential time. But rather than do that, let’s just plug in some concrete numbers.
Let’s assume that the adversary only has time to compute 2^40 (1.1 trillion) hashes. Assuming we used a 128-bit key, then her probability of guessing the key is at most (2^40)/(2^128) = 1/(2^88). This is still a pretty tiny number.
Think a real-world adversary might be able to compute more than 2^40 hashes? No problem. Plug in your own numbers — the calculation stays the same. And if you think the scheme isn’t secure enough, you can always try making the key longer.
In Summary
Depending on your point of view, this may have seemed a bit technical. Or maybe not. In any case, it definitely went a bit past the “layperson’s introduction” I promised in the previous post. Moreover, it wasn’t technical enough, in that it didn’t capture all of the ways that cryptographers can “cheat” when using the random oracle model.***
I’m going to do my best to up for that in the next post, where we’ll talk a bit about some real schemes with proofs in the random oracle model, and — more importantly — why cryptographers are willing to put up with this kind of thing.
See the next post: Part 3, in which the true nature of the Random Oracle Model is revealed.
Notes:
* I got this result from an online SHA1 calculator of untrusted veracity. YMMV.
** Of course, in a real proof we’d be more formal, and we’d also consider the possibility that an adversary might at least be able to obtain the encryption of some chosen plaintexts. This would be captured in the security game we use.
Do random oracles create collisions? Seemingly, they'd have to, if their domain is finite. If so, how does this affect the proof of the encryption scheme built from an RO?
Also, if the output of an RO is truly random, how would an adversary ever know if he's correctly guessed the key? Aren't all possible plaintexts then possible decryptions?
“Do random oracles create collisions? Seemingly, they'd have to, if their domain is finite.”
Good question. Yes, random oracles do create collisions. This is unavoidable in any function where there are more inputs than outputs. The typical hash function takes in a variable-sized input and produces a fixed-size output. You can SHA1 hash a 1MB message and you'll get a 160-bit output.
The implication of this is something called the “pigeonhole principle”. If you have a million pigeons and only 100 holes, some pigeons are going to wind up sharing a hole. Ditto for hash function inputs and outputs. But that doesn't mean that it's easy for you to /find/ such a collision. In the ROM they happen at random, and it would take you a long time to find two inputs that map to the same output. So a random oracle hash function is implicitly collision /resistant/ (and one-way too, which means that you get one-way functions 'for free' in the ROM).
“If so, how does this affect the proof of the encryption scheme built from an RO?”
I think what you're asking is: what if the adversary queries on “wrong key || i”, but (due to a random collision in the hash function) they happen to get the same result from the oracle as if they'd queried on the right key?
The answer is: it doesn't help the adversary at all if he finds a collision on the key. If he queries on H(“not the key || i”) he might, due to the vagaries of random numbers, happen to get an output that's the same as H(“key || i”), but that's just what you'd expect from a perfect random number generator.
This in no way tells him anything about the message, anymore than representing the ciphertext as a bunch of scrabble letters, putting them into a bag and shaking it up, then drawing them out, would tell you anything about the actual plaintext. By some dumb luck you might actually draw out the real plaintext, but that would just be random coincidence. You could just as easily have drawn “eat zucchini” or “go Chargers”.
The fact that he got the right result and it happens to decrypt the plaintext properly is just random coincidence, and coincidences happen in this world. (With very low probability, obviously.)
In fact, the security of the scheme /depends/ to some extent on the fact that the function is perfectly random. If you wanted to avoid collisions, you could swap the random function in for a random /permutation', which has no collisions. But actually, permutations are less secure than plain functions specifically /because/ they never collide. If you use a permutation for too many blocks, the adversary can actually reduce its guessing space for the next block simply by knowing which values you've already 'used up'.
“Also, if the output of an RO is truly random, how would an adversary ever know if he's correctly guessed the key? Aren't all possible plaintexts then possible decryptions?”
If the key is 128-bits long (for example), then there are only 2^128 possible ways to decrypt the message. If the message itself if only 128 bits long, then it's possible that it could decrypt to (close to) every possible plaintext. (It still probably wouldn't quite get there, because there would be some collisions in the hash.)
However, if your message was 1 million bits long, then there would still be 2^{1 million} possible plaintexts, and only 2^{128} possible decryptions. So you definitely wouldn't get every possible plaintext. Of the (relatively) small subset of possible plaintexts that you would get by brute-forcing the key, most would probably be weird random junk.