Professor: Alfred Menezes | Term: Fall 2024


Chapter 1: Introduction to Cryptography

Cryptography is about securing communications in the presence of malicious adversaries.

Alice and Bob communicate over an unsecured channel while a malicious adversary, Eve, eavesdrops. Secure communication should provide:

Confidentiality: Keep data secret from everyone not authorized to see it.
Data Integrity: Ensure data is not altered by others.
Data Origin Authentication: Corroborating the source of data.
Non-Repudiation: Preventing an entity from denying previous commitments or actions.

Transport Layer Security (TLS): The cryptographic protocol used by web browsers to securely communicate with websites (e.g. Facebook, Gmail).

TLS is used to assure an individual user of the authenticity of the website and establish a secure communications channel.

Symmetric-Key Cryptography: The client and server a priori share some secret information $k$ called a key.

They can encrypt their messages with AES and authenticate the resulting ciphertexts with HMAC.

Question

How do Alice and Bob share the shared secret $k$ ?

Public-key cryptography: The client and server a priori share some authenticated (but non-secret) information.

To establish a secret key, Alice selects the secret session key $k$ , and encrypts it with Bob’s RSA public key. Then only Bob can decrypt the resulting ciphertext with its RSA private key to recover $k$ .

Question

How does Alice obtain an authentic copy of Bob’s RSA public key?

Bob cannot simply send it to Alice over the internet, because an adversary could intercept it and substitute a different key.

Signature scheme: Bob’s RSA public key is signed by a Certification Authority (CA) using its secret signing key with the RSA signature scheme.

Alice can verify the signature using the CA’s RSA public verification key. In this way, Alice obtains an authentic copy of Bob’s RSA public key.

The CA’s RSA public key is embedded in Alice’s browser.

TLS potential vulnerabilities:

The cryptography is weak
Cryptography can be broken using quantum computers
Weak random number generation
Issuance of fraudulent certificates
Software bugs
Phishing attacks
TLS only protects data during transit. It does not protect data stored at the server

Definition

Cybersecurity is comprised of the concepts, technical measures, and administrative measures used to protect networks, computers, programs and data from deliberate or inadvertent unauthorized access, disclosure, manipulation, or use.

Cryptography $\neq$ Cybersecurity. Cryptography provides some mathematical tools that can assist with the provision of cybersecurity services. It is a small part of a complete security solution.

Security is a chain; weak links become targets.

Chapter 2: Symmetric-Key Encryption

Fundamental Concepts

Definition

A symmetric-key encryption scheme (SKES) consists of:

$M$ - the plaintext space,

$C$ - the ciphertext space,

$K$ - the key space,

a family of encryption functions $E_{k} : M \to C, \forall k \in K$ ,

a family of decryption functions $D_{k} : C \to M, \forall k \in K$ ,

such that $D_{k} (E_{k} (m)) = m$ for all $m \in M, k \in K$ .

Alice and Bob agree on a secret key $k \in K$ by communicating over a secured channel.
Alice computes $c = E_{k} (m)$ and sends the ciphertext $c$ to Bob over the unsecured channel.
Bob retrieves the plaintext by computing $m = D_{k} (c)$.

Note: The secret key $k$ might be used for a fixed time interval, or it might be used a fixed number of times.

Called symmetric-key encryption since the same key is used for encryption and decryption.

The Enigma and Lorenz Machines are examples.

Substitution Cipher

$M =$ all English messages

$C =$ all encrypted messages

$K =$ all permutations of the English alphabet.

$E_{k} (m) :$ Apply permutation $k$ to $m$ , one letter at a time

$D_{k} (c) :$ Apply the inverse permutation $k^{- 1}$ to $c$ , one letter at a time

Security Model: Defines the computational abilities of the adversary, and how she interacts with the communicating parties.

Convention: We will always strive to model maximal adversary capabilities and minimal adversary goals.

Basic assumption: The adversary knows everything about the SKES, except the particular key $k$ chosen by Alice and Bob. Avoid security by obscurity.

Computational power of the adversary:

Information-theoretic security: Eve has infinite computational resources
Complexity-theoretic security: Eve is a “polynomial-time Turing machine”
Computational security: Eve has a bounded amount of computational power. We say Eve is computationally bounded.

Passive Attacks:

Ciphertext-only attack: The adversary knows some ciphertext.
Known-plaintext attack: The adversary also knows some plaintext and the corresponding ciphertext.

Active Attacks:

Chosen-plaintext attack: The adversary can also choose some plaintext and obtain the corresponding ciphertext.
Clandestine attacks: bribery, blackmail (not covered in this course).

Adversary’s goal

1. Recover the secret key $k$.
2. Systematically recover plaintext from ciphertext, without necessarily learning $k$.
3. Learn some partial information about the plaintext from the ciphertext, other than its length.

If the adversary can achieve 1 or 2, the SKES is said to be totally insecure.

If the adversary cannot learn any partial information about the plaintext from the ciphertext (other than its length), the SKES is said to be semantically secure.

Definition

A symmetric-key encryption scheme is said to be secure if it is semantically secure against chosen-plaintext attack by a computationally bounded adversary.

To break a symmetric-key encryption scheme, the adversary has to accomplish the following:

The adversary is given a challenge ciphertext $c$
During its computation, the adversary can select arbitrary plaintext and obtain the corresponding ciphertexts
After a feasible amount of computation, the adversary obtains some information about the plaintext $m$ corresponding to $c$ .

Desirable properties of a SKES

Efficient algorithms should be known for computing $E_{k}$ and $D_{k}$ .
The secret key $k$ should be small, but large enough to render exhaustive key search infeasible.
The scheme should be secure.
The scheme should be secure even against the designer of the system.

The simple substitution cipher is totally insecure against a chosen-plaintext attack.

Is exhaustive key search possible?

Number of keys to try is $26! \approx 4 \times 10^{26} \approx 2^{88}$ . This will take too long, even with many computers.

In this course:

$2^{40}$ operations is considered very easy.
$2^{56}$ operations is considered easy.
$2^{64}$ operations is considered feasible.
$2^{80}$ operations is considered barely feasible.
$2^{128}$ operations is considered infeasible.

The Bitcoin network is performing about $2^{80}$ hash operations per hour.

Definition

A cryptographic scheme is said to have a security level of $ℓ$ bits if the fastest known attack on the scheme takes approximately $2^{ℓ}$ operations. A security level of $128$ bits is desirable in practice.

Simple frequency analysis of ciphertext letters can be used to recover the secret key.

Vigenere Cipher

The secret key is an English word having no repeated letters, e.g. $k =$ CRYPTO. To encrypt, repeat $k$ as often as needed to cover the message, then add each letter of $m$ to the corresponding key letter to get $c$ . Here $A = 0, B = 1, \dots, Z = 25$ , and addition of letters is modulo $26$ . Decryption is subtraction modulo $26$ : $m = c - k$ .
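The letter arithmetic above can be sketched in a few lines. This is only an illustration, assuming uppercase A-Z input with no spaces; the function names are made up for this example.

```python
def _shift(text, key, sign):
    """Add (sign=+1) or subtract (sign=-1) the repeated key, letter by letter, mod 26."""
    out = []
    for i, ch in enumerate(text):
        k = ord(key[i % len(key)]) - ord("A")   # key letter as a number 0..25
        x = ord(ch) - ord("A")                  # message letter as a number 0..25
        out.append(chr((x + sign * k) % 26 + ord("A")))
    return "".join(out)

def vigenere_encrypt(m, key):
    return _shift(m, key, +1)

def vigenere_decrypt(c, key):
    return _shift(c, key, -1)

ciphertext = vigenere_encrypt("HELLO", "CRYPTO")   # H+C, E+R, L+Y, L+P, O+T
plaintext = vigenere_decrypt(ciphertext, "CRYPTO")
```

Here `HELLO` encrypts to `JVJAH`: for example, L (11) plus Y (24) is 35, which is 9 modulo 26, i.e. J.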

The Vigenere cipher is totally insecure against a chosen-plaintext attack.

It is also insecure against a ciphertext-only attack.

Stream Ciphers

A stream cipher is a SKES that encrypts the plaintext one bit at a time. A block cipher is a SKES that encrypts the plaintext one block at a time.

One-time Pad

The secret key is a random string of letters with the same length as the message. We use the same process as the Vigenere.

The key should not be reused. If $c_{1} = m_{1} + k$ and $c_{2} = m_{2} + k$ , then $c_{1} - c_{2} = (m_{1} + k) - (m_{2} + k) = m_{1} - m_{2}$ . So, $c_{1} - c_{2}$ depends only on the plaintext (and not on the key $k$ ), and hence can leak information.

Convention: From now on, messages and keys will be assumed to be bit (binary) strings.

Notation: $\oplus$ is bitwise exclusive-or (XOR) (bitwise addition mod $2$ ).

Example

$1011001011 \oplus 1001001001 = 0010000010$

Note that $x \oplus x = 0, x \oplus y = y \oplus x$

So, for the one-time pad, encryption is $c = m \oplus k$ and decryption is $m = c \oplus k$ .
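A short sketch of the binary one-time pad, including the key-reuse leak described earlier. The messages and helper name are illustrative; `secrets.token_bytes` stands in for a source of truly random key bits.

```python
import secrets

def xor_bytes(a, b):
    """Bitwise XOR of two equal-length byte strings."""
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    return bytes(x ^ y for x, y in zip(a, b))

m1 = b"ATTACK AT DAWN"
m2 = b"RETREAT AT TEN"
k = secrets.token_bytes(len(m1))   # key as long as the message

c1 = xor_bytes(m1, k)              # encryption: c = m XOR k
assert xor_bytes(c1, k) == m1      # decryption: m = c XOR k

c2 = xor_bytes(m2, k)              # key reuse (this is the mistake)
leak = xor_bytes(c1, c2)           # equals m1 XOR m2; the key cancels out
```

The value `leak` depends only on the two plaintexts, which is why the pad must never be reused.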

Perfect secrecy: The one-time pad is semantically secure against ciphertext-only attack by an adversary with infinite computational resources.

This can be formally proven using concepts from information theory. Shannon proved that if plaintexts are $ℓ$ -bit strings, then any symmetric-key encryption scheme with perfect secrecy must have $|K| \geq 2^{ℓ}$ , i.e. keys must be at least as long as the messages. So perfect secrecy is useless in practice.

Basic Idea: Instead of using a random key in the one-time pad, use a “pseudorandom” key.

Definition

A pseudorandom bit generator (PRBG) is a deterministic algorithm that takes as input a (random) seed, and outputs a longer “pseudorandom” sequence called the keystream.

Stream cipher: uses a PRBG for encryption. The seed is the secret key shared by Alice and Bob.

No more perfect secrecy - security depends on the quality of the PRBG.

For a stream cipher to be secure:

Indistinguishability requirement: The keystream should be “indistinguishable” from a random sequence.
Unpredictability requirement: Given portions of the keystream, it should be computationally infeasible to learn any information about the remainder of the keystream
- If an adversary knows a portion $c_{1}$ of ciphertext and the corresponding plaintext $m_{1}$ , then she can easily find the corresponding portion $k_{1} = c_{1} \oplus m_{1}$ of the keystream.

Aside: Don’t use UNIX random number generators rand and srand for cryptographic purposes.

Now we introduce the ChaCha20 stream cipher. Designed by Dan Bernstein in 2008, ChaCha20 is conceptually very simple. It is extremely fast in software and does not require any special hardware. No security weaknesses have been found.

Notation:

$256$ -bit key $k = (k_{1}, k_{2}, k_{3}, \dots, k_{8})$
$128$ -bit constant $f = (f_{1}, f_{2}, f_{3}, f_{4})$
$96$ -bit nonce $n = (n_{1}, n_{2}, n_{3})$
$32$ -bit counter $c$
A hexadecimal digit is a $4$ -bit number.
- $f_{1} = \mathtt{0x61707865}$
- $f_{2} = \mathtt{0x3320646e}$
- $f_{3} = \mathtt{0x79622d32}$
- $f_{4} = \mathtt{0x6b206574}$
A nonce is a non-repeating quantity.

The initial state is:

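$$
\begin{bmatrix} f_{1} & f_{2} & f_{3} & f_{4} \\ k_{1} & k_{2} & k_{3} & k_{4} \\ k_{5} & k_{6} & k_{7} & k_{8} \\ c & n_{1} & n_{2} & n_{3} \end{bmatrix} = \begin{bmatrix} S_{1} & S_{2} & S_{3} & S_{4} \\ S_{5} & S_{6} & S_{7} & S_{8} \\ S_{9} & S_{10} & S_{11} & S_{12} \\ S_{13} & S_{14} & S_{15} & S_{16} \end{bmatrix}
$$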

More Notation:

$\oplus :$ xor
$⊞ :$ integer addition modulo $2^{32}$
$<<< t :$ left-rotation by $t$ positions

Define $QR$ : Quarter Round Function

Input: Four $32$ -bit words $a, b, c, d$

$$
\begin{aligned}
a &\leftarrow a ⊞ b, & d &\leftarrow d \oplus a, & d &\leftarrow d <<< 16 \\
c &\leftarrow c ⊞ d, & b &\leftarrow b \oplus c, & b &\leftarrow b <<< 12 \\
a &\leftarrow a ⊞ b, & d &\leftarrow d \oplus a, & d &\leftarrow d <<< 8 \\
c &\leftarrow c ⊞ d, & b &\leftarrow b \oplus c, & b &\leftarrow b <<< 7
\end{aligned}
$$

Output: $a, b, c, d$
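The quarter round translates almost line for line into code. The sketch below is illustrative (the function names are not from the lecture); the input words are the quarter-round test vector from RFC 8439, section 2.1.1.

```python
MASK32 = 0xFFFFFFFF

def rotl32(x, t):
    """Left-rotate the 32-bit word x by t positions."""
    return ((x << t) & MASK32) | (x >> (32 - t))

def quarter_round(a, b, c, d):
    a = (a + b) & MASK32; d = rotl32(d ^ a, 16)
    c = (c + d) & MASK32; b = rotl32(b ^ c, 12)
    a = (a + b) & MASK32; d = rotl32(d ^ a, 8)
    c = (c + d) & MASK32; b = rotl32(b ^ c, 7)
    return a, b, c, d

result = quarter_round(0x11111111, 0x01020304, 0x9B8D6F43, 0x01234567)
# result == (0xEA2A92F4, 0xCB1CF8CE, 0x4581472E, 0x5881C4BB)
```

Only additions, XORs, and rotations are used, which is why the cipher needs no special hardware to run fast.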

ChaCha20 Keystream Generator

Select a nonce $n$ and initialize the counter $c$ to $0$ . While keystream bytes are required do:

Create the initial state $S$ , and make a copy $S^{'}$ of $S$ .

Update $S$ by repeating the following $10$ times:

$QR (S_{1}, S_{5}, S_{9}, S_{13}), QR (S_{2}, S_{6}, S_{10}, S_{14}), QR (S_{3}, S_{7}, S_{11}, S_{15}), QR (S_{4}, S_{8}, S_{12}, S_{16})$

$QR (S_{1}, S_{6}, S_{11}, S_{16}), QR (S_{2}, S_{7}, S_{12}, S_{13}), QR (S_{3}, S_{8}, S_{9}, S_{14}), QR (S_{4}, S_{5}, S_{10}, S_{15})$

Output $S ⊞ S^{'}$ , the word-by-word sum modulo $2^{32}$ ( $64$ keystream bytes).

Increment the counter $c$

Encryption: The keystream bytes are xored with the plaintext bytes to produce ciphertext bytes. The nonce is appended to the ciphertext.
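The keystream generator can be sketched as follows. Two details are assumptions taken from RFC 8439 rather than from these notes: key, nonce, and output words are serialized in little-endian order, and the final step adds the saved copy of the state word by word modulo $2^{32}$. Indices are 0-based, so $S_{1}$ is `s[0]`. The function names are illustrative.

```python
import struct

MASK32 = 0xFFFFFFFF
CONSTANTS = (0x61707865, 0x3320646E, 0x79622D32, 0x6B206574)  # f1..f4

def rotl32(x, t):
    return ((x << t) & MASK32) | (x >> (32 - t))

def qr(s, i, j, k, l):
    """Apply the quarter round in place to state words s[i], s[j], s[k], s[l]."""
    a, b, c, d = s[i], s[j], s[k], s[l]
    a = (a + b) & MASK32; d = rotl32(d ^ a, 16)
    c = (c + d) & MASK32; b = rotl32(b ^ c, 12)
    a = (a + b) & MASK32; d = rotl32(d ^ a, 8)
    c = (c + d) & MASK32; b = rotl32(b ^ c, 7)
    s[i], s[j], s[k], s[l] = a, b, c, d

def chacha20_block(key, nonce, counter):
    """key: 32 bytes, nonce: 12 bytes, counter: int. Returns 64 keystream bytes."""
    initial = (list(CONSTANTS) + list(struct.unpack("<8I", key))
               + [counter & MASK32] + list(struct.unpack("<3I", nonce)))
    s = initial[:]
    for _ in range(10):
        # Column rounds: (S1,S5,S9,S13), ..., (S4,S8,S12,S16)
        qr(s, 0, 4, 8, 12); qr(s, 1, 5, 9, 13); qr(s, 2, 6, 10, 14); qr(s, 3, 7, 11, 15)
        # Diagonal rounds: (S1,S6,S11,S16), ..., (S4,S5,S10,S15)
        qr(s, 0, 5, 10, 15); qr(s, 1, 6, 11, 12); qr(s, 2, 7, 8, 13); qr(s, 3, 4, 9, 14)
    out = [(x + y) & MASK32 for x, y in zip(s, initial)]
    return struct.pack("<16I", *out)

def chacha20_xor(key, nonce, data, counter=0):
    """XOR data with the keystream; the same call encrypts and decrypts."""
    out = bytearray()
    for offset in range(0, len(data), 64):
        ks = chacha20_block(key, nonce, counter + offset // 64)
        out.extend(x ^ y for x, y in zip(data[offset:offset + 64], ks))
    return bytes(out)

key, nonce = bytes(range(32)), bytes(12)
message = b"x" * 100
ciphertext = chacha20_xor(key, nonce, message)
```

This sketch is for study only; real applications should use a vetted library.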

Block Ciphers: DES

Definition

A block cipher is a symmetric-key encryption scheme that breaks up the plaintext into blocks of a fixed length (e.g. $128$ bits), and encrypts the blocks one at a time.

In contrast, a stream cipher encrypts the plaintext one character (usually a bit) at a time.

A historically important example of a block cipher is the Data Encryption Standard (DES):

flowchart LR
key
key -- 56 bits --> DES
plaintext -- 64 bits --> DES
DES -- 64 bits --> ciphertext

Key length: $56$ bits, size of key space: $2^{56}$ , block length: $64$ bits.

Design principles articulated by Claude Shannon

Security:

Diffusion: Each ciphertext bit should depend on all plaintext bits.
Confusion: The relationship between key and ciphertext bits should be complicated.
Key length: Should be small, but large enough to preclude exhaustive key search.

Efficiency:

Simplicity: Easier to implement and analyze.
Speed: High encryption and decryption rates.
Platform: Suitable for hardware and software.

The design principles of DES are still classified.

DES Problem #1: Small key size

Exhaustive search on the key space takes $2^{56}$ operations and can easily be parallelized.

DES Problem #2: Small block size

If plaintext blocks are distributed uniformly at random, then the expected number of ciphertext blocks observed before a collision occurs is $\approx 2^{32}$ . This is by the birthday paradox.

Birthday paradox: Suppose that an urn contains $n$ numbered balls. Suppose that balls are drawn from the urn, one at a time, with replacement. The expected number of draws before a ball is selected for the second time (called a collision) is approximately $\sqrt{\pi n / 2} \approx \sqrt{n}$ .

Thus the ciphertext reveals some information about the plaintext.
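The birthday estimate is easy to check numerically. The sketch below compares $\sqrt{\pi n / 2}$ with a seeded simulation for $n = 365$, and evaluates the estimate for $n = 2^{64}$ (the number of possible DES blocks). The function names and the choice of 20000 trials are arbitrary.

```python
import math
import random

def birthday_estimate(n):
    """Approximate expected number of draws until the first collision."""
    return math.sqrt(math.pi * n / 2)

def draws_until_collision(n, rng):
    """Draw with replacement from n balls until some ball repeats."""
    seen = set()
    draws = 0
    while True:
        ball = rng.randrange(n)
        draws += 1
        if ball in seen:
            return draws
        seen.add(ball)

rng = random.Random(1)  # fixed seed so the experiment is repeatable
trials = 20000
average = sum(draws_until_collision(365, rng) for _ in range(trials)) / trials

des_blocks_log2 = math.log2(birthday_estimate(2 ** 64))  # a little above 32
```

The simulated average lands close to the estimate of about 24 draws, and for $64$ -bit blocks the estimate is roughly $2^{32}$ blocks.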

Block Ciphers: Triple-DES

Recall: The only substantial weaknesses known in DES are the obvious ones: small key length and small block length.

Question

How can one construct a more secure block cipher from DES?

Multiple encryption: Re-encrypt the ciphertext one or more times using independent keys. Hope that this increases the effective key length.

Note: Multiple encryption does not necessarily result in increased security.

Example

If $E_{π}$ denotes the encryption function for the simple substitution cipher with the key $π$ , then is $E_{π_{2}} \circ E_{π_{1}}$ any more secure than $E_{π}$ ?

No, since $E_{π_{2}} \circ E_{π_{1}} = E_{π_{2} \circ π_{1}}$

Double-DES encryption is: $c = E_{k_{2}} (E_{k_{1}} (m))$ , where $E =$ DES encryption.

The Double-DES key length is $ℓ = 112$ bits, so exhaustive key search takes $2^{112}$ operations.

Note: The block length is unchanged ( $64$ bits).

Meet-in-the-middle Attack on Double-DES

Input: 3 plaintext/ciphertext pairs $(m_{1}, c_{1}), (m_{2}, c_{2}), (m_{3}, c_{3})$

Output: The secret key $(k_{1}, k_{2})$

1. For each $h_{2} \in \{0, 1\}^{56}$ do: compute $E_{h_{2}}^{- 1} (c_{1})$ and store $[E_{h_{2}}^{- 1} (c_{1}), h_{2}]$ in a table.

2. For each $h_{1} \in \{0, 1\}^{56}$ do:

	a. Compute $E_{h_{1}} (m_{1})$ .

	b. Search for $E_{h_{1}} (m_{1})$ in the table.

	c. For each match $[E_{h_{2}}^{- 1} (c_{1}), h_{2}]$ in the table: check if $E_{h_{2}} (E_{h_{1}} (m_{2})) = c_{2}$ ; if so, check if $E_{h_{2}} (E_{h_{1}} (m_{3})) = c_{3}$ ; if so, output $(h_{1}, h_{2})$ and STOP.

The main idea is that $c = E_{k_{2}} (E_{k_{1}} (m)) ⟺ E_{k_{2}}^{- 1} (c) = E_{k_{1}} (m)$
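To see the attack run end to end, the sketch below applies it to a made-up toy cipher with $8$ -bit keys and $16$ -bit blocks. The cipher is hypothetical and has no security value; it only needs to be an invertible keyed map so the table lookup can be demonstrated. A dictionary plays the role of the table.

```python
MULT = 0x6487                      # odd, hence invertible modulo 2^16
MULT_INV = pow(MULT, -1, 1 << 16)

def rotl16(x, t):
    return ((x << t) | (x >> (16 - t))) & 0xFFFF

def rotr16(x, t):
    return ((x >> t) | (x << (16 - t))) & 0xFFFF

def toy_encrypt(k, x):
    """Hypothetical toy cipher: 8-bit key k, 16-bit block x."""
    for r in range(4):
        x = (x + 0x0101 * k + r) & 0xFFFF
        x = (x * MULT) & 0xFFFF
        x = rotl16(x, 5)
    return x

def toy_decrypt(k, y):
    for r in reversed(range(4)):
        y = rotr16(y, 5)
        y = (y * MULT_INV) & 0xFFFF
        y = (y - 0x0101 * k - r) & 0xFFFF
    return y

def double_encrypt(k1, k2, m):
    return toy_encrypt(k2, toy_encrypt(k1, m))

def meet_in_the_middle(pairs):
    (m1, c1), rest = pairs[0], pairs[1:]
    table = {}
    for h2 in range(256):                      # step 1: decrypt c1 under every h2
        table.setdefault(toy_decrypt(h2, c1), []).append(h2)
    for h1 in range(256):                      # step 2: encrypt m1 under every h1
        middle = toy_encrypt(h1, m1)
        for h2 in table.get(middle, []):       # candidate keys that meet in the middle
            if all(double_encrypt(h1, h2, m) == c for m, c in rest):
                return h1, h2
    return None

secret = (0x3A, 0xC5)
pairs = [(m, double_encrypt(*secret, m)) for m in (0x0001, 0x1234, 0xBEEF)]
recovered = meet_in_the_middle(pairs)
```

The search performs about $2 \cdot 2^{8}$ toy-cipher operations instead of the $2^{16}$ needed for exhaustive search over both keys, mirroring the $2^{57}$ versus $2^{112}$ gap for Double-DES.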

Question

How many plaintext/ciphertext pairs are needed for unique key determination?

Let $E$ be a block cipher with key space $K = {0, 1}^{ℓ}$ , and plaintext and ciphertext space ${0, 1}^{L}$ .

Let $k^{'} \in K$ be the secret key chosen by Alice and Bob, and let $(m_{i}, c_{i}), 1 \leq i \leq t$ , be known plaintext/ciphertext pairs where the plaintext $m_{i}$ are distinct.

Question

So how large should $t$ be to ensure that there is only one key $k \in K$ for which $E_{k} (m_{i}) = c_{i}$ for all $1 \leq i \leq t$ ?

We select $t$ so that $FK \approx 0$ , where $FK$ is the expected number of false keys.

For each $k \in K$ , the encryption function $E_{k} : {0, 1}^{L} \to {0, 1}^{L}$ is a permutation (bijection).

We make the heuristic assumption that for each $k \in K, E_{k}$ is a random function, i.e. a randomly selected function. This assumption is certainly false since $E_{k}$ is not random, and because a random function is almost certainly not a permutation.

Now, fix $k \in K, k \neq k^{'}$ .

The probability that $E_{k} (m_{i}) = c_{i}$ for all $1 \leq i \leq t$ is $\underbrace{\frac{1}{2 ^{L}} \cdot \frac{1}{2 ^{L}} \cdots \frac{1}{2 ^{L}}}_{t} = \frac{1}{2 ^{L t}}$ . Thus, the expected number of false keys $k \in K$ for which $E_{k} (m_{i}) = c_{i}$ for all $1 \leq i \leq t$ is $FK = \frac{2 ^{ℓ} - 1}{2 ^{L t}}$ .
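Plugging in the Double-DES parameters ( $ℓ = 112$ , $L = 64$ ) shows why three pairs are used in the attack. The helper below is a small illustrative calculation; exact fractions are used to avoid rounding in the formula itself.

```python
import math
from fractions import Fraction

def expected_false_keys(key_bits, block_bits, t):
    """FK = (2^l - 1) / 2^(L t), under the random-function heuristic."""
    return Fraction(2 ** key_bits - 1, 2 ** (block_bits * t))

# log2 of FK for Double-DES with t = 1, 2, 3 known plaintext/ciphertext pairs
log2_fk = {t: math.log2(float(expected_false_keys(112, 64, t))) for t in (1, 2, 3)}
```

With one pair about $2^{48}$ false keys survive, with two pairs about $2^{-16}$ , and with three pairs about $2^{-80}$ , so three pairs determine the key with near certainty.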

Time: The number of DES operations is $\approx 2^{56} + 2^{56} + 2 \cdot 2^{48} + 2 (1 + \frac{1}{2 ^{16}}) \approx 2^{57}$

The security level of Double-DES is 57 bits, so Double-DES is not much more secure than DES.

A Triple-DES secret key is $k = (k_{1}, k_{2}, k_{3})$ , where $k_{1}, k_{2}, k_{3} \in_{R} {0, 1}^{56}$

Encryption is: $c = E_{k_{3}} (E_{k_{2}} (E_{k_{1}} (m)))$ , where $E =$ DES encryption.

flowchart LR
k1
k2
k3
id1[DES]
id2[DES]
id3[DES]
k1 --> id1
k2 --> id2
k3 --> id3
m -- 64 bits --> id1
id1 -- 64 bits --> id2
id2 -- 64 bits --> id3 
id3 -- 64 bits --> c

Decryption is: $m = E_{k_{1}}^{- 1} (E_{k_{2}}^{- 1} (E_{k_{3}}^{- 1} (c)))$ , where $E^{- 1} =$ DES decryption.

The Triple-DES key length is $ℓ = 168$ bits, so exhaustive key search takes $2^{168}$ operations (infeasible).

Meet-in-the-middle attack takes $\approx 2^{112}$ operations. So the security level of Triple-DES is 112 bits.

Block Ciphers: AES

The Advanced Encryption Standard (AES)

Key lengths: $128$ , $192$ and $256$ bits.

Block length: $128$ bits

2024: No attacks have been found on AES that are faster than exhaustive key search.

AES is an example of a substitution-permutation network.

Definition

A substitution-permutation network (SPN) is an iterated block cipher where a round consists of a substitution operation followed by a permutation operation.

Components of an SPN cipher:

$n$ : the block length (in bits)

$ℓ$ : the key length (in bits)

$h$ : the number of rounds

A fixed invertible function $S : \{0, 1\}^{b} \to \{0, 1\}^{b}$ called the substitution (S-box), where $b$ is a divisor of $n$ .

A fixed permutation $P$ on $\{1, 2, \dots, n\}$ .

A key scheduling algorithm that determines subkeys $k_{1}, k_{2}, \dots, k_{h}, k_{h + 1}$ from a key $k$ .

Note: $n, ℓ, h, S, P$ and the key scheduling algorithm are public.

The encryption looks like:

$A \leftarrow$ plaintext
for $i = 1, 2, \dots, h$ do
	$A \leftarrow A \oplus k_{i}$
	$A \leftarrow S (A)$
	$A \leftarrow P (A)$
$A \leftarrow A \oplus k_{h + 1}$
ciphertext $\leftarrow A$

Decryption is the reverse of encryption.

AES is an SPN, where the permutation operation is comprised of two invertible linear transformations.

All operations are byte oriented, i.e., $b = 8$ , so the S-box maps $8$ bits to $8$ bits. This facilitates fast implementations on software platforms.

The block length of AES is $n = 128$ . Each subkey is $128$ bits.

AES accepts three key lengths. The number of rounds $h$ depends on the key length.

Each round updates a variable called State which consists of a $4 \times 4$ array of bytes (note $4 \times 4 \times 8 = 128$ , the block length).

State is initialized with the plaintext.

Each AES round uses four invertible operations:

AddRoundKey (key-mixing)
SubBytes (S-box)
ShiftRows (permutation)
MixColumns (linear transformation)

After $h$ rounds are completed, a final subkey is XORed with State, the result being the ciphertext.

Definition

The elements of the finite field $GF (2^{8})$ are the polynomials of degree at most $7$ in $Z_{2} [y]$ , with addition and multiplication performed modulo the irreducible polynomial $f (y) = y^{8} + y^{4} + y^{3} + y + 1$ ( $Z_{2} [y]$ is the set of polynomials in $y$ with coefficients from $Z_{2}$ ). We interpret an $8$ -bit string $a = a_{7} a_{6} a_{5} \dots a_{1} a_{0}$ as coefficients of the polynomial $a (y) = a_{7} y^{7} + a_{6} y^{6} + a_{5} y^{5} + \dots + a_{1} y + a_{0}$ and vice versa.

Example

Let $a = 11101100 = ec$ and $b = 00111011 = 3 b$ , so $a (y) = y^{7} + y^{6} + y^{5} + y^{3} + y^{2}$ and $b (y) = y^{5} + y^{4} + y^{3} + y + 1$ .

Addition: $a (y) + b (y) = y^{7} + y^{6} + y^{4} + y^{2} + y + 1$ , so $ec + 3 b = d 7$ .

Multiplication: $a (y) \cdot b (y) = y^{12} + y^{10} + y^{8} + y^{4} + y^{2}$ , which leaves a remainder of $r (y) = y^{7} + y^{6} + y^{3}$ upon division by $f (y)$ . Hence $a \cdot b =$ $11001000$ in $GF (2^{8})$ , or $ec \cdot 3 b = c 8$ .

Inversion: $ec^{- 1} = 5d$ , since $ec \cdot 5d = 01$ .
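The worked example can be checked with a few lines of code. In this sketch a byte stands for a polynomial (bit $i$ is the coefficient of $y^{i}$ ), addition is XOR, and multiplication reduces by $f (y)$ , written as `0x11B`. The function names are illustrative, and the inverse is computed the slow way as $a^{254}$ , which works because the nonzero elements form a group of order $255$ .

```python
AES_MODULUS = 0x11B  # y^8 + y^4 + y^3 + y + 1

def gf_mul(a, b):
    """Multiply two elements of GF(2^8) (shift-and-add with reduction)."""
    result = 0
    while b:
        if b & 1:
            result ^= a          # add a * (current power of y)
        a <<= 1                  # multiply a by y
        if a & 0x100:
            a ^= AES_MODULUS     # reduce modulo f(y)
        b >>= 1
    return result

def gf_inv(a):
    """Inverse in GF(2^8) as a^254; by convention this maps 0 to 0."""
    result = 1
    for _ in range(254):
        result = gf_mul(result, a)
    return result

total = 0xEC ^ 0x3B            # addition:       ec + 3b = d7
product = gf_mul(0xEC, 0x3B)   # multiplication: ec * 3b = c8
inverse = gf_inv(0xEC)         # inversion:      ec^{-1} = 5d
```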

Definition (S-box)

Let $p \in {0, 1}^{8}$ , and consider $p$ as an element of $GF (2^{8})$ .

Let $q = p^{- 1}$ if $p \neq 0$ , and $q = p$ if $p = 0$ .

Define $q = (q_{7} q_{6} q_{5} \dots q_{1} q_{0})$

Compute

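$$
\begin{bmatrix} r_{0} \\ r_{1} \\ r_{2} \\ r_{3} \\ r_{4} \\ r_{5} \\ r_{6} \\ r_{7} \end{bmatrix} = \begin{bmatrix} 1 & 0 & 0 & 0 & 1 & 1 & 1 & 1 \\ 1 & 1 & 0 & 0 & 0 & 1 & 1 & 1 \\ 1 & 1 & 1 & 0 & 0 & 0 & 1 & 1 \\ 1 & 1 & 1 & 1 & 0 & 0 & 0 & 1 \\ 1 & 1 & 1 & 1 & 1 & 0 & 0 & 0 \\ 0 & 1 & 1 & 1 & 1 & 1 & 0 & 0 \\ 0 & 0 & 1 & 1 & 1 & 1 & 1 & 0 \\ 0 & 0 & 0 & 1 & 1 & 1 & 1 & 1 \end{bmatrix} \begin{bmatrix} q_{0} \\ q_{1} \\ q_{2} \\ q_{3} \\ q_{4} \\ q_{5} \\ q_{6} \\ q_{7} \end{bmatrix} + \begin{bmatrix} 1 \\ 1 \\ 0 \\ 0 \\ 0 \\ 1 \\ 1 \\ 0 \end{bmatrix}
$$

where the arithmetic is modulo $2$ .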

Then $S (p) = r = (r_{7} r_{6} r_{5} \dots r_{1} r_{0})$
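The S-box definition can be sketched directly from these steps. Row $i$ of the matrix selects bits $q_{i}, q_{i + 4}, q_{i + 5}, q_{i + 6}, q_{i + 7}$ (indices modulo $8$ ), and the constant vector read from $r_{0}$ upward is the byte `0x63`. The function names are illustrative, and the slow inversion is chosen for readability, not speed.

```python
def gf_mul(a, b):
    """Multiply in GF(2^8) modulo y^8 + y^4 + y^3 + y + 1."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11B
        b >>= 1
    return result

def gf_inv(a):
    """a^254, which is the inverse for a != 0 and 0 for a == 0."""
    result = 1
    for _ in range(254):
        result = gf_mul(result, a)
    return result

def sbox(p):
    q = gf_inv(p)
    r = 0
    for i in range(8):
        # Row i of the matrix times q, plus bit i of the constant 0x63, modulo 2.
        bit = ((q >> i) ^ (q >> ((i + 4) % 8)) ^ (q >> ((i + 5) % 8))
               ^ (q >> ((i + 6) % 8)) ^ (q >> ((i + 7) % 8)) ^ (0x63 >> i)) & 1
        r |= bit << i
    return r

table = [sbox(p) for p in range(256)]
```

For instance, `sbox(0x00)` is `0x63` (the constant alone) and `sbox(0x53)` is `0xED`, the example used in the AES standard.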

ShiftRows: Permute the bytes of State by applying a cyclic shift to each row.

MixColumns:

Read column $i$ of State as a polynomial $(a_{0, i}, a_{1, i}, a_{2, i}, a_{3, i}) = a_{0, i} + a_{1, i} x + a_{2, i} x^{2} + a_{3, i} x^{3}$ (interpret the coefficients as elements of the finite field $GF (2^{8})$ ).
Multiply this polynomial by the constant polynomial $c (x) = 02 + 01 x + 01 x^{2} + 03 x^{3}$ and reduce modulo $x^{4} - 1$ . This gives a new polynomial $b_{0, i} + b_{1, i} x + b_{2, i} x^{2} + b_{3, i} x^{3}$ , whose coefficients form the new column $i$ of State.

The $\otimes c (x)$ operation.

Let $a (x) = a_{0} + a_{1} x + a_{2} x^{2} + a_{3} x^{3}$ , where each $a_{i} \in GF (2^{8})$ .

Let $c (x) = 02 + 01 x + 01 x^{2} + 03 x^{3}$ , where 01, 02, 03 are elements in $GF (2^{8})$ (written in hex).

To compute $a (x) \otimes c (x)$ do:

Compute $d (x) = a (x) \times c (x)$ (polynomial multiplication where coefficient arithmetic is in $GF (2^{8})$ ).
Divide by $x^{4} - 1$ to find the remainder polynomial $r (x)$ . Equivalently, replace $x^{4}$ by $1$ , $x^{5}$ by $x$ , and $x^{6}$ by $x^{2}$ , and simplify.
Then $a (x) \otimes c (x) = r (x)$ .
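The multiplication by $c (x)$ modulo $x^{4} - 1$ can be sketched exactly as described: multiply the polynomials, then fold the coefficients of $x^{4}, x^{5}, x^{6}$ back onto $1, x, x^{2}$ . The function names are illustrative; the sample column `db 13 53 45` is a widely used MixColumns test vector.

```python
def gf_mul(a, b):
    """Multiply in GF(2^8) modulo y^8 + y^4 + y^3 + y + 1."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11B
        b >>= 1
    return result

C = (0x02, 0x01, 0x01, 0x03)  # c(x) = 02 + 01 x + 01 x^2 + 03 x^3

def mix_column(a):
    """Return the coefficients of a(x) * c(x) mod (x^4 - 1) for a 4-byte column a."""
    d = [0] * 7                       # d(x) = a(x) * c(x) has degree at most 6
    for i in range(4):
        for j in range(4):
            d[i + j] ^= gf_mul(a[i], C[j])
    # Replace x^4 by 1, x^5 by x, x^6 by x^2 (addition in GF(2^8) is XOR).
    return [d[0] ^ d[4], d[1] ^ d[5], d[2] ^ d[6], d[3]]

mixed = mix_column([0xDB, 0x13, 0x53, 0x45])
# mixed == [0x8E, 0x4D, 0xA1, 0xBC]
```

The first output byte, for example, is $02 \cdot db + 03 \cdot 13 + 53 + 45 = 8e$ .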

AES Encryption

From the key $k$ derive $h + 1$ subkeys $k_{0}, k_{1}, \dots k_{h}$ .

State $\leftarrow$ plaintext

State $\leftarrow$ State $\oplus k_{0}$

for $i = 1, 2, \dots, h - 1$ do
	State ← SubBytes(State)
	State ← ShiftRows(State)
	State ← MixColumns(State)
	State ← State ⊕ k_i
State $\leftarrow$ SubBytes(State)

State $\leftarrow$ ShiftRows(State)

State $\leftarrow$ State $\oplus k_{h}$

ciphertext $\leftarrow$ State

AES Decryption

From the key $k$ derive $h + 1$ subkeys $k_{0}, k_{1}, \dots, k_{h}$

State $\leftarrow$ ciphertext

State $\leftarrow$ State $\oplus k_{h}$

State $\leftarrow$ InvShiftRows(State)

State $\leftarrow$ InvSubBytes(State)

for $i = h - 1, \dots, 2, 1$ do
	State ← State ⊕ k_i
	State ← InvMixColumns(State)
	State ← InvShiftRows(State)
	State ← InvSubBytes(State)
State $\leftarrow$ State $\oplus k_{0}$

plaintext $\leftarrow$ State

For 128-bit keys, AES has 10 rounds, so we need 11 subkeys.

The first subkey $k_{0} = (r_{0}, r_{1}, r_{2}, r_{3})$ is the actual AES key.
The second subkey is $k_{1} = (r_{4}, r_{5}, r_{6}, r_{7})$
The third subkey is $k_{2} = (r_{8}, r_{9}, r_{10}, r_{11})$
The eleventh subkey is $k_{10} = (r_{40}, r_{41}, r_{42}, r_{43})$

The functions $f_{i} : \{0, 1\}^{32} \to \{0, 1\}^{32}$ are defined as follows:

The input is divided into four bytes: $(a, b, c, d)$
Left-rotate the bytes: $(b, c, d, a)$
Apply the AES S-box to each byte: $(S (b), S (c), S (d), S (a))$
XOR the leftmost byte with the constant $ℓ_{i}$ , and output the result: $(S (b) \oplus ℓ_{i}, S (c), S (d), S (a))$

The constants $ℓ_{i}$ (in hex): $ℓ_{1} = 01, ℓ_{2} = 02, ℓ_{3} = 04, ℓ_{4} = 08, ℓ_{5} = 10, ℓ_{6} = 20, ℓ_{7} = 40, ℓ_{8} = 80, ℓ_{9} = 1 b, ℓ_{10} = 36$
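The function $f_{i}$ can be sketched as below. The S-box is computed from its definition (inverse in $GF (2^{8})$ followed by the affine map) to keep the example self-contained, and the leftmost byte of a word is taken to be its most significant byte; both are choices made for this sketch. The sample input `09cf4f3c` is the last word of the example key in the AES standard (FIPS 197, Appendix A.1).

```python
def gf_mul(a, b):
    result = 0
    while b:
        if b & 1:
            result ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11B            # reduce modulo y^8 + y^4 + y^3 + y + 1
        b >>= 1
    return result

def sbox(p):
    q = 1
    for _ in range(254):          # q = p^254: the inverse of p, or 0 if p == 0
        q = gf_mul(q, p)
    r = 0
    for i in range(8):            # affine map with constant 0x63
        bit = ((q >> i) ^ (q >> ((i + 4) % 8)) ^ (q >> ((i + 5) % 8))
               ^ (q >> ((i + 6) % 8)) ^ (q >> ((i + 7) % 8)) ^ (0x63 >> i)) & 1
        r |= bit << i
    return r

ROUND_CONSTANTS = [0x01, 0x02, 0x04, 0x08, 0x10, 0x20, 0x40, 0x80, 0x1B, 0x36]

def f(i, word):
    """Apply f_i (1 <= i <= 10) to a 32-bit word given as an integer."""
    a, b, c, d = word.to_bytes(4, "big")           # split into four bytes
    out = bytes([sbox(b) ^ ROUND_CONSTANTS[i - 1],  # rotate, substitute, add constant
                 sbox(c), sbox(d), sbox(a)])
    return int.from_bytes(out, "big")

value = f(1, 0x09CF4F3C)
# value == 0x8B84EB01
```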

Modes of Operation

In practice, one usually wishes to encrypt a large quantity of data. The plaintext message is $m = m_{1}, m_{2}, \dots, m_{t}$ where each $m_{i}$ is an $L -$ bit block.

Question

How should we use a block cipher $E_{k} : {0, 1}^{L} \to {0, 1}^{L}$ to encrypt $m$ ?

Some modes of operation: ECB, CBC, CTR, GCM, CCM.

Suppose that the plaintext message is $m = m_{1}, m_{2}, \dots, m_{t}$

Electronic Codebook Mode (ECB): encrypts the blocks independently, one at a time. Drawback: Identical plaintext blocks result in identical ciphertext blocks.

Cipher Block Chaining (CBC) mode:

Encryption: Select $c_{0} \in_{R} {0, 1}^{L}$ ( $c_{0}$ is a random non-secret IV), and then compute $c_{i} = E_{k} (m_{i} \oplus c_{i - 1})$ for $i = 1, 2, \dots, t$ . The ciphertext is $(c_{0}, c_{1}, c_{2}, \dots, c_{t})$ .

Decryption: $m_{i} = E_{k}^{- 1} (c_{i}) \oplus c_{i - 1},$ for $i = 1, 2, \dots t$ .

Identical plaintexts with different IVs result in different ciphertexts.
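The difference between ECB and CBC shows up even with a toy cipher. The $64$ -bit block cipher below is hypothetical and insecure; it is only an invertible keyed map standing in for $E_{k}$ . Blocks are represented as integers, and all function names are illustrative.

```python
import secrets

MASK = (1 << 64) - 1
MULT = 0x9E3779B97F4A7C15          # odd, hence invertible modulo 2^64
MULT_INV = pow(MULT, -1, 1 << 64)

def toy_E(k, x):
    """Hypothetical toy block cipher on 64-bit blocks (not secure)."""
    for _ in range(4):
        x = ((x ^ k) * MULT) & MASK
        x = ((x << 13) | (x >> 51)) & MASK
    return x

def toy_D(k, y):
    for _ in range(4):
        y = ((y >> 13) | (y << 51)) & MASK
        y = ((y * MULT_INV) & MASK) ^ k
    return y

def ecb_encrypt(k, blocks):
    return [toy_E(k, m) for m in blocks]

def cbc_encrypt(k, blocks, iv=None):
    c_prev = secrets.randbits(64) if iv is None else iv   # c_0: random, non-secret IV
    out = [c_prev]
    for m in blocks:
        c_prev = toy_E(k, m ^ c_prev)                      # c_i = E_k(m_i XOR c_{i-1})
        out.append(c_prev)
    return out

def cbc_decrypt(k, cblocks):
    # m_i = E_k^{-1}(c_i) XOR c_{i-1}
    return [toy_D(k, c) ^ prev for prev, c in zip(cblocks, cblocks[1:])]

key = 0x0123456789ABCDEF
blocks = [7, 7, 7, 42]             # three identical plaintext blocks
ecb = ecb_encrypt(key, blocks)
cbc = cbc_encrypt(key, blocks, iv=1)
```

In `ecb` the first three ciphertext blocks coincide, while in `cbc` the chaining makes them differ; a fixed `iv` is passed here only to make the example repeatable.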

Chapter 3: Hash Functions

Fundamental Concepts

Hash functions play a fundamental role in cryptography. They are used in a variety of cryptographic primitives and protocols.

They are very difficult to design because of stringent security and performance requirements.

The most commonly used hash functions are:

SHA-1
SHA-2 family: SHA-224, SHA-256, SHA-384, SHA-512
SHA-3 family

SHA-256

SHA-256: $\{0, 1\}^{*} \to \{0, 1\}^{256}$

SHA-256(“Hello there”) = $0x4e47826698bb4630fb4451010062fadbf85d61427cbdfaed7ad0f23f239bed89$
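Hash values like this one can be computed with Python's standard `hashlib` module. The sketch assumes the message is encoded as UTF-8 bytes before hashing, which is a choice made here and not something the notes specify.

```python
import hashlib

def sha256_hex(message):
    """Return the SHA-256 digest of a text message as 64 hex digits (256 bits)."""
    return hashlib.sha256(message.encode("utf-8")).hexdigest()

digest = sha256_hex("Hello there")
digest_abc = sha256_hex("abc")
```

Whatever the input length, the output is always $256$ bits, i.e. $64$ hexadecimal digits.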

Definition

A hash function is a mapping $H$ such that:

$H$ maps binary messages of arbitrary lengths $\leq L$ to outputs of a fixed length $n$ : $H : {0, 1}^{\leq L} \to {0, 1}^{n}$ . ( $L$ is usually large, e.g., $L = 2^{64}$ , whereas $n$ is small, e.g. $n = 256$ )

$H (x)$ can be efficiently computed for all $x \in {0, 1}^{\leq L}$

$H$ is called an $n -$ bit hash function. $H (x)$ is called the hash or message digest of $x$ .

Notes:

The description of a hash function is public; there are no secret keys.
For simplicity, we will usually write ${0, 1}^{*}$ instead of ${0, 1}^{\leq L}$
More generally, a hash function is an efficiently computable function from a set $S$ to a set $T$ .

Consider $H : {0, 1}^{\leq 4} \to {0, 1}^{2}$ to be a hash function mapping a bitstring to its last two digits.

$1001$ is a preimage of $01$ .

$(01, 1001)$ is a collision for $H$ . $01$ is a second preimage of $1001$ .
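The toy hash is small enough to enumerate. The sketch below assumes inputs of length $2$ to $4$ (shorter strings have no two last digits) and represents bitstrings as Python strings; the helper names are illustrative.

```python
from itertools import product

def H(x):
    """Toy hash from the example: keep the last two bits (assumes len(x) >= 2)."""
    return x[-2:]

# All bitstrings of length 2, 3, and 4.
DOMAIN = ["".join(bits) for n in range(2, 5) for bits in product("01", repeat=n)]

def preimages(y):
    """All inputs in the domain that hash to y."""
    return [x for x in DOMAIN if H(x) == y]

pre_01 = preimages("01")   # contains both "01" and "1001"
```

Every hash value here has several preimages, so collisions such as $(01, 1001)$ are unavoidable; the security definitions that follow ask that they be hard to find, not that they do not exist.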

Hash functions are used in all kinds of applications. One reason for the widespread use of hash functions is speed.

Definition

A hash function $H : {0, 1}^{*} \to {0, 1}^{n}$ is preimage resistant if, given a hash value $y \in_{R} {0, 1}^{n}$ , it is computationally infeasible to find (with non-negligible success probability) any $x \in {0, 1}^{*}$ with $H (x) = y$ . ( $x$ is called a preimage of $y$ .)

Password protection on a multi-user computer system:

The server stores $[$ userid, $H$ (password) $]$
If the attacker obtains a copy of the password file, they do not learn any passwords.
This requires preimage resistance.

Definition

A hash function $H : \{0, 1\}^{*} \to \{0, 1\}^{n}$ is 2nd preimage resistant if, given $x \in_{R} \{0, 1\}^{*}$ , it is computationally infeasible to find (with non-negligible success probability) any $x^{'} \in \{0, 1\}^{*}$ with $x^{'} \neq x$ and $H (x^{'}) = H (x)$ .

Modification Detection Codes (MDC’s):

To ensure that a message $m$ is not modified by unauthorized means, one computes $H (m)$ and protects $H (m)$ from unauthorized modification.
This is useful protection and requires 2nd preimage resistance.

Definition

A hash function $H : \{0, 1\}^{*} \to \{0, 1\}^{n}$ is collision resistant if it is computationally infeasible to find $x, x^{'} \in \{0, 1\}^{*}$ with $x^{'} \neq x$ and $H (x^{'}) = H (x)$ . Such a pair $(x, x^{'})$ is called a collision for $H$ .

Message digests for digital signature schemes:

For reasons of efficiency, instead of signing a long message $x$ , the (much shorter) message digest $h = H (x)$ is signed.
This application requires preimage-resistance, 2nd preimage resistance, and collision resistance.
To see why collision resistance is required, suppose that the legitimate signer Alice can find a collision $(x_{1}, x_{2})$ for $H$ . Alice can sign $x_{1}$ and later claim to have signed $x_{2}$ .

Other applications:

Message Authentication Codes
Pseudorandom bit generation
Key derivation functions
Proof-of-work
Quantum-safe signature schemes

Relationships between PR, 2PR, CR

Breaking preimage resistance (PR):

Given: $y \in_{R} {0, 1}^{n}$

Required: $x \in {0, 1}^{*}$ with $H (x) = y$ .

Breaking 2nd preimage resistance (2PR):

Given: $x \in_{R} {0, 1}^{*}$

Required: $x^{'} \in \{0, 1\}^{*}$ with $x^{'} \neq x$ and $H (x^{'}) = H (x)$

Breaking collision resistance (CR):

Given: $-$

Required: $x, x^{'} \in \{0, 1\}^{*}$ with $x^{'} \neq x$ and $H (x^{'}) = H (x)$

Claim $1$ : If $H$ is CR, then $H$ is 2PR

Proof: Suppose that $H : \{0, 1\}^{*} \to \{0, 1\}^{n}$ is not 2PR.

We’ll show that $H$ is not CR.

Select $x \in_{R} \{0, 1\}^{*}$ . Since $H$ is not 2PR, we can efficiently find $x^{'} \in \{0, 1\}^{*}, x^{'} \neq x$ , with $H (x^{'}) = H (x)$ .

Thus, $(x, x^{'})$ is a collision for $H$ that we have efficiently found, showing that $H$ is not CR.

Note: This proof established the contrapositive statement.

Claim $2$ : CR does not guarantee PR

Proof: Suppose that $H : {0, 1}^{*} \to {0, 1}^{n}$ is CR.

Consider the hash function $\overline{H} : {0, 1}^{*} \to {0, 1}^{n + 1}$ defined by

$$
\overline{H} (x) = \begin{cases} 0 \,\|\, H (x), & \text{if } x \notin \{0, 1\}^{n} \\ 1 \,\|\, x, & \text{if } x \in \{0, 1\}^{n} \end{cases}
$$

Then $\overline{H}$ is CR (since $H$ is).

And $\overline{H}$ is not PR since preimages can be efficiently found for at least half of all $y \in {0, 1}^{n + 1}$ , namely the hash values that begin with $1$ .

Note: The hash function $\overline{H}$ is rather contrived. For somewhat uniform hash functions, i.e., hash functions for which all hash values have roughly the same number of preimages, CR does guarantee PR.

Claim $2^{*}$ : Suppose $H$ is somewhat uniform. If $H$ is CR, then $H$ is PR

Proof: Suppose that $H : {0, 1}^{*} \to {0, 1}^{n}$ is not PR.

We’ll show that $H$ is not CR.

Select $x \in_{R} \{0, 1\}^{*}$ and compute $y = H (x)$ . Since $H$ is not PR, we can efficiently find $x^{'} \in \{0, 1\}^{*}$ with $H (x^{'}) = y$ . Since $H$ is somewhat uniform, we expect that $y$ has many preimages, and thus $x^{'} \neq x$ with very high probability. Thus, $(x, x^{'})$ is a collision for $H$ that we have efficiently found, so $H$ is not CR.

Claim $3$ : PR does not guarantee 2PR

Proof: Suppose that $H : {0, 1}^{*} \to {0, 1}^{n}$ is PR.

Define $\overline{H} : {0, 1}^{*} \to {0, 1}^{n}$ by $\overline{H} (x_{1}, x_{2}, \dots, x_{t}) = H (0, x_{2}, \dots, x_{t})$ for all $(x_{1}, x_{2}, \dots, x_{t}) \in {0, 1}^{*}$ .

Then $\overline{H}$ is PR. However $\overline{H}$ is not 2PR.

Claim $4$ : Suppose $H$ is somewhat uniform. If $H$ is 2PR, then $H$ is PR

Proof: Suppose that $H : {0, 1}^{*} \to {0, 1}^{n}$ is not PR.

We will show that $H$ is not 2PR.

So, suppose we are given $x \in_{R} {0, 1}^{*}$ . We compute $y = H (x)$ .

Since $H$ is not PR, we can efficiently find $x^{'} \in {0, 1}^{*}$ with $H (x^{'}) = y$ .

Since $H$ is somewhat uniform, we expect that $x^{'} \neq x$ with very high probability. Hence, $x^{'}$ is a second preimage of $x$ that we have efficiently found.

Thus $H$ is not 2PR.

Claim $5$ : 2PR does not guarantee CR

Proof: Suppose that $H : {0, 1}^{*} \to {0, 1}^{n}$ is 2PR.

Consider $\overline{H} : \{0, 1\}^{*} \to \{0, 1\}^{n}$ defined by $\overline{H} (x) = H (x)$ if $x \neq 1$ , and $\overline{H} (1) = H (0)$ .

Then $\overline{H}$ is not CR, since $(0, 1)$ is a collision for $\overline{H}$ .

Suppose now that $\overline{H}$ is not 2PR. We’ll show that $H$ is not 2PR.

So, we are given $x \in_{R} \{0, 1\}^{*}$ . Since $\overline{H}$ is not 2PR, we can efficiently find $x^{'} \in \{0, 1\}^{*}, x^{'} \neq x$ , with $\overline{H} (x^{'}) = \overline{H} (x)$ . With probability essentially $1$ , we can assume that $x \neq 0, 1$ . Hence, $\overline{H} (x) = H (x)$ .

Now, if $x^{'} \neq 1$ , then $H (x^{'}) = \overline{H} (x^{'}) = \overline{H} (x) = H (x)$ , so $x^{'}$ is a second preimage of $x$ with respect to $H$ .

And, if $x^{'} = 1$ , then $H (0) = \overline{H} (1) = \overline{H} (x^{'}) = \overline{H} (x) = H (x)$ , so $0$ is a second preimage of $x$ with respect to $H$ .

In either case, we have efficiently found a second preimage for $x$ with respect to $H$ .

Hence, $H$ is not 2PR, a contradiction. Thus, $\overline{H}$ is 2PR.

Generic Attacks

Definition

A generic attack on hash functions $H : {0, 1}^{*} \to {0, 1}^{n}$ does not exploit any properties that the specific hash function might have.

In the analysis of a generic attack, we view $H$ as a random function in the sense that for each $x \in {0, 1}^{*}$ , the hash value $y = H (x)$ was defined by selecting $y \in_{R} {0, 1}^{n}$ .

A random function is an ideal hash function (from a security point of view). However, random functions are not practical.

Generic attack for finding preimages

Given $y \in_{R} {0, 1}^{n}$ , repeatedly select arbitrary $x \in {0, 1}^{*}$ until $H (x) = y$ .

The expected number of hash operations is $2^{n}$ .

This generic attack is infeasible if $n \geq 128$ .

Generic attack for finding collisions

Select arbitrary $x \in {0, 1}^{*}$ and store $(H (x), x)$ in a table sorted by first entry. Repeat until a collision is found.

Analysis: By the birthday paradox, the expected number of hash operations is $\sqrt{π 2^{n} /2} \approx 2^{n/2}$.

This generic attack is infeasible if $n \geq 256$ .

This generic attack for finding collisions is optimal. The expected space required is $\sqrt{π 2^{n} /2} \approx 2^{n/2}$.
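The table-based collision search can be tried out on a deliberately short hash. The sketch below is only an illustration: the 24-bit hash `h` (SHA-256 truncated to its first 24 bits) and the use of a dictionary in place of a sorted table are assumptions made for the demo, not part of the notes.

```python
import hashlib
import itertools

def h(x: bytes, n_bits: int = 24) -> int:
    """Toy n-bit hash: the first n_bits of SHA-256 (assumption for the demo)."""
    digest = int.from_bytes(hashlib.sha256(x).digest(), "big")
    return digest >> (256 - n_bits)

def birthday_collision(n_bits: int = 24):
    """Hash distinct inputs until two of them share a hash value."""
    seen = {}  # hash value -> input that produced it (plays the role of the table)
    for counter in itertools.count():
        x = counter.to_bytes(8, "big")
        y = h(x, n_bits)
        if y in seen:
            return seen[y], x, counter + 1  # collision and number of hash operations
        seen[y] = x

x1, x2, evaluations = birthday_collision(24)
print(x1.hex(), x2.hex(), evaluations)
```

For $n = 24$ one would expect the number of hash operations to be on the order of $2^{12}$, with the table holding roughly as many entries, which is the storage cost that the VW method avoids.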

VW: van Oorschot & Wiener parallel collision search.

The expected number of hash operations: $\approx 2^{n/2}$. The expected space required is negligible.

It is easy to parallelize, $m$ -fold speedup with $m$ processors.

The VW collision-finding algorithm can easily be modified to find meaningful collisions.

If collision resistance is desired, then use an $n$ -bit hash function with $n \geq 256$ .

Parallel Collision search (VW method)

Problem: Find a collision for $H : {0, 1}^{*} \to {0, 1}^{n}$ .

Assumption: $H$ is a random function.

Notation: Let $N = 2^{n}$ .

Define a sequence ${x_{i}}_{i \geq 0}$ by $x_{0} \in_{R} {0, 1}^{n}, x_{i} = H (x_{i - 1})$ for $i \geq 1$ .

Let $j$ be the smallest index for which $x_{j} = x_{i}$ for some $i < j$ ; such a $j$ must exist. Then $x_{j + ℓ} = x_{i + ℓ}$ for all $ℓ \geq 1$. By the birthday paradox, $E [j] \approx \sqrt{π N /2} \approx \sqrt{N}$. In fact, $E [i] \approx \frac{1}{2} \sqrt{N}$ and $E [j - i] \approx \frac{1}{2} \sqrt{N}$.

Now, $i \neq 0$ with overwhelming probability, in which event $(x_{i - 1}, x_{j - 1})$ is a collision for $H$.

Question

How to find $(x_{i - 1}, x_{j - 1})$ without using much storage?

We only store distinguished points.

Distinguished points: Select an easily-testable distinguishing property for elements of ${0, 1}^{n}$ , e.g. leading 32 bits are all 0.

Let $θ$ be the proportion of elements of ${0, 1}^{n}$ that are distinguished.

VW method: Compute the sequence $x_{0}, x_{1}, x_{2}, x_{3}, \dots$ and only store the points that are distinguished.

VW Collision Finding

Stage 1: Detecting a collision

Select $x_{0} \in_{R} {0, 1}^{n}$

Store $(x_{0}, 0, -)$ in a sorted table.

$L P \leftarrow x_{0}$ ( $L P$ = last point stored)

For $d = 1, 2, 3, \dots$ do:

Compute $x_{d} = H (x_{d - 1})$

If $x_{d}$ is distinguished then

If $x_{d}$ is already in the table, say $x_{d} = x_{b}$ where $b < d$ , then go to Stage 2

Store $(x_{d}, d, L P)$ in the table.

$L P \leftarrow x_{d}$

Stage 2: Finding a collision

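Let $x_{a}$ be the last point stored before $x_{b}$ (from the table entry $(x_{b}, b, x_{a})$), and let $x_{c} = L P$ be the last point stored before $x_{d}$.

Set $ℓ_{1} \leftarrow b - a, ℓ_{2} \leftarrow d - c$

Suppose $ℓ_{1} \geq ℓ_{2}$, and set $k \leftarrow ℓ_{1} - ℓ_{2}$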

Compute $x_{a + 1}, x_{a + 2}, \dots x_{a + k}$

For $m = 1, 2, 3, \dots$ do:

Compute $(x_{a + k + m}, x_{c + m})$

Until $x_{a + k + m} = x_{c + m}$

The collision is $(x_{a + k + m - 1}, x_{c + m - 1})$
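The two stages can be sketched on a toy hash. Everything specific here is an assumption for illustration: the 32-bit hash `H` (truncated SHA-256 standing in for a random function), the distinguishing property (low 4 bits are zero, so $θ = 1/16$), the step budget `max_steps`, and the retry over a few starting points. Trails that never reach a repeated distinguished point are simply abandoned.

```python
import hashlib

N_BITS = 32   # toy hash length n; real collision resistance needs n >= 256
DP_BITS = 4   # distinguished = low 4 bits are zero, so theta = 1/16

def H(x: int) -> int:
    """Toy hash on n-bit integers: truncated SHA-256 (stand-in for a random function)."""
    digest = hashlib.sha256(x.to_bytes(N_BITS // 8, "big")).digest()
    return int.from_bytes(digest[: N_BITS // 8], "big")

def is_distinguished(x: int) -> bool:
    return x & ((1 << DP_BITS) - 1) == 0

def vw_collision(x0: int, max_steps: int = 1 << 20):
    """Return (u, v) with u != v and H(u) == H(v), or None if this start fails."""
    # Stage 1: walk x_d = H(x_{d-1}), storing x0 and then only distinguished points.
    table = {x0: (0, None, None)}  # point -> (index, last point stored, its index)
    lp, lp_index, x = x0, 0, x0
    for d in range(1, max_steps):
        x = H(x)
        if not is_distinguished(x):
            continue
        if x in table:                 # x_d = x_b for some b < d
            b, xa, a = table[x]
            xc, c = lp, lp_index
            break
        table[x] = (d, lp, lp_index)
        lp, lp_index = x, d
    else:
        return None                    # step budget exhausted
    if xa is None:
        return None                    # the walk returned to x0 itself (i = 0)
    # Stage 2: align the two trails, then step them together until they merge.
    l1, l2 = b - a, d - c
    u, v = xa, xc
    if l1 < l2:
        u, v, l1, l2 = v, u, l2, l1
    for _ in range(l1 - l2):           # advance the longer trail by k = l1 - l2
        u = H(u)
    if u == v:
        return None                    # trails coincide, so there is no collision here
    while True:
        hu, hv = H(u), H(v)
        if hu == hv:
            return u, v
        u, v = hu, hv

def find_collision():
    for start in range(1, 6):          # a few deterministic starting points
        result = vw_collision(H(start))
        if result is not None:
            return result
    raise RuntimeError("no collision found; try other starting points")

u, v = find_collision()
print(hex(u), hex(v), hex(H(u)))
```

Only about a fraction $θ$ of the visited points end up in the table, which is the point of the method: the walk itself costs roughly $\sqrt{N}$ hash evaluations, while the storage stays small.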

VW Analysis

Stage 1: Expected number of $H$ -evaluations is:

$\sqrt{π N /2} + \frac{1}{θ} \approx \sqrt{N} + \frac{1}{θ}$

Stage 2: Expected number of $H$-evaluations is $\leq \frac{3}{θ}$

Overall expected running time: $\sqrt{N} + \frac{4}{θ}$

Expected storage: $\approx 3 n θ \sqrt{N}$ bits

Example

Consider $n = 128$. Take $θ = 1/ 2^{32}$. Then the expected run time of VW collision search is about $2^{64}$ $H$-evaluations (feasible), and the expected storage is 192 gigabytes (negligible).
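The figures in this example can be checked with a few lines of arithmetic. The snippet assumes the running-time and storage estimates $\sqrt{N} + 4/θ$ and $3 n θ \sqrt{N}$ bits, and counts a gigabyte as $2^{30}$ bytes.

```python
import math

n = 128
theta = 2.0 ** -32
sqrt_N = 2.0 ** (n / 2)                     # sqrt(N) with N = 2^n

run_time = sqrt_N + 4 / theta               # expected number of H-evaluations
storage_bits = 3 * n * theta * sqrt_N       # expected storage in bits
storage_gib = storage_bits / 8 / 2 ** 30    # bits -> bytes -> gigabytes (2^30 bytes)

print(math.log2(run_time))   # very close to 64
print(storage_gib)           # 192.0
```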

Iterated Hash Functions

Merkle's Meta Method

Components:

Fixed initializing value $I V \in {0, 1}^{n}$

Efficiently-computable compression function $f : {0, 1}^{n + r} \to {0, 1}^{n}$

To compute $H (x)$ where $x$ has bitlength $b < 2^{r}$ do:

Break up $x$ into $r$-bit blocks, $\overline{x} = x_{1}, x_{2}, \dots, x_{t}$, padding the last block with $0$ bits as necessary.

Define $x_{t + 1}$ , the length-block, to hold the right-justified binary representation of $b$ .

Define $H_{0} = I V$ .

Compute $H_{i} = f (H_{i - 1}, x_{i})$ for $i = 1, 2, \dots, t + 1$. (The $H_{i}$’s are called chaining variables.)

Define $H (x) = H_{t + 1}$
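A minimal sketch of the construction follows. The compression function `f` (SHA-256 truncated to 128 bits), the parameter choices $n = r = 128$, the all-zero $IV$, and the restriction to byte-aligned messages are assumptions made only to keep the example short.

```python
import hashlib

N_BYTES = 16          # n = 128-bit chaining variable (toy choice)
R_BYTES = 16          # r = 128-bit message blocks (toy choice)
IV = bytes(N_BYTES)   # fixed initializing value (all zeros here)

def f(h: bytes, block: bytes) -> bytes:
    """Toy compression function {0,1}^(n+r) -> {0,1}^n built from SHA-256."""
    assert len(h) == N_BYTES and len(block) == R_BYTES
    return hashlib.sha256(h + block).digest()[:N_BYTES]

def iterated_hash(x: bytes) -> bytes:
    b = 8 * len(x)                                   # bitlength of x, must be < 2^r
    padded = x + bytes(-len(x) % R_BYTES)            # pad the last block with 0 bits
    blocks = [padded[i:i + R_BYTES] for i in range(0, len(padded), R_BYTES)]
    blocks.append(b.to_bytes(R_BYTES, "big"))        # length block x_{t+1}
    h = IV                                           # H_0 = IV
    for block in blocks:                             # H_i = f(H_{i-1}, x_i)
        h = f(h, block)
    return h                                         # H(x) = H_{t+1}

print(iterated_hash(b"abc").hex())
```

The length block is what separates `b"abc"` from `b"abc\x00"`: both pad to the same first block, but their final blocks differ.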

Theorem (Merkle)

If the compression function $f$ is collision resistant, then the iterated hash function $H$ is also collision resistant.

Merkle’s theorem reduces the problem of designing collision-resistant hash functions to that of designing collision-resistant compression functions.

A major theme in cryptographic research is to formulate precise security definitions and assumptions, and then prove that a protocol is secure.

A proof of security is certainly desirable since it rules out the possibility of attacks being discovered in the future.

However, it isn’t always easy to assess the practical security assurances (if any) that a security proof provides.

Assumptions might be unrealistic, or false, or circular.
The security model might not account for certain kinds of realistic attacks.
The security proof might be asymptotic.

Proof of Merkle’s Theorem ( $f$ is CR $⟹ H$ is CR)

Suppose that $H$ is not CR. We’ll show that $f$ is not CR.

Since $H$ is not CR, we can efficiently find messages $x, x^{'} \in {0, 1}^{*}$, with $x \neq x^{'}$ and $H (x) = H (x^{'})$.

Let $\overline{x} = x_{1}, x_{2}, \dots, x_{t}$ , $b$ = bit-length( $x$ ), $x_{t + 1} =$ length block.

Let $\overline{x^{'}} = x_{1}^{'}, x_{2}^{'}, \dots, x_{t}^{'}$ , $b^{'}$ = bit-length( $x^{'}$ ), $x_{t + 1}^{'} =$ length block.

We can efficiently compute:

$$\begin{aligned} H_{0} &= I V \\ H_{1} &= f (H_{0}, x_{1}) \\ H_{2} &= f (H_{1}, x_{2}) \\ &\;\;\vdots \\ H_{t - 1} &= f (H_{t - 2}, x_{t - 1}) \\ H_{t} &= f (H_{t - 1}, x_{t}) \\ H (x) = H_{t + 1} &= f (H_{t}, x_{t + 1}) \end{aligned}$$

$$\begin{aligned} H_{0} &= I V \\ H_{1}^{'} &= f (H_{0}, x_{1}^{'}) \\ H_{2}^{'} &= f (H_{1}^{'}, x_{2}^{'}) \\ &\;\;\vdots \\ H_{t^{'} - 1}^{'} &= f (H_{t^{'} - 2}^{'}, x_{t^{'} - 1}^{'}) \\ H_{t^{'}}^{'} &= f (H_{t^{'} - 1}^{'}, x_{t^{'}}^{'}) \\ H (x^{'}) = H_{t^{'} + 1}^{'} &= f (H_{t^{'}}^{'}, x_{t^{'} + 1}^{'}) \end{aligned}$$

Since $H (x) = H (x^{'})$ , we have $H_{t + 1} = H_{t^{'} + 1}^{'}$

Case 1: Now, if $b \neq b^{'}$ then $x_{t + 1} \neq x_{t^{'} + 1}^{'}$. Thus, $(H_{t}, x_{t + 1}), (H_{t^{'}}^{'}, x_{t^{'} + 1}^{'})$ is a collision for $f$ that we have efficiently found.

Case 2: Suppose next that $b = b^{'}$. Then $t = t^{'}$ and $x_{t + 1} = x_{t + 1}^{'}$.

Let $i$ be the largest index, $0 \leq i \leq t$, for which $(H_{i}, x_{i + 1}) \neq (H_{i}^{'}, x_{i + 1}^{'})$. Such an $i$ must exist since $x \neq x^{'}$.
Then $H_{i + 1} = f (H_{i}, x_{i + 1}) = f (H_{i}^{'}, x_{i + 1}^{'}) = H_{i + 1}^{'}$, so $(H_{i}, x_{i + 1}), (H_{i}^{'}, x_{i + 1}^{'})$ is a collision for $f$ that we have efficiently found.

Thus, $f$ is not collision resistant. $□$

MDx is a family of iterated hash functions. MD4 has 128-bit outputs. Professor Xiaoyun Wang found collisions for MD4 by hand.

MD5 is a strengthened version of MD4. MD5 should not be used if collision resistance is required, but is probably okay as a preimage-resistant hash function.

Secure Hash Algorithm (SHA) was designed by NSA and published by NIST in 1993. It is a 160-bit iterated hash function based on MD4.

The SHA-2 design is similar to SHA-1, and thus there were lingering concerns that the SHA-1 weaknesses could eventually extend to SHA-2.

SHA-3 is used in practice, but not as widely deployed as SHA-2.

SHA-256

Iterated hash function (Merkle’s meta method). $n = 256, r = 512$ .

Compression function is $f : {0, 1}^{256 + 512} ⟶ {0, 1}^{256}$

Input: bit string $x$ of arbitrary bitlength $b \geq 0$ .

Output: 256-bit hash value $H (x)$ of $x$ .

SHA-256 Notation

$A, B, C, D, E, F, G, H$ are 32-bit words.

$+$ : addition modulo $2^{32}$
$\overline{A}$ : bitwise complement
$A ≫ s$ : shift $A$ right by $s$ positions
$A ↪ s$ : rotate $A$ right by $s$ positions
$A B$ : bitwise AND of $A, B$
$A \oplus B$ : bitwise exclusive-OR

$f (A, B, C) = A B \oplus \overline{A} C$

$g (A, B, C) = A B \oplus A C \oplus BC$

$r_{1} (A) = (A ↪ 2) \oplus (A ↪ 13) \oplus (A ↪ 22)$

$r_{2} (A) = (A ↪ 6) \oplus (A ↪ 11) \oplus (A ↪ 25)$

$r_{3} (A) = (A ↪ 7) \oplus (A ↪ 18) \oplus (A ≫ 3)$

$r_{4} (A) = (A ↪ 17) \oplus (A ↪ 19) \oplus (A ≫ 10)$

SHA-256 constants

32-bit initial chaining values (IVs): These words were obtained by taking the first 32 bits of the fractional parts of the square roots of the first 8 prime numbers.
Per-round integer additive constants: These words were obtained by taking the first 32 bits of the fractional parts of the cube roots of the first 64 prime numbers.

SHA-256 preprocessing

Pad $x$ with 1, followed by as few 0’s as possible so that the bitlength is 64 less than a multiple of 512.
Append the 64-bit binary representation of $b \bmod 2^{64}$
The formatted input is $x_{0}, x_{1}, \dots, x_{16 m - 1}$ , where each $x_{i}$ is a 32-bit word
Initialize the words of the chaining variable: $(H_{1}, H_{2}, \dots, H_{7}, H_{8}) \leftarrow (h_{1}, h_{2}, \dots, h_{7}, h_{8})$

SHA-256 Processing

For each $i$ from 0 to $m - 1$ do the following:

Copy the $i$th block of sixteen 32-bit words into temporary storage: $X_{j} \leftarrow x_{16 i + j}$, $0 \leq j \leq 15$

Expand the 16-word block into a 64-word block: For $j$ from 16 to 63 do: $X_{j} \leftarrow r_{4} (X_{j - 2}) + X_{j - 7} + r_{3} (X_{j - 15}) + X_{j - 16}$

Initialize working variables: $(A, B, \dots, G, H) \leftarrow (H_{1}, H_{2}, \dots, H_{7}, H_{8})$

For $j$ from 0 to 63 do:

$T_{1} \leftarrow H + r_{2} (E) + f (E, F, G) + y_{j} + X_{j}$, where $y_{j}$ is the $j$th per-round additive constant

$T_{2} \leftarrow r_{1} (A) + g (A, B, C)$

$H \leftarrow G, G \leftarrow F, F \leftarrow E, E \leftarrow D + T_{1}, D \leftarrow C, C \leftarrow B, B \leftarrow A, A \leftarrow T_{1} + T_{2}$

Update chaining variable: $(H_{1}, H_{2}, \dots, H_{7}, H_{8}) \leftarrow (H_{1} + A, H_{2} + B, \dots, H_{7} + G, H_{8} + H)$

Output: SHA256( $x$ ) = $H_{1} ∣∣ H_{2} ∣∣ H_{3} ∣∣ H_{4} ∣∣ H_{5} ∣∣ H_{6} ∣∣ H_{7} ∣∣ H_{8}$
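The description above is concrete enough to implement directly. The sketch below follows the notes' notation ($f, g, r_{1}, \dots, r_{4}$) and derives the constants from square and cube roots of primes using integer arithmetic. The helper names (`icbrt`, `first_primes`, `rotr`) are illustrative, input is assumed to be byte-aligned, and $r_{3}$ uses rotations by 7 and 18 as in the published SHA-256 standard.

```python
import hashlib
from math import isqrt

MASK = 0xFFFFFFFF  # arithmetic is modulo 2^32

def icbrt(n: int) -> int:
    """Integer cube root (floor) by binary search."""
    lo, hi = 0, 1 << (n.bit_length() // 3 + 1)
    while lo < hi:
        mid = (lo + hi + 1) // 2
        if mid ** 3 <= n:
            lo = mid
        else:
            hi = mid - 1
    return lo

def first_primes(count: int) -> list:
    primes, candidate = [], 2
    while len(primes) < count:
        if all(candidate % p for p in primes):
            primes.append(candidate)
        candidate += 1
    return primes

PRIMES = first_primes(64)
IV = [isqrt(p << 64) & MASK for p in PRIMES[:8]]   # fractional bits of square roots
Y = [icbrt(p << 96) & MASK for p in PRIMES]        # fractional bits of cube roots

def rotr(a, s): return ((a >> s) | (a << (32 - s))) & MASK
def f(a, b, c): return (a & b) ^ (~a & c)
def g(a, b, c): return (a & b) ^ (a & c) ^ (b & c)
def r1(a): return rotr(a, 2) ^ rotr(a, 13) ^ rotr(a, 22)
def r2(a): return rotr(a, 6) ^ rotr(a, 11) ^ rotr(a, 25)
def r3(a): return rotr(a, 7) ^ rotr(a, 18) ^ (a >> 3)
def r4(a): return rotr(a, 17) ^ rotr(a, 19) ^ (a >> 10)

def sha256(x: bytes) -> bytes:
    b = 8 * len(x)
    # Preprocessing: a 1 bit, then 0 bits, then the 64-bit length.
    padded = x + b"\x80" + bytes((55 - len(x)) % 64) + (b % 2 ** 64).to_bytes(8, "big")
    chain = list(IV)
    for i in range(0, len(padded), 64):
        X = [int.from_bytes(padded[i + 4 * j: i + 4 * j + 4], "big") for j in range(16)]
        for j in range(16, 64):   # expand 16 words to 64 words
            X.append((r4(X[j - 2]) + X[j - 7] + r3(X[j - 15]) + X[j - 16]) & MASK)
        A, B, C, D, E, F, G, Hh = chain
        for j in range(64):
            T1 = (Hh + r2(E) + f(E, F, G) + Y[j] + X[j]) & MASK
            T2 = (r1(A) + g(A, B, C)) & MASK
            Hh, G, F, E, D, C, B, A = G, F, E, (D + T1) & MASK, C, B, A, (T1 + T2) & MASK
        chain = [(s + w) & MASK for s, w in zip(chain, (A, B, C, D, E, F, G, Hh))]
    return b"".join(w.to_bytes(4, "big") for w in chain)

print(sha256(b"abc").hex())
print(sha256(b"abc") == hashlib.sha256(b"abc").digest())
```

Comparing against `hashlib.sha256` is a convenient sanity check on the padding, the message expansion, and the derived constants all at once.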

Chapter 4: Message Authentication Codes

Fundamental Concepts

Definition

A message authentication code (MAC) scheme is a family of functions $\text{MAC}_{k} : {0, 1}^{*} \to {0, 1}^{n}$ parameterized by an $ℓ$-bit key $k$, where each function $\text{MAC}_{k}$ can be efficiently computed.

$t = \text{MAC}_{k} (x)$ is called the MAC or tag of $x$ with key $k$.

MAC schemes are used for providing (symmetric-key) data integrity and data origin authentication.

To provide data integrity and data origin authentication:

Alice and Bob establish a secret key $k \in_{R} {0, 1}^{ℓ}$
Alice computes the tag $t = \text{MAC}_{k} (x)$ of a message $x$ and sends $(x, t)$ to Bob.
Bob verifies that $t = \text{MAC}_{k} (x)$
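The three steps can be sketched with Python's standard `hmac` module. HMAC-SHA256 is used here only as a convenient example of a MAC scheme, and the function names `mac` and `verify` are illustrative.

```python
import hashlib
import hmac
import secrets

def mac(k: bytes, x: bytes) -> bytes:
    """Compute the tag t = MAC_k(x), here instantiated with HMAC-SHA256."""
    return hmac.new(k, x, hashlib.sha256).digest()

def verify(k: bytes, x: bytes, t: bytes) -> bool:
    """Bob recomputes the tag and compares it in constant time."""
    return hmac.compare_digest(mac(k, x), t)

k = secrets.token_bytes(16)            # shared secret key, l = 128 bits
x = b"transfer 100 dollars to Bob"
t = mac(k, x)                          # Alice sends (x, t)

print(verify(k, x, t))                 # True
print(verify(k, x + b"!", t))          # False: the message was altered
```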

Note: There is no confidentiality or non-repudiation.

Let $k$ be the secret key shared by Alice and Bob.

The adversary does not know $k$ , but is allowed to obtain (from either Alice or Bob) the tags for messages of her choosing. The adversary’s goal is to obtain the tag of a new message of the adversary’s choosing.

Definition

A MAC scheme is secure if given some tags $\text{MAC}_{k} (x_{i})$ for messages $x_{i}$ of one’s own choosing, it is computationally infeasible to compute a message-tag pair $(x, \text{MAC}_{k} (x))$ for a new message $x$. In other words, a MAC scheme is secure if it is existentially unforgeable against chosen-message attack.

An ideal MAC scheme has the following property: For each $k \in {0, 1}^{ℓ}$, the function $\text{MAC}_{k} : {0, 1}^{*} \to {0, 1}^{n}$ is a random function. (This is useless in practice.)

Generic Attack

Guessing the tag of a message $x \in {0, 1}^{*}$. Attack: Select $y \in_{R} {0, 1}^{n}$ and guess that $\text{MAC}_{k} (x) = y$

Analysis: Assuming that the MAC scheme is ideal, the success probability is $1/ 2^{n}$

Generic Attack (2)

Attack: Given $r$ message-tag pairs $(x_{1}, t_{1}), \dots, (x_{r}, t_{r})$, one can check whether a guess $h$ of the key is correct by verifying that $\text{MAC}_{h} (x_{i}) = t_{i}$ for $i = 1, 2, \dots, r$.

Analysis: Assuming the MAC scheme is ideal, the expected number of keys for which all $(x_{i}, t_{i})$ pairs verify is $1 + FK = 1 + (2^{ℓ} - 1) / 2^{n r}$, where $FK$ denotes the expected number of false keys.

Exhaustive key search is infeasible if $ℓ \geq 128$

GSM (Global System for Mobile Communications) security is notable since it uses only symmetric-key primitives.

Objectives:

Entity authentication: The cell phone service provider needs the assurance that entities accessing its service are legitimate subscribers
Confidentiality: The data exchanged between a cell phone user and their cell phone service provider should be confidential

GSM does not provide end-to-end security.

Alice: Cell phone user. Bob: Cell phone service provider

Alice sends an authentication request to Bob
Bob selects a challenge $r \in_{R} {0, 1}^{128}$ and sends $r$ to Alice
Alice’s SIM card uses $k$ to compute the response $t = \text{MAC}_{k} (r)$. Alice sends $t$ to Bob.
Bob retrieves Alice’s key $k$ from its database, and verifies that $t = \text{MAC}_{k} (r)$
Alice and Bob compute an encryption key $K_{E} = \text{KDF}_{k} (r)$, and thereafter use the encryption algorithm $\text{Enc}_{K_{E}} (\cdot)$ to encrypt and decrypt messages for each other for the remainder of the session

CBC-MAC and HMAC

Let $E$ be an $n$ -bit block cipher with key space ${0, 1}^{ℓ}$ .

Assumption: Plaintext messages all have lengths that are multiples of $n$ .

To compute CBC-MAC $_{k} (x)$ :

Divide $x$ into $n$ -bit blocks $x_{1}, x_{2}, \dots, x_{r}$
Compute $H_{1} = E_{k} (x_{1})$
For $i = 2, 3, \dots, r$ , compute $H_{i} = E_{k} (H_{i - 1} \oplus x_{i})$
Then CBC-MAC $_{k} (x) = H_{r}$

CBC-MAC is not secure if variable-length messages are allowed.

Here is a chosen-message attack on CBC-MAC:

Select an arbitrary 3-block message $x = (x_{1}, x_{2}, x_{3})$
Obtain the tag $t_{1}$ of the one-block message $x_{1} : t_{1} = E_{k} (x_{1})$
Obtain the tag $t_{2}$ of the two-block message $(t_{1} \oplus x_{2}, x_{3}) : t_{2} = E_{k} (E_{k} (t_{1} \oplus x_{2}) \oplus x_{3})$
Output the forgery $(x, t_{2})$

Correctness: $t_{2} = E_{k} (E_{k} (t_{1} \oplus x_{2}) \oplus x_{3}) = E_{k} (E_{k} (E_{k} (x_{1}) \oplus x_{2}) \oplus x_{3}) =$ CBC-MAC $_{k} (x)$
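The forgery can be replayed in code. Since the Python standard library has no block cipher, the function `E` below is a stand-in (truncated SHA-256 of key and block); it is not a real cipher and not even a permutation, but the attack only uses $E_{k}$ as a keyed function, so the algebra is unchanged. The `oracle` models the adversary's access to tags for chosen messages.

```python
import hashlib

BLOCK = 16  # n = 128-bit blocks

def E(k: bytes, block: bytes) -> bytes:
    """Stand-in for a block cipher E_k (assumption: truncated SHA-256, demo only)."""
    return hashlib.sha256(k + block).digest()[:BLOCK]

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(u ^ v for u, v in zip(a, b))

def cbc_mac(k: bytes, blocks: list) -> bytes:
    h = E(k, blocks[0])                 # H_1 = E_k(x_1)
    for x in blocks[1:]:
        h = E(k, xor(h, x))             # H_i = E_k(H_{i-1} XOR x_i)
    return h

k = b"secret key bytes"                 # known to Alice and Bob, not to the adversary

def oracle(blocks: list) -> bytes:
    """Chosen-message access: returns the tag of the requested message."""
    return cbc_mac(k, blocks)

# The adversary's attack, using only the oracle.
x1, x2, x3 = b"A" * BLOCK, b"B" * BLOCK, b"C" * BLOCK
t1 = oracle([x1])                       # tag of the one-block message x1
t2 = oracle([xor(t1, x2), x3])          # tag of the two-block message
forged_message, forged_tag = [x1, x2, x3], t2

print(cbc_mac(k, forged_message) == forged_tag)   # True: a valid tag for a new message
```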

One countermeasure for variable-length messages is Encrypted CBC-MAC, where the CBC-MAC tag is encrypted using a second key $s$ : EMAC $_{k, s} (x) = E_{s} (H_{r})$, where $H_{r} =$ CBC-MAC $_{k} (x)$

Theorem

Suppose that $E$ is an ideal encryption scheme. Then EMAC is a secure MAC scheme.

Encryption scheme $E$ is ideal if for each $k \in {0, 1}^{ℓ}, E_{k} : {0, 1}^{n} \to {0, 1}^{n}$ is a random permutation.

Hash functions were not originally designed for message authentication. In particular, they are not “keyed” primitives.

Question

How to use a hash function to construct a secure MAC?

Let $H$ be an iterated $n$-bit hash function. For simplicity, assume that all messages have bitlengths that are multiples of $r$, and suppose that the length-block is omitted.

Let $n + r$ be the input blocklength of the compression function $f : {0, 1}^{n + r} \to {0, 1}^{n}$. Let $k \in_{R} {0, 1}^{n}$. Let $K$ denote $k$ padded with $r - n$ 0’s, so $K$ has bitlength $r$.

Definition

$\text{MAC}_{k} (x) = H (K, x)$

This is insecure. Here is a length extension attack:

Suppose that ( $x$ , MAC $_{k} (x)$ ) is known.
Then MAC $_{k} (x ∣∣ y)$ can be easily computed for any $y$ .

This is also insecure if messages can be of any bitlength and a length block is appended to $K ∣∣ x$.

Suppose that the adversary knows a message-tag pair $(x, t)$ , where $x = x_{1}$ is a one-block message. Hence, $t = f (f (I V, K), x_{1})$ .

The adversary does the following:

Arbitrarily select a one-block message $y$ .
Compute $t^{'} = f (t, y)$ and set $x^{'} = x_{1} ∣∣ y$ . ( $x^{'}$ is a two-block message).
Output $(x^{'}, t^{'})$

Correctness: The message-tag pair $(x^{'}, t^{'})$ is a valid forgery since $t^{'} = f (f (f (I V, K), x_{1}), y)$ .
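The length extension attack is easy to replay on a toy iterated hash. The compression function `f` (truncated SHA-256), the sizes $n = 128$ and $r = 256$, and the all-zero $IV$ are assumptions for the demo. As in the simplified setting above, messages are whole blocks and no length block is used.

```python
import hashlib

N_BYTES, R_BYTES = 16, 32     # n = 128, r = 256 (toy sizes)
IV = bytes(N_BYTES)

def f(h: bytes, block: bytes) -> bytes:
    """Toy compression function built from SHA-256 (assumption for the demo)."""
    return hashlib.sha256(h + block).digest()[:N_BYTES]

def H(blocks: list) -> bytes:
    h = IV
    for block in blocks:
        h = f(h, block)
    return h

def mac(k: bytes, blocks: list) -> bytes:
    K = k + bytes(R_BYTES - N_BYTES)   # k padded with r - n zero bits
    return H([K] + blocks)             # MAC_k(x) = H(K, x)

k = b"0123456789abcdef"                # secret key, unknown to the adversary
x1 = b"pay Bob 10 dollars".ljust(R_BYTES, b" ")
t = mac(k, [x1])                       # known message-tag pair (x, t)

# Adversary: extends the message without knowing k.
y = b"and pay Eve 1000 dollars".ljust(R_BYTES, b" ")
t_forged = f(t, y)                     # t' = f(t, y)
x_forged = [x1, y]                     # x' = x1 || y

print(mac(k, x_forged) == t_forged)    # True
```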

Hash-based MAC (HMAC). Define two $r$-bit strings (in hexadecimal notation): ipad $= 0x36$ and opad $= 0x5C$, each byte repeated $r/8$ times.

Definition

$\text{HMAC}_{k} (x) = H (K \oplus \text{opad}, H (K \oplus \text{ipad}, x))$
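As a sanity check, the definition can be written out with SHA-256 as $H$ (so $r = 512$ bits, i.e. 64 bytes) and compared against Python's built-in `hmac` module. The function name `hmac_sha256` is illustrative, and the sketch assumes the key is at most one block long.

```python
import hashlib
import hmac

R_BYTES = 64  # SHA-256 processes r = 512-bit blocks

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(u ^ v for u, v in zip(a, b))

def hmac_sha256(k: bytes, x: bytes) -> bytes:
    assert len(k) <= R_BYTES             # longer keys are not handled in this sketch
    K = k + bytes(R_BYTES - len(k))      # k padded with zeros to r bits
    ipad = bytes([0x36]) * R_BYTES
    opad = bytes([0x5C]) * R_BYTES
    inner = hashlib.sha256(xor(K, ipad) + x).digest()     # H(K xor ipad, x)
    return hashlib.sha256(xor(K, opad) + inner).digest()  # H(K xor opad, inner)

k = b"0123456789abcdef"
x = b"message to authenticate"
print(hmac_sha256(k, x) == hmac.new(k, x, hashlib.sha256).digest())  # True
```

The outer hash is what blocks length extension: an adversary who extends the inner hash still cannot compute the outer one without $K$.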

HMAC is commonly used as a key derivation function (KDF).

Suppose that Alice has a secret key $k$ , and wishes to derive several session keys $s k_{i}$ , e.g. to encrypt data in different communication sessions.

Alice computes $s k_{1} =$ HMAC $_{k} (1)$ , $s k_{2} =$ HMAC $_{k} (2)$ , $s k_{3} =$ HMAC $_{k} (3), \dots$

Rationale: Without knowledge of $k$ , an adversary is unable to learn anything about any particular session key $s k_{i}$ , even though she might have learnt some other session keys.

Chapter 5: Authenticated Encryption

Fundamental Concepts

A symmetric-key encryption scheme $E$ provides confidentiality.

A MAC scheme provides authentication (data origin authentication and data integrity).

Question

What if confidentiality and authentication are both required?

Alice computes $c = E_{k_{1}} (m)$ and $t =$ MAC $_{k_{2}} (m)$ , and sends $(c, t)$ to Bob. Here, $m$ is the plaintext and $(k_{1}, k_{2})$ is the secret key she shares with Bob.

Bob decrypts $c$ to obtain $m = E_{k_{1}}^{- 1} (c)$ and then verifies that $t =$ MAC $_{k_{2}} (m)$

This generic method is not secure.

Instead consider the following.

Alice computes $c = E_{k_{1}} (m)$ and $t =$ MAC $_{k_{2}} (c)$ , and sends $(c, t)$ to Bob. Here, $m$ is the plaintext and $(k_{1}, k_{2})$ is the secret key she shares with Bob.

Bob first verifies that $t =$ MAC $_{k_{2}} (c)$ and then decrypts $c$ to obtain $m = E_{k_{1}}^{- 1} (c)$

This generic method has been proven to be secure, provided that the encryption scheme $E$ and the MAC scheme employed are secure.
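A sketch of this encrypt-then-MAC composition is shown below. The encryption scheme is a toy stream cipher (a SHA-256 based keystream), chosen only because the standard library has no AES; it is an assumption for illustration and not a vetted cipher. The MAC is HMAC-SHA256, and the random nonce prepended to the ciphertext is likewise part of the toy scheme.

```python
import hashlib
import hmac
import secrets

def keystream_xor(k1: bytes, nonce: bytes, data: bytes) -> bytes:
    """Toy stream cipher: XOR data with SHA-256(k1 || nonce || counter) blocks."""
    out = bytearray()
    for i in range(0, len(data), 32):
        pad = hashlib.sha256(k1 + nonce + (i // 32).to_bytes(8, "big")).digest()
        out += bytes(a ^ b for a, b in zip(data[i:i + 32], pad))
    return bytes(out)

def encrypt_then_mac(k1: bytes, k2: bytes, m: bytes):
    nonce = secrets.token_bytes(12)
    c = nonce + keystream_xor(k1, nonce, m)          # c = E_{k1}(m)
    t = hmac.new(k2, c, hashlib.sha256).digest()     # t = MAC_{k2}(c)
    return c, t

def verify_then_decrypt(k1: bytes, k2: bytes, c: bytes, t: bytes):
    expected = hmac.new(k2, c, hashlib.sha256).digest()
    if not hmac.compare_digest(expected, t):
        return None                                  # reject before decrypting
    nonce, body = c[:12], c[12:]
    return keystream_xor(k1, nonce, body)

k1, k2 = secrets.token_bytes(16), secrets.token_bytes(16)
c, t = encrypt_then_mac(k1, k2, b"attack at dawn")
print(verify_then_decrypt(k1, k2, c, t))             # b'attack at dawn'
tampered = bytes([c[0] ^ 1]) + c[1:]
print(verify_then_decrypt(k1, k2, tampered, t))      # None
```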

Many special-purpose authenticated encryption schemes have been developed, the most popular of which is AES-GCM (AES in Galois/Counter Mode). Some of these authenticated encryption schemes also allow for the authentication of “header” data.

Definition

An authenticated encryption scheme $A E$ is $A E$ -secure if:

$A E$ is semantically secure against chosen-plaintext attack; and

$A E$ has ciphertext integrity, i.e., an adversary who is able to obtain ciphertext-tag pairs $(c_{1}, t_{1}), (c_{2}, t_{2}), \dots, (c_{ℓ}, t_{ℓ})$ for plaintext messages $m_{1}, m_{2}, \dots, m_{ℓ}$ of her choosing, is unable to produce a valid ciphertext-tag pair $(c, t)$ where $c \notin {c_{1}, c_{2}, \dots, c_{ℓ}}$.

AES-GCM

AES-GCM is an authenticated encryption scheme designed by David McGrew and John Viega. Uses the CTR mode of encryption and GMAC, a custom-designed MAC scheme.

CTR: Counter mode of encryption.

Let $k \in_{R} {0, 1}^{128}$ be the secret key shared by Alice and Bob. Let $M = (M_{1}, M_{2}, \dots, M_{u})$ be a plaintext message, where each $M_{i}$ is a 128-bit block and $u \leq 2^{32} - 2$ .

To encrypt $M$ , Alice does the following:

Select a nonce $I V \in {0, 1}^{96}$
Let $J_{0} = I V ∣∣ 0^{31} ∣∣1$
For $i$ from 1 to $u$ do: $J_{i} \leftarrow J_{i - 1} + 1$ and compute $C_{i} =$ AES $_{k} (J_{i}) \oplus M_{i}$
Send $(I V, C_{1}, C_{2}, \dots, C_{u})$ to Bob.

To decrypt, Bob does the following:

Let $J_{0} = I V ∣∣ 0^{31} ∣∣1$
For $i$ from 1 to $u$ do: $J_{i} \leftarrow J_{i - 1} + 1$ and compute $M_{i} =$ AES $_{k} (J_{i}) \oplus C_{i}$

CTR mode of encryption can be viewed as a stream cipher. As was the case with CBC encryption, identical plaintexts with different IVs result in different ciphertexts. Unlike CBC encryption, CTR encryption is parallelizable. Note that AES $^{- 1}$ is not used. The secret key can have bitlength 128, 192, or 256.

Let $a = a_{0} a_{1} a_{2} \dots a_{127}$ be a 128-bit block. We associate the binary polynomial $a (x) = a_{0} + a_{1} x + a_{2} x^{2} + \dots + a_{127} x^{127} \in Z_{2} [x]$ with $a$ .

Let $f (x) = 1 + x + x^{2} + x^{7} + x^{128}$ .

If $a$ and $b$ are 128-bit blocks, then define $c = a \cdot b$ to be the block corresponding to the polynomial $c (x) = a (x) \cdot b (x) (mod f (x))$ .

That is, $c (x)$ is the remainder upon dividing $a (x) \cdot b (x)$ by $f (x)$ in $Z_{2} [x]$ . This is multiplication in the Galois field $GF (2^{128})$ .

Let $A = (A_{1}, A_{2}, \dots, A_{v})$ , where each $A_{i}$ is a 128-bit block. Let $L$ be the bitlength of $A$ . Let $k \in_{R} {0, 1}^{128}$ be the secret key.

Let $J_{0} = I V ∣∣ 0^{31} ∣∣1$, where $I V \in {0, 1}^{96}$ is a nonce.
Compute $H =$ AES $_{k} (0^{128})$
Let $f_{A} (x) = A_{1} x^{v + 1} + A_{2} x^{v} + \dots + A_{v - 1} x^{3} + A_{v} x^{2} + Lx \in GF (2^{128}) [x]$
Compute the authentication tag $t =$ AES $_{k} (J_{0}) \oplus f_{A} (H)$
Send $(I V, A, t)$

Example

Let $A = (A_{1}, A_{2}, A_{3})$. Then $f_{A} (x) = A_{1} x^{4} + A_{2} x^{3} + A_{3} x^{2} + Lx$.

Hence, $f_{A} (H) = A_{1} H^{4} + A_{2} H^{3} + A_{3} H^{2} + L H$.

$f_{A} (H)$ can be computed using Horner’s rule: $f_{A} (H) = ((((((A_{1} \cdot H) + A_{2}) \cdot H) + A_{3}) \cdot H) + L) \cdot H$.

This requires three additions and four multiplications in $GF (2^{128})$.

In general, if $A$ has blocklength $v$ , then computing $f_{A} (H)$ using Horner’s rule requires $v$ additions and $v + 1$ multiplications in $GF (2^{128})$ .
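The field multiplication and Horner's rule can be sketched as follows. Blocks are modelled as Python integers in which bit $i$ is the coefficient of $x^{i}$; converting actual 128-bit blocks into this form (with $a_{0}$ as the first bit) is left out, and the sample values of `A`, `L` and `H` are arbitrary. Addition in $GF (2^{128})$ is bitwise XOR.

```python
# f(x) = 1 + x + x^2 + x^7 + x^128, with bit i holding the coefficient of x^i
F_POLY = (1 << 128) | (1 << 7) | (1 << 2) | (1 << 1) | 1

def gf_mul(a: int, b: int) -> int:
    """Multiply two elements of GF(2^128) (shift-and-add with reduction mod f)."""
    result = 0
    while b:
        if b & 1:
            result ^= a          # add a * x^i for each set bit of b
        b >>= 1
        a <<= 1                  # multiply a by x
        if a >> 128:
            a ^= F_POLY          # reduce modulo f(x)
    return result

def f_A(blocks: list, L: int, H: int) -> int:
    """Evaluate A_1 H^(v+1) + ... + A_v H^2 + L H by Horner's rule."""
    acc = 0
    for block in blocks:         # one addition and one multiplication per block
        acc = gf_mul(acc ^ block, H)
    return gf_mul(acc ^ L, H)    # final addition of L and multiplication by H

A = [3, 5, 7]                    # three sample blocks (arbitrary values)
L = 384                          # bitlength of A
H = 0x66E94BD4EF8A2C3B884CFA59CA342B2E   # arbitrary sample value for H
print(hex(f_A(A, L, H)))
print(hex(gf_mul(1 << 127, 2)))  # x^127 * x = x^128 = x^7 + x^2 + x + 1, i.e. 0x87
```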

Consider the simplified tag: $t^{'} = f_{A} (H)$

An adversary can guess the tag $t^{'}$ of a message $A$ with success probability $\frac{1}{2 ^{128}}$ . She can also guess the tag $t^{'}$ by making a guess $H^{'}$ for $H$ and computing $f_{A} (H^{'})$ . Her success probability is at most $\frac{v + 1}{2 ^{128}}$ , where $v$ is the blocklength of $A$ . However, if the adversary sees a single valid message-tag pair $(A, t^{'})$ she can solve the polynomial equation $f_{A} (H) = t^{'}$ for $H$ . To circumvent the aforementioned attack, a second secret AES $_{k} (J_{0})$ is used to hide $t^{'} : t =$ AES $_{k} (J_{0}) \oplus f_{A} (H)$ . The secret AES $_{k} (J_{0})$ serves as a one-time pad for $t^{'}$ .

Input:

Data to be authenticated (but not encrypted), called the additional authenticated data (AAD) or encryption context: $A = (A_{1}, A_{2}, \dots, A_{v})$
Data to be encrypted and authenticated: $M = (M_{1}, M_{2}, \dots, M_{u}), u \leq 2^{32} - 2$
Secret key $k \in_{R} {0, 1}^{128}$, shared between Alice and Bob.

Output: $(I V, A, C, t)$ where

$I V$ is a 96-bit initialization vector.
$A = (A_{1}, A_{2}, \dots, A_{v})$ is the additional authenticated data.
$C = (C_{1}, C_{2}, \dots, C_{u})$ is the encrypted/authenticated data.
$t$ is a 128-bit authentication tag.

Alice does the following:

Let $L = L_{A} ∣∣ L_{M}$ , where $L_{A}, L_{M}$ are bitlengths of $A, M$ expressed as 64-bit integers.
Select a nonce $I V \in {0, 1}^{96}$ and let $J_{0} = I V ∣∣ 0^{31} ∣∣1$
Encryption: For $i$ from 1 to $u$ do: Compute $J_{i} = J_{i - 1} + 1$ and $C_{i} =$ AES $_{k} (J_{i}) \oplus M_{i}$
Authentication: Compute $H =$ AES $_{k} (0^{128})$ . Compute $t =$ AES $_{k} (J_{0}) \oplus f_{A, C} (H)$
Output: $(I V, A, C, t)$

Note: $f_{A, C} (x) = A_{1} x^{u + v + 1} + A_{2} x^{u + v} + \dots + A_{v - 1} x^{u + 3} + A_{v} x^{u + 2} + C_{1} x^{u + 1} + C_{2} x^{u} + \dots + C_{u - 1} x^{3} + C_{u} x^{2} + Lx$

Upon receiving $(I V, A, C, t)$ Bob does the following:

Let $L = L_{A} ∣∣ L_{C}$ , where $L_{A}, L_{C}$ are bitlengths of $A, C$ expressed as 64-bit integers.
Authentication: Let $J_{0} = I V ∣∣ 0^{31} ∣∣1$. Compute $H =$ AES $_{k} (0^{128})$. Compute $t^{'} =$ AES $_{k} (J_{0}) \oplus f_{A, C} (H)$. If $t^{'} = t$ then proceed to decryption; if $t^{'} \neq t$ then reject.
Decryption: For $i$ from 1 to $u$ do: Compute $J_{i} = J_{i - 1} + 1$ and $M_{i} =$ AES $_{k} (J_{i}) \oplus C_{i}$
Output: $(A, M)$

IV’s should not be repeated (with the same key $k$ ). Suppose an IV is reused, and an eavesdropper captures two transmissions: $(I V, A_{1}, C_{1}, t_{1}), (I V, A_{2}, C_{2}, t_{2})$. Suppose also that $M_{1}$ and $M_{2}$ have the same blocklengths, and that the eavesdropper knows $M_{1}$.

Then $t_{1} =$ AES $_{k} (J_{0}) \oplus f_{A_{1}, C_{1}} (H)$ and $t_{2} =$ AES $_{k} (J_{0}) \oplus f_{A_{2}, C_{2}} (H)$ , so $t_{1} \oplus t_{2} = f_{A_{1}, C_{1}} (H) \oplus f_{A_{2}, C_{2}} (H)$ .

This polynomial equation can be quickly solved for $H$ , and then AES $_{k} (J_{0}) = t_{1} \oplus f_{A_{1}, C_{1}} (H)$ can be computed.

Thereafter, the adversary can properly encrypt/authenticate any plaintext (of blocklength at most that of $M_{1}$ ).

Chapter 6: Public-Key Cryptography

Fundamental Concepts

Symmetric-key cryptography: Communicating parties a priori share some secret keying information.

The shared secret keys can then be used to achieve confidentiality (e.g. using AES-CBC), or authentication (e.g. using HMAC), or both (e.g. using AES-GCM).

Question

How can Alice and Bob establish the secret key $k$ ?

Method 1: Point-to-point key distribution

flowchart LR
Alice --->|k| Bob

The secured channel could be:

A trusted courier
A face-to-face meeting
A SIM card that contains an authentication key

Method 2: Use a Trusted Third Party (TTP) $T$ .

Each user $A$ shares a secret key $k_{A T}$ with $T$ for a symmetric-key encryption scheme $E$. To establish this key, $A$ must visit $T$ once.

$T$ serves as a key distribution centre (KDC).

flowchart LR
A --->|Request A,B| T
T --->|EkBTk| B
T --->|EkATk| A

$A$ sends $T$ a request for a key to share with $B$
$T$ selects a session key $k$ , and encrypts it for $A$ using $k_{A T}$
$T$ encrypts $k$ for $B$ using $k_{BT}$

Drawbacks of using a KDC:

The TTP must be unconditionally trusted
The TTP is an attractive target
The TTP must be on-line, and is therefore a potential bottleneck and critical reliability point.

With a network of $n$ users, each user has to share a different secret key with every other user. Each user has to store and manage $n - 1$ different secret keys. The total number of secret keys is $\binom{n}{2} \approx \frac{n^{2}}{2}$.

Non-repudiation is impractical.

Public-key cryptography: Communicating parties a priori share some authenticated (but non-secret) information.

Merkle Puzzles

Alice and Bob establish a secret session key by communicating over an authenticated (but non-secret) channel.

Alice creates $N$ puzzles $P_{1}, P_{2}, \dots, P_{N}$ (e.g., $N = 10^{9}$ ). Each puzzle takes $t$ hours to solve. The solution to $P_{i}$ reveals a 128-bit session key $s k_{i}$ and a randomly selected 128-bit serial number $n_{i}$
Alice sends $P_{1}, P_{2}, \dots, P_{N}$ to Bob
Bob selects $j \in_{R} [1, N]$ and solves puzzle $P_{j}$ to obtain $s k_{j}$ and $n_{j}$
Bob sends $n_{j}$ to Alice.
The secret session key is $s k_{j}$

Key pair generation for public key cryptography:

Each entity $A$ does the following:

Generate a key pair $(P_{A}, S_{A})$
$A$’s public key is $P_{A}$ and her secret key is $S_{A}$.

Security requirement: It should be infeasible for an adversary to recover $S_{A}$ from $P_{A}$ .

Example

$S_{A} = (p, q)$ where $p, q$ are randomly-selected prime numbers, and $P_{A} = p \cdot q$

To encrypt a message $m$ for Bob, Alice does:

Obtain an authentic copy of Bob’s public key $P_{B}$
Compute $c = E (P_{B}, m)$ , where $E$ is the encryption function.
Send $c$ to Bob.

To decrypt $c$ , Bob does:

Compute $m = D (S_{B}, c)$ where $D$ is the decryption function.

To sign a message $m$ , Alice does:

Compute $s =$ Sign( $S_{A}, m$ ).
Send $(m, s)$ to Bob

To verify Alice’s signature $s$ on $m$ , Bob does:

Obtain an authentic copy of Alice’s public key $P_{A}$ .
Accept if Verify $(P_{A}, m, s) =$ “Accept”

If Alice generates a signed message $(m, s)$ , anyone who has the authentic copy of Alice’s public key $P_{A}$ can verify the authenticity of the signed message.

Advantages of public-key cryptography

No requirement for a secured channel
Each user has only one key pair
A signed message can be verified by anyone

Disadvantages of public-key cryptography

Public-key schemes are slower than their symmetric-key counterparts.

In practice, symmetric-key and public-key schemes are used together.

Elementary Number Theory

Content covered in MATH135

The set of integers is $Z = {\dots, - 3, - 2, - 1, 0, 1, 2, 3, \dots}$

An integer $a$ is said to divide an integer $b$, written $a ∣ b$, if there exists an integer $c$ such that $b = c a$.

Division algorithm: If $a$ and $b$ are integers with $b \geq 1$, then the ordinary long division of $a$ by $b$ yields unique integers $q$ and $r$ with $a = q b + r$ and $0 \leq r < b$.

An integer $p \geq 2$ is said to be prime if its only positive divisors are $1$ and $p$ . Otherwise, $p$ is called composite.

Fundamental Theorem of Arithmetic: Every integer $n \geq 2$ has a factorization as a product of prime powers: $n = p_{1}^{e_{1}} p_{2}^{e_{2}} \dots p_{k}^{e_{k}}$ , where the $p_{i}$ are distinct primes and the $e_{i}$ are positive integers. Furthermore, the factorization is unique up to rearrangement of factors.

Prime Number Theorem: For $x \geq 2$, let $π (x)$ denote the number of primes between $2$ and $x$. For all $x \geq 114$, we have $\frac{x}{\log_{e} x} < π (x) < 1.25 \frac{x}{\log_{e} x}$

Let $a$ and $b$ be integers, not both $0$. The greatest common divisor of $a$ and $b$, denoted $\gcd (a, b)$, is the largest positive integer $d$ that divides both $a$ and $b$.

Integers $a$ and $b$ are said to be relatively prime, or coprime, if $\gcd (a, b) = 1$.

Fact

If $a$ and $b$ are positive integers with $a \geq b$, then $\gcd (a, b) = \gcd (b, a \bmod b)$.

Euclidean algorithm for computing $\gcd (a, b)$ where $a \geq b$.

While $b \neq 0$ do the following:

Set $r \leftarrow a \bmod b$, $a \leftarrow b$, $b \leftarrow r$

Return $(a)$ .
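The loop translates almost line for line into Python. The function name `euclid_gcd` is illustrative; Python's standard library already offers `math.gcd`, which is used here only as a cross-check.

```python
import math

def euclid_gcd(a: int, b: int) -> int:
    """Euclidean algorithm: repeatedly replace (a, b) by (b, a mod b)."""
    while b != 0:
        a, b = b, a % b
    return a

print(euclid_gcd(48, 18))    # 6
print(euclid_gcd(3120, 17))  # 1, so 17 and 3120 are coprime
print(euclid_gcd(48, 18) == math.gcd(48, 18))
```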

Integers modulo $n$ : $Z_{n} = {0, 1, 2, \dots, n - 1}$, where addition, subtraction, and multiplication are performed modulo $n$.

Definition

Let $a \in Z_{n}$. The multiplicative inverse of $a$ modulo $n$ is an integer $x \in Z_{n}$ such that $a x \equiv 1 \pmod{n}$. If such an $x$ exists, then it is unique, and $a$ is said to be invertible modulo $n$ ; the inverse of $a$ modulo $n$ is denoted by $a^{- 1} \bmod n$

Fact

Let $a \in Z_{n}$. Then $a^{- 1} \bmod n$ exists if and only if $\gcd (a, n) = 1$

Computing inverses modulo $n$. Let $a \in [1, n - 1]$ with $\gcd (a, n) = 1$. Use the extended Euclidean algorithm (EEA) to find integers $x$ and $y$ such that $a x + n y = 1$. Then $a x \equiv 1 \pmod{n}$, so $a^{- 1} \bmod n = x \bmod n$
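A small sketch of this inverse computation follows. The function names `extended_gcd` and `mod_inverse` are illustrative. Python 3.8+ can also compute the same value with `pow(a, -1, n)`, which serves as a cross-check.

```python
def extended_gcd(a: int, b: int):
    """Return (d, x, y) with a*x + b*y = d = gcd(a, b)."""
    x0, y0, x1, y1 = 1, 0, 0, 1      # invariants: a = a0*x0 + b0*y0, b = a0*x1 + b0*y1
    while b != 0:
        q = a // b
        a, b = b, a - q * b
        x0, x1 = x1, x0 - q * x1
        y0, y1 = y1, y0 - q * y1
    return a, x0, y0

def mod_inverse(a: int, n: int) -> int:
    d, x, _ = extended_gcd(a, n)
    if d != 1:
        raise ValueError("a is not invertible modulo n")
    return x % n                     # reduce x into Z_n

print(mod_inverse(3, 7))       # 5, since 3 * 5 = 15 = 2 * 7 + 1
print(mod_inverse(17, 3120))   # 2753, since 17 * 2753 = 15 * 3120 + 1
```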

Algorithmic Number Theory

Theorem: Every integer $n \geq 2$ has a unique prime factorization.

Finding this prime factorization efficiently is believed to be a hard problem. Verifying a given prime factorization is easy. Deciding whether a number $n$ is prime or composite is also easy.

An algorithm is a “well-defined computational procedure” that takes a variable input and eventually halts with some output.

The efficiency of an algorithm is measured by the scarce resources it consumes.

Definition

The input size is the number of bits required to write down the input using a reasonable encoding, e.g., the size of a positive integer $n$ is $\lfloor \log_{2} n \rfloor + 1$ bits.

Definition

The running time of an algorithm is an upper bound, as a function of the input size, of the worst case number of basic operations the algorithm executes over all inputs of a fixed size.

Definition

An algorithm is a polynomial-time algorithm if its expected running time is $O (k^{c})$ , where $k$ is the input size and $c$ is a fixed positive integer.

Definition

If $f (n)$ and $g (n)$ are functions from the positive integers to the real numbers, then $f (n) = O (g (n))$ means that there exists a positive constant $c$ and a positive integer $n_{0}$ such that $f (n) \leq c g (n) \forall n \geq n_{0}$ .

Modular exponentiation.

Input: A $k$ -bit integer $n$ , and integers $a, m \in [0, n - 1]$

Output: $a^{m} mod n$

Naïve algorithm #1:

Compute $d = a^{m}$
Return ( $d mod n$ )

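Analysis: The bitlength of $d$ is $\approx \log_{2} d = \log_{2} a^{m} = m \log_{2} a = O (2^{k} k)$, since $m$ can be as large as $\approx 2^{k}$ and $a$ has at most $k$ bits. Hence the algorithm is not polytime: even writing down $d$ can take exponentially many bits.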

Naïve algorithm #2:

$A \leftarrow a$
For $i$ from 2 to $m$ do:
1. $A \leftarrow A \times a mod n$
Return( $A$ )

This algorithm is also not polytime: the loop performs $m - 1$ modular multiplications, and $m$ can be as large as $\approx 2^{k}$.

Let the binary representation of $m$ be $m = \sum_{i = 0}^{k - 1} m_{i} 2^{i}$, where $m_{i} \in \{0, 1\}$. Then:

$$a^{m} \equiv a^{\sum_{i = 0}^{k - 1} m_{i} 2^{i}} \equiv \prod_{i = 0}^{k - 1} a^{m_{i} 2^{i}} \equiv \prod_{\substack{0 \leq i \leq k - 1 \\ m_{i} = 1}} a^{2^{i}} \pmod{n}$$

We can do the following repeated square-and-multiply algorithm for computing $a^{m} mod n$

Write $m$ in binary $m = \sum_{i = 0}^{k - 1} m_{i} 2^{i}$

If $m_{0} = 1$ then $B \leftarrow a$ ; else $B \leftarrow 1$ .
$A \leftarrow a$
For $i$ from 1 to $k - 1$ do:
1. $A \leftarrow A^{2} mod n$
2. If $m_{i} = 1$ then $B \leftarrow B \times A mod n$
Return ( $B$ )

Analysis: At most $k$ modular squarings and $k$ modular multiplications are performed, each costing $O (k^{2})$ bit operations, so the worst-case running time is $O (k^{3})$ bit operations. This is polytime.
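A direct Python transcription of the repeated square-and-multiply algorithm is sketched below. The function name `square_and_multiply` is chosen for this example; Python's built-in `pow(a, m, n)` computes the same value and serves as a cross-check.

```python
def square_and_multiply(a, m, n):
    """Compute a^m mod n by scanning the bits of m from least significant."""
    bits = bin(m)[2:][::-1]            # bits[i] is m_i, for i = 0 .. k-1
    B = a % n if bits[0] == "1" else 1 % n
    A = a % n
    for bit in bits[1:]:
        A = (A * A) % n                # A now holds a^(2^i) mod n
        if bit == "1":
            B = (B * A) % n            # multiply in a^(2^i) when m_i = 1
    return B


print(square_and_multiply(5, 117, 19) == pow(5, 117, 19))  # True
```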

Chapter 7: RSA

Basic RSA

RSA is used for public-key encryption and signatures.

RSA Key Generation:

Each entity $A$ does the following:

Randomly select two large, distinct primes $p$ and $q$ of the same bitlength
Compute $n = pq$ and $ϕ = ϕ (n) = (p - 1) (q - 1)$ ( $n$ is called the RSA modulus)
Select an arbitrary integer $e, 1 < e < ϕ$, with $\gcd (e, ϕ) = 1$ ($e$ is called the encryption exponent)
Compute the integer $d, 1 < d < ϕ$, with $e d \equiv 1 (mod ϕ)$ ($d = e^{- 1} mod ϕ$ is called the decryption exponent)
$A$'s public key is $(n, e)$; her private key is $d$.

Basic RSA public-key encryption scheme

RSA encryption: To encrypt a message for $A$, $B$ does the following:

Obtain an authenticated copy of $A$'s public key $(n, e)$
Represent the message as an integer $m \in [0, n - 1]$
Compute the ciphertext $c = m^{e} mod n$
Send $c$ to $A$

RSA decryption: To decrypt $c$ , $A$ does the following:

Compute $m = c^{d} mod n$
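A toy run of key generation, encryption, and decryption is sketched below. The primes $p = 61$, $q = 53$ and the exponent $e = 17$ are illustrative values chosen for this example and are far too small for real use.

```python
from math import gcd

# Key generation (toy parameters, assumed for illustration only)
p, q = 61, 53
n = p * q                      # RSA modulus
phi = (p - 1) * (q - 1)
e = 17                         # encryption exponent
assert 1 < e < phi and gcd(e, phi) == 1
d = pow(e, -1, phi)            # decryption exponent, d = e^(-1) mod phi


def encrypt(m):
    return pow(m, e, n)        # c = m^e mod n


def decrypt(c):
    return pow(c, d, n)        # m = c^d mod n


m = 65
c = encrypt(m)
print(n, d, c, decrypt(c))     # 3233 2753 2790 65
```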

Theorem

For all $m \in [0, n - 1]$ , if $c = m^{e} mod n$ , then $m = c^{d} mod n$

Proof: We’ll prove that $m^{e d} \equiv m (mod n)$ for all $m \in [0, n - 1]$

Since $e d \equiv 1 (mod ϕ)$ , we can write $e d = 1 + k ϕ = 1 + k (p - 1) (q - 1)$ for some $k \in Z$ . Since $e d > 1$ and $(p - 1) (q - 1) \geq 1$ , we have $k \geq 1$ . We will now prove that $m^{e d} \equiv m (mod p)$ .

Suppose first that $p$ divides $m$ . Then $m \equiv 0 (mod p)$ , so $m^{e d} \equiv 0^{e d} \equiv 0 (mod p)$ . Thus, $m^{e d} \equiv m (mod p)$

Suppose now that $p$ does not divide $m$ . By Fermat’s Little Theorem, we have $m^{p - 1} \equiv 1 (mod p)$ . Raising both sides to the power $k (q - 1)$ , and then multiplying by $m$ , gives $m^{1 + k (p - 1) (q - 1)} \equiv m (mod p)$ . Thus, $m^{e d} \equiv m (mod p)$ .

So, we conclude that $m^{e d} \equiv m (mod p)$ for all $m \in [0, n - 1]$ .

Similarly, $m^{e d} \equiv m (mod q)$ . Since $p$ and $q$ both divide $m^{e d} - m$ , and since $p$ and $q$ are distinct primes, we can conclude that $pq$ divides $m^{e d} - m$ . Thus, $m^{e d} \equiv m (mod n)$ .

RSA signature generation: To sign a message $m \in \{0, 1\}^{*}$, $A$ does the following:

Compute $M = H (m)$, where $H$ is a hash function
Compute the signature $s = M^{d} mod n$
$A$'s signed message is $(m, s)$

RSA signature verification: To verify $(m, s)$, $B$ does the following:

Obtain an authenticated copy of $A$'s public key $(n, e)$
Compute $M = H (m)$
Compute $M^{'} = s^{e} mod n$
Accept $(m, s) ⟺ M = M^{'}$

Integer Factorization

Let $f (n)$ and $g (n)$ be functions from the positive integers to the positive real numbers.

Big-O notation: We write $f (n) = O (g (n))$ if there exists a positive constant $c$ and a positive integer $n_{0}$ such that $f (n) \leq c g (n)$ for all $n \geq n_{0}$ (see CS341).

Little-o notation: We write $f (n) = o (g (n))$ if $\lim_{n \to \infty} \frac{f (n)}{g (n)} = 0$.

Polynomial time algorithm: One whose worst-case running time is of the form $O (n^{c})$ where $n$ is the input size and $c$ is a constant.

Exponential time algorithm: One whose worst-case running time is not of the form $O (n^{c})$ for any constant $c$ . In this course, fully exponential-time functions are of the form $2^{c n}$ , where $c$ is a constant.

Sub-exponential-time algorithm: One whose worst-case running time function is of the form $2^{o (n)}$ , and not of the form $O (n^{c})$ for any constant $c$ .

Let $A$ be an algorithm whose input is an integer $n$. The input size is $O (\log n)$.

If the expected running time is of the form $L_{n} [α, c] = O (\exp ((c + o (1)) (\log_{e} n)^{α} (\log_{e} \log_{e} n)^{1 - α}))$, where $c$ is a positive constant, and $α$ is a constant satisfying $0 < α < 1$, then $A$ is a sub-exponential-time algorithm.

Factoring is believed to be a hard problem. However, we have no proof or theoretical evidence that factoring is hard.

RSA Encryption

Security of RSA key generation: If an adversary can factor $n$, she can compute $d$ from $(n, e)$. It has been proven that any efficient method for computing $d$ from $(n, e)$ is equivalent to factoring $n$.

Security of Basic RSA encryption: A basic notion of security is that it should be computationally infeasible to compute $m$ from $c$ . This is known as the RSA problem.

RSA Problem (RSAP): Given an RSA public key $(n, e)$ and $c = m^{e} mod n$ (where $m \in_{R} [0, n - 1]$ ), compute $m$ .

The only effective method known for solving RSAP is to factor $n$ . Henceforth, we shall assume that RSAP is intractable.

Dictionary attack:

Suppose that the plaintext $m$ is chosen from a relatively small (and known) set $M$ of messages. Then, given a target ciphertext $c$, the adversary can encrypt each $m \in M$ until $c$ is obtained.

Countermeasure: Append a randomly selected 128-bit string (called salt) to $m$ prior to encryption. Note that $m$ is now encrypted to one of $2^{128}$ possible ciphertexts, so a dictionary attack is infeasible.
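The dictionary attack and the salting countermeasure can be illustrated with a small sketch. All concrete values are assumptions made for this example: the modulus is the product of the Mersenne primes $2^{107} - 1$ and $2^{127} - 1$ (much too small for real use), $e = 65537$, and the message set is hypothetical.

```python
import secrets

# Toy public key (assumed parameters, insecure sizes)
p, q = 2**107 - 1, 2**127 - 1
n, e = p * q, 65537

messages = [b"BUY", b"SELL", b"HOLD"]   # hypothetical small message set M


def encode(msg):
    return int.from_bytes(msg, "big")


def encrypt(m_int):
    return pow(m_int, e, n)


def dictionary_attack(c):
    """Encrypt each candidate message until the target ciphertext appears."""
    for msg in messages:
        if encrypt(encode(msg)) == c:
            return msg
    return None


def encrypt_salted(msg):
    """Append a random 128-bit salt to the message before encrypting."""
    salt = secrets.randbits(128)
    return encrypt((encode(msg) << 128) | salt)


print(dictionary_attack(encrypt(encode(b"SELL"))))   # b'SELL'
print(dictionary_attack(encrypt_salted(b"SELL")))    # None
```

Basic RSA is deterministic, so equal plaintexts give equal ciphertexts; the salt removes that property.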

Chosen-Ciphertext Attack

Suppose the adversary $E$ has a target ciphertext $c$ that was encrypted for $A$. Suppose also that $E$ can induce $A$ to decrypt any ciphertext for $E$, except for $c$ itself. Then $E$ can decrypt $c$ as follows:

Select arbitrary $x \in [2, n - 1]$ with $\gcd (x, n) = 1$

Compute $\hat{c} = c x^{e} mod n$, where $(n, e)$ is $A$'s public key

Obtain the decryption $\hat{m} \equiv \hat{c}^{d} \equiv (c x^{e})^{d} \equiv c^{d} x^{e d} \equiv m x (mod n)$

Compute $m = \hat{m} x^{- 1} mod n$
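The four steps above can be traced on a toy key. The parameters ($p = 61$, $q = 53$, $e = 17$) and the helper `decryption_oracle`, which stands in for $A$ agreeing to decrypt anything except the target, are assumptions made for this sketch.

```python
from math import gcd

# Toy key pair (assumed parameters)
p, q = 61, 53
n, e = p * q, 17
d = pow(e, -1, (p - 1) * (q - 1))


def decryption_oracle(c_query, c_target):
    """A decrypts any ciphertext for E except the target itself."""
    if c_query == c_target:
        raise ValueError("refusing to decrypt the target ciphertext")
    return pow(c_query, d, n)


def chosen_ciphertext_attack(c, x=2):
    assert gcd(x, n) == 1
    c_hat = (c * pow(x, e, n)) % n           # blinded ciphertext c * x^e
    m_hat = decryption_oracle(c_hat, c)      # equals m * x mod n
    return (m_hat * pow(x, -1, n)) % n       # unblind: m = m_hat * x^(-1)


c = pow(65, e, n)                            # target ciphertext for m = 65
print(chosen_ciphertext_attack(c))           # 65
```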

Countermeasure: Add some prescribed formatting to $m$ prior to encryption. After decrypting the ciphertext $c$ , if the plaintext is not properly formatted, then $A$ rejects $c$ .

Summary: RSA encryption should incorporate salting and formatting.

Definition

A public-key encryption scheme is secure if it is semantically secure against chosen-ciphertext attack by a computationally bounded adversary.

To break a public-key encryption scheme, the adversary $E$ has to accomplish the following:

$E$ is given the public key and a challenge ciphertext $c$ .
$E$ has a decryption oracle, to which she can present any ciphertexts for decryption except for $c$ itself.
After a feasible amount of computation, $E$ should learn something about the plaintext $m$ that corresponds to $c$ .

A key encapsulation mechanism (KEM) allows two parties to establish a shared secret key, called a session key. A KEM is comprised of three algorithms:

Key generation: Each user (Alice) uses this algorithm to generate an encapsulation key $e k$ (public key) and a decapsulation key $d k$ (the private key).

In the RSA-based instantiation, $A$'s public encapsulation key is $e k = (n, e)$ and $A$'s private decapsulation key is $d k = d$.

Encapsulation: Bob uses Alice’s encapsulation key $e k$ to generate a secret key $k$ and a ciphertext $c$ , and sends $c$ to Alice

To select and transport a session key $k$ for $A$, $B$ does the following:

Obtain an authenticated copy of $A$'s encapsulation key $(n, e)$
Select $r \in_{R} [0, n - 1]$
Compute $c = r^{e} mod n$ and $k = \mathrm{KDF} (r)$
Send $c$ to $A$

Decapsulation: Alice uses her decapsulation key $d k$ to recover $k$ from the ciphertext $c$ .

$A$ processes $c$ as follows:

Compute $r = c^{d} mod n$ and $k = \mathrm{KDF} (r)$
The session key is $k$
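Encapsulation and decapsulation can be sketched as below. The notes do not fix a particular KDF, so SHA-256 over a fixed-length encoding of $r$ is used purely as a stand-in; the modulus (a product of two Mersenne primes) and $e = 65537$ are also toy assumptions.

```python
import hashlib
import secrets

# Toy key pair (assumed parameters, insecure sizes)
p, q = 2**107 - 1, 2**127 - 1
n, e = p * q, 65537
d = pow(e, -1, (p - 1) * (q - 1))
BYTE_LEN = (n.bit_length() + 7) // 8


def kdf(r):
    """Stand-in KDF (assumption): SHA-256 of r encoded as a fixed-length string."""
    return hashlib.sha256(r.to_bytes(BYTE_LEN, "big")).digest()


def encapsulate():
    """Bob: pick random r in [0, n-1], return (ciphertext c, session key k)."""
    r = secrets.randbelow(n)
    return pow(r, e, n), kdf(r)


def decapsulate(c):
    """Alice: recover r = c^d mod n and derive the same session key."""
    return kdf(pow(c, d, n))


c, k_bob = encapsulate()
k_alice = decapsulate(c)
print(k_alice == k_bob)   # True
```

Because $r$ is random and never reused as a message, the dictionary attack on basic RSA does not apply to the transported value.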

RSA Signatures

Basic RSA signature scheme:

Key generation: Each entity $A$ does the following:

Randomly select two large distinct primes $p$ , $q$ of the same bitlength
Compute $n = pq$ and $ϕ = (p - 1) (q - 1)$
Select arbitrary $e, 1 < e < ϕ$, such that $\gcd (e, ϕ) = 1$
Compute $d, 1 < d < ϕ$, such that $e d \equiv 1 (mod ϕ)$
$A$'s public key is $(n, e)$; $A$'s private key is $d$

Signature generation: To sign a message $m \in \{0, 1\}^{*}$, $A$ does the following:

Compute $M = H (m)$, where $H$ is a hash function
Compute $s = M^{d} mod n$ (so $s^{e} \equiv M^{e d} \equiv M (mod n)$)
$A$'s signature on $m$ is $s$

Signature verification: To verify $A$'s signed message $(m, s)$, $B$ does the following:

Obtain an authentic copy of $A$'s public key $(n, e)$
Compute $M = H (m)$
Compute $M^{'} = s^{e} mod n$
Accept $(m, s) ⟺ M = M^{'}$
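Signing and verifying can be sketched as follows. The key is a toy one (product of two Mersenne primes, $e = 65537$), and SHA-224 is used as $H$ only because its 224-bit output always fits below this particular modulus; both choices are assumptions for the example, not part of the scheme as stated.

```python
import hashlib

# Toy key pair (assumed parameters, insecure sizes)
p, q = 2**107 - 1, 2**127 - 1
n, e = p * q, 65537
d = pow(e, -1, (p - 1) * (q - 1))


def H(m):
    """Stand-in hash (assumption): SHA-224 digest read as an integer < n."""
    return int.from_bytes(hashlib.sha224(m).digest(), "big")


def sign(m):
    return pow(H(m), d, n)             # s = M^d mod n


def verify(m, s):
    return pow(s, e, n) == H(m)        # accept iff s^e mod n equals H(m)


s = sign(b"transfer 10 dollars")
print(verify(b"transfer 10 dollars", s))   # True
print(verify(b"transfer 99 dollars", s))   # False
```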

Hardness of RSAP: We require RSAP to be intractable, since otherwise $E$ could forge $A$'s signature as follows.

Select arbitrary $m$
Compute $M = H (m)$
Solve $s^{e} \equiv M (mod n)$ for $s$
Then $s$ is $A$'s signature on $m$

Goals of the adversary:

Total break: $E$ recovers $A$'s private key, or a method for systematically forging $A$'s signatures
Existential forgery: $E$ forges $A$'s signature for a single message; $E$ might not have any control over the content or structure of this message

Attack model:

Key-only attack: The only information $E$ has is $A$'s public key
Known-message attack: $E$ knows some message-signature pairs
Chosen-message attack: $E$ has access to a signing oracle which it can use to obtain $A$'s signatures on some messages of its choosing.

Definition

A signature scheme is secure if it is existentially unforgeable by a computationally bounded adversary who launches a chosen-message attack.

Question

Is the basic RSA signature scheme secure?

No, if $H$ is SHA-256; yes, if $H$ is “full domain”.

RSA-FDH (Full Domain Hash RSA): Same as the basic RSA signature scheme, except that the hash function is $H : \{0, 1\}^{*} \to [0, n - 1]$ where $n$ is the RSA modulus.

Theorem

If RSAP is intractable and $H$ is a random function, then RSA-FDH is a secure signature scheme.

PKCS 1 v1.5 RSA Signatures

Public Key Cryptographic Standards (PKCS)

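Signature generation: To sign $m \in \{0, 1\}^{*}$, Alice does:

Compute $h = H (m)$, where $H$ is a hash function from an approved list
Format $h$ into a byte string $M$ of length $k$, where $k$ is the byte length of $n$: the padding bytes 00 01 FF ... FF 00, followed by the hash name, followed by $h$
Compute $s = M^{d} mod n$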
Send $(m, s)$

Signature verification: Bob does:

Obtain an authentic copy of Alice’s public key $(n, e)$
Compute $M = s^{e} mod n$ , and write $M$ as a byte string of length $k$
Check the formatting
From the next 15 bytes, get the hash name; say $H =$ SHA-1
Let $h =$ next 20 bytes
Compute $h^{'} = H (m)$
Accept $⟺ h = h^{'}$

Assumptions:

The encryption exponent is $e = 3$ : this is commonly used in practice.
The hash function is $H =$ SHA-1: this is without loss of generality.
The RSA modulus $n$ has bitlength 3072 (384 bytes): this is without much loss of generality.
The verifier doesn’t check that there are no leftover bytes to the right of $h$

Bleichenbacher's attack

1. Select arbitrary $m \in \{0, 1\}^{*}$

2. Compute $h = H (m)$

3. Let $D$ be the following 288-bit integer: the byte 00, followed by the 15-byte hash name, followed by the 20-byte hash $h$

4. Let $N = 2^{288} - D$

5. Check that $3 \mid N$; if $3 \nmid N$, then modify $m$ slightly and go to step 2

6. Let $s = 2^{1019} - 2^{34} N / 3$

7. Output $(m, s)$
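The forgery can be checked numerically without knowing any private key. The sketch below takes $s = 2^{1019} - 2^{34} N / 3$, so that $s^{3} = 2^{3057} - 2^{2360} + 2^{2072} D + \text{(low-order garbage)} < 2^{3057}$ and no reduction modulo a 3072-bit $n$ occurs. Assumptions for this example: the hash name is the standard 15-byte ASN.1 prefix for SHA-1, "modify $m$ slightly" means appending spaces, and `lenient_verify` is a hypothetical verifier that skips the leftover-bytes check.

```python
import hashlib

# 15-byte hash name for SHA-1 (ASN.1 DigestInfo prefix)
SHA1_NAME = bytes.fromhex("3021300906052b0e03021a05000414")


def forge(message):
    """Return (m, s) such that s^3 passes the lenient format check."""
    pad = 0
    while True:
        m = message + b" " * pad                  # modify m slightly
        h = hashlib.sha1(m).digest()
        D = int.from_bytes(b"\x00" + SHA1_NAME + h, "big")   # 288 bits
        N = 2**288 - D
        if N % 3 == 0:
            return m, 2**1019 - 2**34 * (N // 3)
        pad += 1


def lenient_verify(m, s, k=384):
    """Verifier with e = 3 that ignores bytes to the right of h."""
    # s^3 < 2^3057, so s^3 mod n = s^3 for every 3072-bit modulus n
    M = (s ** 3).to_bytes(k, "big")
    if M[:2] != b"\x00\x01":
        return False
    i = 2
    while i < k and M[i] == 0xFF:                 # skip the FF padding
        i += 1
    if i >= k or M[i] != 0x00:
        return False
    i += 1
    if M[i:i + 15] != SHA1_NAME:
        return False
    h = M[i + 15:i + 35]
    return h == hashlib.sha1(m).digest()          # leftover bytes unchecked


m, s = forge(b"Pay Eve 1000 dollars")
print(lenient_verify(m, s))   # True
```

A verifier that also checks that $h$ ends exactly at the last byte of $M$ rejects this signature, since the low-order bytes of $s^{3}$ are nonzero garbage.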

Elliptic Curve Cryptography

Elliptic Curves

Example

Elliptic curves over $R$ :

$E / R : Y^{2} = X^{3} - X$

$E / R : Y^{2} = X^{3} - X + 1$

Consider the elliptic curve $E / Z_{11} : Y^{2} = X^{3} + X + 6$ .

The set of $Z_{11}$-rational points on $E$ is: $E (Z_{11}) = \{\infty, (2, 4), (2, 7), \dots, (10, 2), (10, 9)\}$. $\# E (Z_{11}) = 13$ (the cardinality).
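The point count in this example can be confirmed by brute force. The sketch below simply tests every pair $(x, y)$ modulo 11, which is only feasible for tiny primes.

```python
# Curve Y^2 = X^3 + X + 6 over Z_11 (from the example above)
p, a, b = 11, 1, 6

# Affine points: all (x, y) satisfying the curve equation modulo p
points = [(x, y) for x in range(p) for y in range(p)
          if (y * y - (x**3 + a * x + b)) % p == 0]

count = len(points) + 1        # +1 for the point at infinity
print(points)
print(count)                   # 13
```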

Definition

An elliptic curve $E$ over $F$ is defined by a Weierstrass equation $E / F : Y^{2} = X^{3} + a X + b$ where $a, b \in F$ with $4 a^{3} + 27 b^{2} \neq 0$.

Definition

The set of $F$-rational points on $E$ is $E (F) = \{(x, y) \in F \times F : y^{2} = x^{3} + a x + b\} \cup \{\infty\}$ where $\infty$ is a special point at infinity.

Let $E / Z_{p} : Y^{2} = X^{3} + a X + b$ be an elliptic curve.

Then $\# E (Z_{p})$ is finite. It is easy to see that $1 \leq \# E (Z_{p}) \leq 2 p + 1$, since each of the $p$ possible $x$-coordinates gives at most two points, plus $\infty$.

Theorem

Let $E$ be an elliptic curve defined over $Z_{p}$. Then $(\sqrt{p} - 1)^{2} \leq \# E (Z_{p}) \leq (\sqrt{p} + 1)^{2}$

There is an efficient algorithm for determining $\# E (Z_{p})$.

There is a natural way to add two points in $E (F)$ to get a third point in $E (F)$ .

Let $E$ be an elliptic curve defined over $R$ . $\infty$ is an imaginary point through which every vertical line passes.

The geometric rule for point addition is:

Let $P, Q \in E (R)$
Let $ℓ$ denote the straight line through $P$ and $Q$
Let $T \in E (R)$ be the third point of intersection of $ℓ$ with the elliptic curve.
Then $P + Q$ is the reflection of $T$ in the $X$ -axis

Addition rules: Let $E$ be an elliptic curve defined over $F$ .

A1: $P + \infty = \infty + P = P$ for all $P \in E (F)$

A2: If $P = (x, y) \in E (F)$, then $- P = (x, - y)$; also $- \infty = \infty$. Furthermore, $P + (- P) = (- P) + P = \infty$ for all $P \in E (F)$

A3: Let $P = (x_{1}, y_{1})$, $Q = (x_{2}, y_{2}) \in E (F)$, with $P \neq \pm Q$. Then $P + Q = (x_{3}, y_{3})$ where $x_{3} = λ^{2} - x_{1} - x_{2}$, $y_{3} = - y_{1} + λ (x_{1} - x_{3})$, and $λ = \frac{y_{2} - y_{1}}{x_{2} - x_{1}}$

A4: Let $P = (x_{1}, y_{1}) \in E (F)$, with $P \neq - P$. Then $P + P = (x_{3}, y_{3})$ where $x_{3} = λ^{2} - 2 x_{1}$, $y_{3} = - y_{1} + λ (x_{1} - x_{3})$, and $λ = \frac{3 x_{1}^{2} + a}{2 y_{1}}$
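Rules A1 to A4 translate almost line by line into code. The sketch below works over $Z_{11}$ on the curve $Y^{2} = X^{3} + X + 6$ from the earlier example; representing $\infty$ by `None` and the function name `add` are choices made for this illustration.

```python
# Curve Y^2 = X^3 + X + 6 over Z_11
p, a, b = 11, 1, 6
INF = None                                     # the point at infinity


def add(P, Q):
    if P is INF:                               # A1
        return Q
    if Q is INF:
        return P
    x1, y1 = P
    x2, y2 = Q
    if x1 == x2 and (y1 + y2) % p == 0:        # A2: Q = -P
        return INF
    if P == Q:                                 # A4: doubling
        lam = (3 * x1 * x1 + a) * pow((2 * y1) % p, -1, p) % p
    else:                                      # A3: distinct points
        lam = (y2 - y1) * pow((x2 - x1) % p, -1, p) % p
    x3 = (lam * lam - x1 - x2) % p
    y3 = (-y1 + lam * (x1 - x3)) % p
    return (x3, y3)


P = (2, 7)
print(add(P, P))              # (5, 2)
print(add(add(P, P), P))      # (8, 3)
print(add(P, (2, 4)))         # None, since (2, 4) = -P
```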

Fact

$(E (F), +)$ is an abelian group

In other words, the addition rule satisfies the following properties:

P1: $P + \infty = P$ for all $P \in E (F)$
P2: For each $P \in E (F)$ , there exists $Q \in E (F)$ such that $P + Q = \infty$
P3: $P + Q = Q + P$ for all $P, Q \in E (F)$
P4: $(P + Q) + R = P + (Q + R)$ for all $P, Q, R \in E (F)$

Elliptic Curve Discrete Logarithm Problem

Definition

Let $P \in E (Z_{p})$ and let $k \in N$. Then $k P = \underbrace{P + P + \dots + P}_{k}$. Also, $0 P = \infty$ and $(- k) P = - (k P)$. The operation $k P$ is called point multiplication.

Theorem

Suppose $n = \# E (Z_{p})$ is prime, and let $P \in E (Z_{p})$ with $P \neq \infty$. Then:

$n P = \infty$

The points $\infty, P, 2 P, 3 P, \dots, (n - 1) P$ are distinct, and so $E (Z_{p}) = \{\infty, P, 2 P, 3 P, \dots, (n - 1) P\}$

$P$ is called a generator of $E (Z_{p})$ .

Note: $k P = (k mod n) P$ for all $k \in Z$ .
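Point multiplication is usually computed by double-and-add, the additive analogue of square-and-multiply. The sketch below is an illustration on the toy curve $Y^{2} = X^{3} + X + 6$ over $Z_{11}$ (where $n = 13$ is prime); the helper names and the use of `None` for $\infty$ are assumptions of this example.

```python
p, a, b = 11, 1, 6            # curve Y^2 = X^3 + X + 6 over Z_11
n = 13                        # number of points, a prime
INF = None                    # the point at infinity


def add(P, Q):
    """Point addition following rules A1 to A4."""
    if P is INF:
        return Q
    if Q is INF:
        return P
    x1, y1 = P
    x2, y2 = Q
    if x1 == x2 and (y1 + y2) % p == 0:
        return INF
    if P == Q:
        lam = (3 * x1 * x1 + a) * pow((2 * y1) % p, -1, p) % p
    else:
        lam = (y2 - y1) * pow((x2 - x1) % p, -1, p) % p
    x3 = (lam * lam - x1 - x2) % p
    return (x3, (-y1 + lam * (x1 - x3)) % p)


def multiply(k, P):
    """Compute kP by double-and-add; negative k uses (-k)P = -(kP)."""
    if k < 0:
        R = multiply(-k, P)
        return INF if R is INF else (R[0], (-R[1]) % p)
    R, Q = INF, P
    while k:
        if k & 1:
            R = add(R, Q)     # add in the current power-of-two multiple
        Q = add(Q, Q)         # double
        k >>= 1
    return R


P = (2, 7)
print(multiply(n, P))                                # None, i.e. nP = infinity
print(len({multiply(k, P) for k in range(n)}))       # 13 distinct points
print(multiply(15, P) == multiply(15 % n, P))        # True
```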

Definition

Elliptic Curve Discrete Logarithm Problem (ECDLP): Given $E, p, n, P \in E (Z_{p})$ (with $P \neq \infty$) and $Q \in_{R} E (Z_{p})$, find the integer $ℓ \in [0, n - 1]$ such that $Q = ℓ P$

Definition

The integer $ℓ$ is called the discrete logarithm of $Q$ to the base $P$, written $ℓ = \log_{P} Q$
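On a toy curve the ECDLP can be solved by exhaustive search, which makes the definition concrete. The sketch below walks through $0P, 1P, 2P, \dots$ on $Y^{2} = X^{3} + X + 6$ over $Z_{11}$ until it reaches $Q$; it takes up to $n$ additions, so it is hopeless for cryptographic sizes of $n$. Helper names and the `None` encoding of $\infty$ are assumptions of this example.

```python
p, a, b = 11, 1, 6            # curve Y^2 = X^3 + X + 6 over Z_11
n = 13                        # number of points, a prime
INF = None                    # the point at infinity


def add(P, Q):
    """Point addition following rules A1 to A4."""
    if P is INF:
        return Q
    if Q is INF:
        return P
    x1, y1 = P
    x2, y2 = Q
    if x1 == x2 and (y1 + y2) % p == 0:
        return INF
    if P == Q:
        lam = (3 * x1 * x1 + a) * pow((2 * y1) % p, -1, p) % p
    else:
        lam = (y2 - y1) * pow((x2 - x1) % p, -1, p) % p
    x3 = (lam * lam - x1 - x2) % p
    return (x3, (-y1 + lam * (x1 - x3)) % p)


def discrete_log(P, Q):
    """Return l in [0, n-1] with Q = lP, by trying l = 0, 1, 2, ..."""
    R = INF                   # R holds lP during the search
    for l in range(n):
        if R == Q:
            return l
        R = add(R, P)
    raise ValueError("Q is not a multiple of P")


print(discrete_log((2, 7), (8, 3)))    # 3
print(discrete_log((2, 7), (2, 4)))    # 12, since (2, 4) = -P = (n - 1)P
```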
