Public-Key Cryptography
Public-Key cryptography works by using pairs of mathematically related keys generated by cryptographic algorithms. These cryptographic algorithms are based on difficult ( one-way ) mathematical problems, such as the discrete logarithm assumption. In other words, the security of many public-key cryptographic algorithms depends on mathematical and computational difficulty. A key pair consists of a private key and a public key; the private key should never be shared with anyone. Public-key algorithms support encryption and digital signatures.
Encryption provides confidentiality and works by using a recipients public key to encrypt a message. Once the message is received, the recipient will use his or her own private key to decrypt the message.
Digital signatures provide integrity and non-repudiation. Integrity refers to the assurance that the data is accurate and has not been changed. Non-repudiation is the inability for someone to deny the origin or source of data. An example use case would be for a service such as DocuSign, where authenticity of a signature is vital, and, once a contract is signed, it should not be possible for anyone to deny the authenticity of the signature. In order to provide a digital signature, the sender of a message will encrypt the hash of the message using their private key. The receiver of a message will use the senders public key to decrypt the hash, and compare the message hash computed by the sender with the hash computed locally.
Math Background
I will briefly introduce a few mathematical properties necessary for understanding Diffie-Hellman. This is by no means comprehensive, and you are encouraged to do additional reading on your own. You may be tempted to skip the math section; although it may not be the most exiting for some, it is necessary to understand Diffie-Hellman. Don't Skip
Division and Remainder
-
∀ a, b ∈ Z (set of intergers) , ∃ q, r ∈ Z such that a = b * q + r :
- a is the dividend
- b is the divisor
- q is the quotient
- r is the remainder
- ⌊ a/b ⌋ = q
- a ≡ r (mod b)
Greatest Common Divisor
- ∀ a, b ∈ Z, a | b iff the remainder of dividing b by a is zero.
- a | b should be read “
a
dividesb
” - iff = if and only if
- a | b should be read “
- The greatest common divisor of
a
andb
is the largest number that is a divisor of botha
andb
.- gcd(a, b) = d
- if gcd(a, b) = 1,
a
andb
are said to be coprime or relatively prime. - gcd(a, 0) = a ∀ a ≠ 0.
- ∀ a, b ∈ Z, ∃ s, t ∈ Z such that gcd(a, b) = a * s + b * t
GCD(a, b) can easily be coded using the euclidian algorithm as show below. Note
: do not confuse the Euclidian algorithm with the Extended Euclidian algorithm; I will introduce that in my next post that will cover RSA.
def gcd(a, b):
if (b == 0 && a != 0):
return a
return gcd(b, a % b)
Prime Numbers
- p is a prime number iff its set of divisors is {1, p}
- ∀ a ∈ Z, gcd(a, p) ∈ {1, p}
- ∀ a ≠ 0 ∈ Zp, gcd(a, p) = 1
- How many prime numbers are ≤ x ?
- π(x) = x / ln(x)
- Prime Factorization
- All integers can be expressed as a product of prime numbers
- E.g. 48 = 24
- Factorization is the process of finding that product
- Prime factorization is computationally Hard for large numbers
- Currently, there aren’t any known algorithms that can compute this in polynomial time.
- All integers can be expressed as a product of prime numbers
Multiplicative Group
-
To keep this post brief, I will provide minimal explanation of Group Mathematics. For further understanding, read more about Group Mathematics. If you have time, check out Galois Groups and Galois Fields as well. Understanding Galois finite field mathematics isn’t necessary for understanding Diffie-Hellman in particular, but it is important for understanding other types of cryptography, such as AES (Advanced Encryption Standard). It is, however, crucial that you understand Group Mathematics, because it plays an important role in Diffie-Hellman.
- Groups are sets with operations that possess the following properties:
- Closed
- Associative
- Identity Element
- Inverse element
- For the purposes of this blog, I will focus on a multiplicative group G. The multiplication operation provides includes the following group properties:
- Closed: ∀ a, b ∈ G, a * b ∈ G
- Associative: ∀ a, b, c &isin G, (a * b) * c = a * (b * c)
- Identity Element: ∃ e0 ∈ G, ∀ a ∈ G, e0 * a = a * e0 = a
- Inverse Element: ∀ a ∈ G, ∃ b ∈ G, a * b = b * a = e0
-
Group( Zn* )
- Zn* = {a ∈ Zn | gcd(a,b) = 1}
- Removes 0 because gcd(a, 0) = a ∀ a ≠ 0.
- If n is prime, only 0 is removed
- This means Zn* is the set of all integers ∈ Zn such that gcd(a, n) = 1. That is, a and n are coprime.
- Operation is multiplication (mod n)
- closed, associative, and commutative
- Identity element is (e0 = 1)
- Inverse Element
- ∀ a ∈ Zn*, gcd(a, n) = 1
Note
: not ∀ a ∈ Zn, rather ∀ a ∈ Zn*- gcd(a, n) = a * s + n * t
- 1 = a * s + n * t
- 1 = a * s + n * t (mod n)
- 1 ≡ a * s (mod n)
- a-1 ≡ s (mod n)
- Zn* = {a ∈ Zn | gcd(a,b) = 1}
- Subgroup
- Ga = {a1, a2, …, am-1, am} ⊆ G
- Ga is a subset of G and inherits all of the operations of G
- ∀ a ∈ G, |a| = |Ga|, |G>sub>a</sub>| divides |G|
- In set theory | G | is the cardinality or number of elements in a set. It is also referred to as “order of.”
- Z10* = {1, 3, 7, 9}
- |Z10*| = 4
- In Group Math, |a| is the smallest m ∈ N, such that am (mod n) = e0 identity element.
- Select an element from the set Z10* ! Let’s choose 3.
- You can see below that when m is 4, 34 = 1.
Remember (mod n)
- {31, 32, 33, 34} = {3, 9, 7, 1}
- You can see below that when m is 4, 34 = 1.
- Let’s also try 9
- {91, 92} = {9, 1}
- Notice how |3| produces the set {3, 9, 7, 1}, which includes all of the numbers in Z10*. However, |9| only includes {9, 1}. This is important because it means that 3 is a generator of Z10*.
- an element a ∈ Zn* is a generator of Zn* iff |a| = |G|
Diffie-Hellman
Okay! Now we’re ready to explore the Diffie-Hellman algorithm. Diffie-Hellman make heavy application of the math outlined above, so make sure you understand it before moving on.
Public-key cryptography is significantly slower than symmetric-key cryptography, and is typically used to bootstrap symmetric-key cryptography. Diffie-Hellman facilitates key agreement or key exchange.
The following outlines the Diffie-Hellman process for some user U and K
- Step 1: The first step is for users to agree on a prime numbers
p
andq
.- q is a prime number
- p is a large prime number, ideally 4096 bits; 2048-bit is acceptable
p must be a safe prime; that is, p = 2q + 1
- Find a prime number q, multiply it by 2, and add 1.
- g ∈ Zp*
- g is a
generator
(see above for explanation of a generator) that generates an orderq
subgroup of Zp*. - gq produces every element of Zp*
- g is a
- Why is it necessary to use a generator g ∈ Zp* and not any arbitrary member a ∈ Zp* ?
- Using a generator means all possible keys generated by g are reachable through the exponentiation of g, which is important for computing public keys. When searching for the smallest m such that gm = |Gg| = | Zp* |, we stop at the identity element to ensure that the entire subgroup is covered.
- There is likely a better explanation available online.
- Using a generator means all possible keys generated by g are reachable through the exponentiation of g, which is important for computing public keys. When searching for the smallest m such that gm = |Gg| = | Zp* |, we stop at the identity element to ensure that the entire subgroup is covered.
- Step 2: Select a random prime a that will be the private key
- a ∈ Zp*.
- Should not be small
- User K has the private key ak
- User U has the private key au
- Step 3: Compute public keys
- user K, the public key computed is K = gak (mod p)
- user U, the public key computed is U = gau (mod p)
-
Step 4: Send Public Keys
- Step 5: Compute shared secret key
Note:
a is the private key only known to the individual users U and K; au and ak, respectively.- User K, computes a shared secret key using the public key of user U.
shared_key
= Uak (mod p)
- User U, computes a shared secret key using the public key of user K.
shared_key
= Kau (mod p)
- Step 6: Both User K and U have a shared secret and turn it into a symmetric key
- An example would be SHA256(shared_key) and using some agreed upon length
l
.
- An example would be SHA256(shared_key) and using some agreed upon length
- Diffie-Hellman depends on the difficulty of the Discrete Logaraithm Problem (DLP), which is
- Given A = ga (mod p), it is computationally hard to calculate
a
knowing onlyA
,g
, andp
.
- Given A = ga (mod p), it is computationally hard to calculate
Additoinal Reading
- You may want to do some additional reading, here are a few topics to consider:
- Miller-Rabin Primality Test
- Eliptic Curves & Eliptic Curve Diffie-Hellman
- Modular Exponentiation
Very Important
- Algorithm using repeated squared simplifies programming modular exponentiation.
Next Post: The RSA Algorithm
In my next post, I will introduce the RSA algorithm, including the foundation mathematical concepts.