Caesar cipher works by shifting the alphabet by a given number of positions to the left, wrapping around at the end.
The key for the cipher is how many positions the letters are shifted by.
For instance, if the key is 3 then a
is replaced by d
, b
by e
, etc. Here is the shift of the entire alphabet:
abcdefghijklmnopqrstuvwxyz
defghijklmnopqrstuvwxyzabc
In order to write a function shift
that shifts a letter
by a given number, we need to:
- Convert the letter to an integer number using
to-int
function. - Add n to it.
- Take the result modulo 26 (that would allow us to wrap around). For instance, if the letter is
x
(position 23 in the alphabet, wherea
is 0), and the shift is 3, the result would be 23 + 3 = 26. The largest letter isz
, at the position 25, so 26 should result in 0. Taking the result modulo 26 accomplishes this task. - After computing the position, we need to convert it back to a character by applying the
to-char
function that you wrote earlier.
The function that performs modulo arithmetic is mod
. Here are a few examples of how it works:
(mod 7 26) ; result: 7
(mod 27 26) ; result: 1
(mod 55 26) ; result: 3
(mod -5 26) ; result: 21
Exercise: Write a function shift
according to the description above. Some examples for how it should work:
(shift \a 3) ; result \d
(shift \b 20) ; result \v
(shift \z 3) ; result \c
Note: if you shift by a negative number, you are performing a reverse operation. For instance, (shift \d -3)
gives you \a
. Thus decryption is just using the same function, but with the opposite (negative) key.
Now you can "encrypt" a letter, but you probably want to encrypt words. If you were writing a program in python or Java, you probably would be thinking of writing a loop. However, in Clojure we use higher-order functions that traverse sequences for us, and we just need to specify what operation we would like to perform on each element.
map
and mapv
are such higher-order functions. They take in a sequence of elements and a function, and return a sequence that results from applying the given function to each element.
This sounds very abstract, so let's look at an example with mapv
. We use a function inc
(increment) that takes an integer and returns the next integer, i.e. (inc 1)
returns 2. now we are going to increment each element of a sequence of numbers
using mapv
:
(mapv inc [1 3 2]) ; returns [2 4 3]
Here [1 3 2]
and [2 4 3]
are vectors of numbers. This is the easiest way of giving Clojure a collection of elements in a specific order. What you get back is a vector in which each element of the given vector is incremented by 1.
The difference between map
and mapv
is that they return the result in a slightly different way: map
returns a sequence in its most general form, and mapv
returns its result as a particular sequential collection known as a vector. Vectors are slightly easier to work with for our examples, so we are using mapv
.
Exercise: What do you expect when you type in Clojure REPL?
(mapv to-int [\a \b \c])
Try it, see if the result is what you were expecting. If it's not, make sure to understand what it is and why.
Note that you can also apply mapv
to a string:
(mapv to-int "abc")
The result is a vector of numbers.
Exercise: Copy the following definition of the function
square
into the definitions panel of Nightcode (right upper panel):
(defn square
"Takes a number and returns its square"
[x]
(* x x))
Reload the file. Now in the REPL panel type in an expression, using mapv
, that computes the squares of numbers [1 3 -2]
.
Maps are often used together with anonymous functions. These
are one-time-use functions that are put together "on the fly" and not given a name. they also don't given names to their parameters, referring to them as %1, %2, %3
- or just %
if there is only one.
They are often used with higher-order functions, such as mapv
. Here is an example:
(mapv #(* % %) [1 3 -2])
This returns [1 9 4]
(the vector of squares of all given numbers, just like in the exercise above).
The anonymous function passed to the map
is #(* % %)
. It is equivalent to the square
function above. The %
sign here refers to the parameter of the function, it is used instead of x
. The #
in front of the expression indicates that this is a function.
Exercise: Use mapv
and an anonymous function to take the opposite of each number in a given vector. For instance, if the vector is [2 -1 0 3]
, the result would be [-2 1 0 -3]
.
mapv
returns its result as a vector, but it would be really useful to get it as a string. The conversion is non-obvious, and you can skip the explanations of how it works. Here is the code for it:
(apply str [\w \o \r \d]) ; results in a string "word"
Explanations of apply
(you can skip this):
Here is the clojuredocs description of apply
The function takes a vector and passes individual elements of it to a function, as if they were written separately, and not in a vector. For instance, (+ [1 2 3])
is an error since [1 2 3]
is a vector, and +
doesn't work on vectors. What we want is (+ 1 2 3)
, that's a valid summation and results in 6
. Using
(apply + [1 2 3])
works exactly like this: it passes the three arguments to +
individually, and not as a vector.
Now we are done with the nitty-gritty details for our ciphers, and are ready to do some encryption and decryption.
Now we can encrypt words with Caesar cipher. Let's say we want to encrypt the word "apple" by shifting the alphabet by 20. We need to do the following steps:
- use
mapv
to shift each letter in the sequence by 20 positions; we can write the actual shifting as an anonymous function that uses the functionshift
that we wrote earlier. - use
apply str
to convert the result from a sequence to a string.
Feel free to write this out on paper or in Nightcode before you look at the solution below.
(def s (mapv #(shift % 20) s)) ; encrypt the sequence
(def result (apply str s)) ; convert to a string
The result, "ujjfy"
, is what the encryption of "apple" with the key 20
.
Instead of saving intermediate results in variables, you can also write all the steps in one line of code:
(apply str (mapv #(shift % 20) "apple"))
The latter style is more common in Clojure.
Of course, we want to encrypt different words, not just "apple", and use keys other than 20
. Thus, we want to write
a function that takes a word and a number k
, and shifts the word by k
. Here k
serves as a key for the cipher.
Exercise Below is the start of a function that encrypts
a word w
with a key k
. Fill in the body of the function and test it on some examples.
(defn caesar-encrypt
"encrypting a word w with a key k using Caesar cipher"
[w k]
)
Don't forget to write all functions in the right upper panel of Nightcode, save and reload file, and test the function in the REPL.
Make sure that (caesar-encrypt "apple" 20)
returns the same result as the expression that you wrote
earlier, and that passing different words (all lower-case letters, no spaces) and different keys gives you different encryptions.
Encryption is good only if we can later decrypt the text.
Exercise Based on the function caesar-encrypt
, write a
function caesar-decrypt
that takes an encrypted word (all lower-case, no spaces or other symbols) and a key and returns
its decryption. Recall how we can use the same shift
function for decryption.
Test that (caesar-decrypt "ujjfy" 20)
returns "apple"
.
Then try your decryption on the following:
(caesar-decrypt "gtxyts" 5)
(caesar-decrypt "mvytebolbsnqo" 10)
Exercise Encrypt your own examples and post them on slack (with the key), then try to decrypt other participants' examples posted there. Before you post your own, make sure they decrypt correctly.
Encryption is not particularly helpful if it preserves capitalization, punctuation, spaces between the words, and similar things that reveal a lot about the text. Thus, in order to encrypt text we will remove all the symbols other than letters and will convert all letters to lowercase.
Relevant Java functions: Java toLowerCase string method (note that Java functions are commonly refer to as methods).
One of the advantages of using Clojure is that one can use all available Java functions and libraries directly from Clojure. For instance, we can use Java toLowerCase
method of the String
class to convert a string into all lowercase letters.
The syntax for using a Java method on an object is to put the method name into the prefix position (just like any Clojure function) and to precede its name with a dot:
(.toLowerCase "What is Clojure?") ; results in "what is clojure?"
Feel free to play with other Java methods for strings: Java 8 String methods.
Relevant functions on clojuredocs: filter, filterv, odd?
Relevant Java functions: isLetter
Now we are going to use another Clojure higher-level function, filterv
, to remove all the non-letter character from a string. It takes a function that returns a true/false value and a vector, and returns a new vector with only those elements of the given one for which the function returned a true value.
For example, we can use a function odd?
that works as follows: odd? 5
returns true
, odd? 4
returns false
.
If we want to keep only odd integers from a given sequence,
we can use filterv
with odd?
:
filterv odd? [6 7 -1 0 5]) ; results in [7 -1 5]
Note that filterv
is a vector analog of a more common (but less convenient in our case) function filter
, just like mapv
is a vector analog of map
.
Just like mapv
, filterv
can also take an anonymous function:
(filterv #(< % 5) [3 6 5 8 0]) ; results in [3 0]
The anonymous function #(< % 5)
returns true if its argument is strictly less than 5
and false otherwise.
We will be using a Java method of the Character class isLetter
to check if a character is a letter. There is a slight difference in how this method is defined in Java: it's method not attached to any object, just to the Character class (it's a so-called static method), and so the syntax for calling it is a bit different:
(Character/isLetter \a) ; true
(Character/isLetter \?) ; false
Exercise: write a function get-letters
that takes a string with any symbols in it, and returns a string of of
only letters in it, all letters converted to lowercase, as in the example below:
(get-letters "Hello, friend!") ; "hellofriend"
The sequences of steps that the function needs to perform is:
- Convert the string to lowercase letters using
toLowerCase
(note: this function works on a string) - filter out non-letter characters using
filterv
- Convert the result back to a string using
apply str
.
You might want to first try out the steps in REPL, and then put it all together in a function.
Now you are ready to do encryption and decryption with Caesar cipher on entire strings of text. The result would be all lowercase with no punctuation marks, but still readable.
The sequence of steps for encryption would require you to:
- Use
get-letters
to get a string only letters (in lower case) from the text that you are trying to encrypt. - Encrypt this string using your
caesar-encrypt
function.
As a test example, "Hello, friend!"
with the key 5 encrypts to "mjqqtkwnjsi"
.
Decryption doesn't require filtering out other symbols and converting to lowercase since encrypted strings are already of the right format, so you can use your caesar-decrypt
function.
Try decrypting the following:
TO-DO: add
Encryption and decryption is easy to do if you know the key (the amount of alphabet shift). But what do you do if you don't know it? The next section shows you how you can break Caesar cipher without a key using Clojure hashmaps.
Next: Breaking Caesar cipher: hashmaps
Previous: Clojure data types and functions