Another very popular fast sorting algorithm, Quicksort has an interesting way of choosing an arbitrary element, called the pivot
, and then recursively grouping values less than the pivot to the left and values greater to its right. There are a few rules to this algorithm that make it a little challenging to understand but we will lay them all out very clearly.
Let's start by looking at the key players in this sort:
- left: A marker used for finding numbers lower than the pivot. It moves one element at a time from the leftmost position until it finds a value greater than or equal to the pivot value. If it ever reaches the rightmost edge of the array, it stops.
- right: A marker used for finding numbers higher than the pivot. It moves one element at a time from the rightmost position until it finds a value less than the pivot. If it ever reaches the left marker, it stops.
- pivot: An initially randomly chosen element in the array. It's value is meant to be the point around which things swap: higher numbers go to the right while lower numbers go to the left. Basically, when the right and left find their values that are lower or higher than the pivot, respectively, those values are swapped. Though it can be chosen randomly, many basic implementations simply use the leftmost or rightmost element as the initial pivot.
Example
[8, 5, 2, 7, 1, 9, 3, 6, 4]
If we choose the last element as the pivot by default (in this case, the number 4), we need to partition the array like so:
left pivot right
[2, 1, 3], [4], [8, 5, 7, 9, 6]
In this case, the pivot (number 4) is considered sorted, and then we call quicksort on the remaining two partitions.
left
[2, 1, 3] <= choose 3 as the pivot
[1], [2], [3] <= partitioned
Now, the left partition pivot (number 3) is considered sorted. Since the left and right partitions are single-element arrays, they're sorted as well.
left pivot right
[1, 2, 3], [4], [8, 5, 7, 9, 6]
Right partition
[8, 5, 7, 9, 6] <= choose 6 as the pivot
[5, 6, 7, 9, 8] <= partitioned
| <= now we need to partition again (this is the recursive step)
V
left2 pivot2 right2
[5], [6], [7, 9, 8]
Now, the right partition pivot (number 6) is considered sorted. Since left2 is a single-element array, it's considered sorted as well. Now for right2.
left2 partition
[7, 9, 8] <= choose 8 as the pivot
[7, 8, 9] <= partitioned
Now left2 is sorted. Final right partition:
[5, 6, 7, 8, 9]
Combining the left partitions and right partitions give us:
[1, 2, 3, 4, 5, 6, 7, 8, 9]
Another good explanation (with visuals)
Quicksort is another O(n log(n)) algorithm similar to Merge Sort. Normally, it outperforms similarly efficient sorts like merge sort and heapsort but there are rare cases when it can perform as poorly as a quadratic like bubble sort. Usually this has to do with the data being in the exactly perfect wrong order which causes the algorithm to work to its maximum or beyond. Interestingly, the exact wrong order for Quicksort that causes it to degrade to O(n2) is an array that is already sorted!
- Comparison Sort - compares values and swaps them
- In-place - operates directly on the array argument
- Unstable - does not preserve original ordering of ties
Quicksort is a two-parter like merge sort. We have the actual quicksort
function that someone would call to sort an array that they pass in as an argument. It operates a little like binarySearch
: The first parameter is the array to be sorted, and the second and third parameters are where to start and where to stop for the sub-portion of the array on which this recursive step is operating. On the first invocation of quicksort
you can use default values for these. The "left" marker will always start at 0 and the "right" should be set to the array's length - 1 if it isn't specified (if it is set to null).
- First, we make sure that the "left" or "low" index is less than the "right" or "high" index. If that is true, we call a
partition
function that will return a new pivot index. - Then we call
quicksort
on the left half of the array (from "left" to "pivot") and also callquicksort
on the right half (from "pivot + 1" to "right"). - The
quicksort
function returns whenever the "left" index ends up greater than or equal to the "right" index. It doesn't need to return any value because the array has been sorted in place.
The partition
function does the heavy lifting. This partition scheme is called Hoare's Partition Scheme and he is the person who invented quicksort. It takes the same parameters as quicksort: the array, the left index and the right index.
- Get the value in the middle of the array and store this as the pivot value.
- Set
i
to be "left" and setj
to be "right". (many uses of Hoare's partition use ado...while
construct that is absent in python's standard library and will showi = left - 1
andj = right + 1
... this will not work with this python implementation) - Now inside of an infinite loop, do the following:
- Increment
i
by 1 while the value in the array at that index is less than the pivot value. - Decrement
j
by 1 while the value in the array at that index is greater than the pivot value. - If
i
is greater than or equal toj
, returnj
as the new pivot. (This return is what will end the infinite loop.) Else, swap the values in the array at indicesi
andj
.
- Increment