Skip to content

Latest commit

 

History

History
98 lines (68 loc) · 5.79 KB

cs-ruby-quicksort.md

File metadata and controls

98 lines (68 loc) · 5.79 KB

Quick Sort

Another very popular fast sorting algorithm, Quicksort has an interesting way of choosing an arbitrary element, called the pivot, and then recursively grouping values less than the pivot to the left and values greater to its right. There are a few rules to this algorithm that make it a little challenging to understand but we will lay them all out very clearly.

Let's start by looking at the key players in this sort:

  • left: A marker used for finding numbers lower than the pivot. It moves one element at a time from the leftmost position until it finds a value greater than or equal to the pivot value. If it ever reaches the rightmost edge of the array, it stops.
  • right: A marker used for finding numbers higher than the pivot. It moves one element at a time from the rightmost position until it finds a value less than the pivot. If it ever reaches the left marker, it stops.
  • pivot: An initially randomly chosen element in the array. It's value is meant to be the point around which things swap: higher numbers go to the right while lower numbers go to the left. Basically, when the right and left find their values that are lower or higher than the pivot, respectively, those values are swapped. Though it can be chosen randomly, many basic implementations simply use the leftmost or rightmost element as the initial pivot.

Example

[8, 5, 2, 7, 1, 9, 3, 6, 4]

If we choose the last element as the pivot by default (in this case, the number 4), we need to partition the array like so:

  left    pivot      right
[2, 1, 3], [4], [8, 5, 7, 9, 6]

In this case, the pivot (number 4) is considered sorted, and then we call quicksort on the remaining two partitions.

left
[2, 1, 3] <= choose 3 as the pivot
[1], [2], [3] <= partitioned

Now, the left partition pivot (number 3) is considered sorted. Since the left and right partitions are single-element arrays, they're sorted as well.

  left    pivot       right
[1, 2, 3], [4], [8, 5, 7, 9, 6]
Right partition
[8, 5, 7, 9, 6] <= choose 6 as the pivot
[5, 6, 7, 9, 8] <= partitioned
       |                <= now we need to partition again (this is the recursive step)
       V
left2       pivot2  right2
[5],         [6],    [7, 9, 8]

Now, the right partition pivot (number 6) is considered sorted. Since left2 is a single-element array, it's considered sorted as well. Now for right2.

left2 partition
[7, 9, 8] <= choose 8 as the pivot
[7, 8, 9] <= partitioned

Now left2 is sorted. Final right partition:
[5, 6, 7, 8, 9]

Combining the left partitions and right partitions give us:
[1, 2, 3, 4, 5, 6, 7, 8, 9]

Another good explanation (with visuals)

Algorithm Analysis

Quicksort is another O(n log(n)) algorithm similar to Merge Sort. Normally, it outperforms similarly efficient sorts like merge sort and heapsort but there are rare cases when it can perform as poorly as a quadratic like bubble sort. Usually this has to do with the data being in the exactly perfect wrong order which causes the algorithm to work to its maximum or beyond. Interestingly, the exact wrong order for Quicksort that causes it to degrade to O(n2) is an array that is already sorted!

Characteristics

  • Comparison Sort - compares values and swaps them
  • In-place - operates directly on the array argument
  • Unstable - does not preserve original ordering of ties

The Algorithm

Quicksort is a two-parter like merge sort. We have the actual quicksort function that someone would call to sort an array that they pass in as an argument. It operates a little like binarySearch: The first parameter is the array to be sorted, and the second and third parameters are where to start and where to stop for the sub-portion of the array on which this recursive step is operating. On the first invocation of quicksort you can use default values for these. The "left" marker will always start at 0 and the "right" should be set to the array's length - 1 if it isn't specified (if it is set to null).

  1. First, we make sure that the "left" or "low" index is less than the "right" or "high" index. If that is true, we call a partition function that will return a new pivot index.
  2. Then we call quicksort on the left half of the array (from "left" to "pivot") and also call quicksort on the right half (from "pivot + 1" to "right").
  3. The quicksort function returns whenever the "left" index ends up greater than or equal to the "right" index. It doesn't need to return any value because the array has been sorted in place.

The partition function does the heavy lifting. This partition scheme is called Hoare's Partition Scheme and he is the person who invented quicksort. It takes the same parameters as quicksort: the array, the left index and the right index.

  1. Get the value in the middle of the array and store this as the pivot value.
  2. Set i to be "left" and set j to be "right". (many uses of Hoare's partition use a do...while construct that is absent in python's standard library and will show i = left - 1 and j = right + 1... this will not work with this python implementation)
  3. Now inside of an infinite loop, do the following:
    1. Increment i by 1 while the value in the array at that index is less than the pivot value.
    2. Decrement j by 1 while the value in the array at that index is greater than the pivot value.
    3. If i is greater than or equal to j, return j as the new pivot. (This return is what will end the infinite loop.) Else, swap the values in the array at indices i and j.

Let's implement it!

Further Research Resources