Concept#
Introduction#
Binary Search is an exceptionally efficient search algorithm used ubiquitously in computer science. At its heart, binary search is a divide-and-conquer strategy that halves the search space with each iteration, providing a search efficiency that is logarithmic in the size of the dataset.
In binary search, the dataset must be in a sorted format, as the algorithm operates by repeatedly dividing the search interval in half. It starts with an interval covering the whole array, and if the value of the search key is less than the middle element of the interval, it narrows the interval to the lower half. Otherwise, it narrows it to the upper half. Repeatedly, this process continues until the value is found or the interval is empty.
Binary search can be used not only to find whether a specific value exists in a sorted array but also to find the position of an insert for a new value to keep the array sorted. Because of this and its impressive efficiency, it underpins many fundamental computer science algorithms and data structures, such as binary search trees and B-trees.
Applications of binary search in computer science are many and varied:
Search Algorithms: Binary search is one of the fundamental search algorithms, applied when dealing with sorted lists or arrays to find the existence and position of a particular value.
Database and File-System Indexing: Many database indexing techniques use structures like B-trees and binary search trees, which at their core use a binary search approach for efficiently locating records.
Numerical Computation: Binary search is used in numeric computations for tasks like root-finding or for implementing numerical optimization algorithms.
Version Control Systems: In software version control systems, binary search is used in “bisecting”, which is a method to find which commit introduced a particular bug.
Memory Management: In memory management tasks, binary search is often used to search in address spaces and page tables in a fast and efficient way.
Intuition#
Assume we have a sorted list of numbers, denoted \(\mathcal{A}\), of size \(N\) and we want to find a particular number in the list, called the target \(\tau\).
Then, we know from Linear Search that we still have to traverse each element until we find the target \(\tau\), or reach the end of the list. This yields a time complexity of \(\mathcal{O}(N)\), with \(N\) being the size of the list.
However, if the list is sorted, we can leverage a strategy known as divide and conquer to improve our search speed significantly. This is where Binary Search comes into play.
The principle behind binary search is that we continually halve our search space, which we denote as \(\mathcal{S}\) (to be conceptualized later). We begin with the whole list, then determine whether our target value is in the first or the second half by comparing it with the middle element. If our target is less than the middle element, it must lie in the first half of the list, so we can immediately discard the second half. Conversely, if our target is larger than the middle element, it must reside in the second half, and we discard the first half.
This process continues until we either find our target or our search space \(\mathcal{S}\) is exhausted, i.e., there’s no element left to check. Each iteration effectively halves the problem size, resulting in a logarithmic time complexity of \(\mathcal{O}(\log N)\). This makes binary search markedly more efficient than linear search, particularly in large, sorted datasets.
A neat connection exists between binary search and the concept of a Binary Search Tree (BST). In a BST, each node has a value larger than all the values in its left subtree and smaller than all the values in its right subtree. This is akin to how binary search operates, dividing the list into two halves, with one half always less than and the other always greater than the middle. Indeed, if you perform an inorder traversal of a BST (left, root, right), you retrieve the elements in sorted order, just as you would have them arranged for a binary search. So the middle number of the sorted list is the “root” of the BST, and the left and right halves of the list are the left and right subtrees of the root. This is a great way to visualize the process of binary search and how it operates.
In what follows, we conceptualize the intuition via examples and visualizations, and lastly through defining the algorithm rigorously.
Example#
Consider the following sorted list of numbers \(\mathcal{A}\):
and we seek to find if the number \(23\) exists in the list.
Visualization#
We create a tabbed SVG viewer to visualize the binary search algorithm in the explanation below.
1# List of SVG image paths
2svg_images = [
3 "./assets/binary-search-0.svg",
4 "./assets/binary-search-1.svg",
5 "./assets/binary-search-2.svg",
6 "./assets/binary-search-3.svg",
7]
8
9tab_titles = ["Step 1", "Step 2", "Step 3", "Step 4"]
10
11tab = create_tabbed_svg_viewer(svg_images, tab_titles)
12display(tab)
Explanation#
We start with the entire list \(\mathcal{A}\), which is our search space \(\mathcal{S}\).
In particular, the initial search space can be solely determined by two pointers (indexes) \(\ell\) and \(r\), where \(\ell\) is the leftmost index and \(r\) is the rightmost index of the array. This means that we are searching for the target value \(\tau\) in the range \([\ell, r]\).
\(\ell = 0\) since indexing in python starts from 0,
\(r = N - 1 = 10 - 1 = 9\)
We compute the middle index \(m\) of the search space \(\mathcal{S}\), which is the index of the middle element of the array.
In this case, \(m = 4\) because our mid strategy uses the floor function:
\[\begin{split} \begin{aligned} m &= \left \lfloor \frac{0 + 9}{2} \right \rfloor \\ &= \left \lfloor \frac{9}{2} \right \rfloor \\ &= 4. \end{aligned} \end{split}\]Now one may ask why do we use the floor function and not the ceiling function? It really depends on how you update the boundary indexes \(\ell\) and \(r\). See a discussion here to understand in details.
At this junction we are at step 2 in the visualization above. The middle index is our pivot point that halves the search space \(\mathcal{S}\) into two subspaces, namely the left and right subspaces.
We now compare the middle element \(\mathcal{A}_m\) with the target value \(\tau\). We will have 3 cases:
\(\mathcal{A}_m = \tau\): We found the target value \(\tau\) at index \(m\) in the array \(\mathcal{A}\), so we return \(m\).
\(\mathcal{A}_m < \tau\): The target value \(\tau\) is greater than the middle element \(\mathcal{A}_m\), so we discard the left half of the array \(\mathcal{A}\) and search the right half.
\(\mathcal{A}_m > \tau\): The target value \(\tau\) is less than the middle element \(\mathcal{A}_m\), so we discard the right half of the array \(\mathcal{A}\) and search the left half.
And in our case, our value at index \(m=4\) is \(\mathcal{A}_m=16\) and \(\mathcal{A}_m < \tau = 23\), so we discard the left half of the array \(\mathcal{A}\) and search the right half. When we discard the left half, we want to also discard the middle element \(\mathcal{A}_m\) because we know that it is not the target value \(\tau\).
So we update the left index \(\ell\) to be
\[\ell \leftarrow m + 1 = 4 + 1 = 5.\]There is no need to update the right index \(r\) because we know that the target value \(\tau\) is in the right half of the array \(\mathcal{A}\), if it exists.
By just shifting the boundary index(es), we have also reduced our search space \(\mathcal{S}\).
In this case, our search space \(\mathcal{S}\) is now the right half of the array \(\mathcal{A}\), which is \([\ell, r] = [5, 9]\).
We now compute the middle index \(m\) of the search space \(\mathcal{S}\), which is the index of the middle element of the array.
In this case, \(m = 7\) because our mid strategy uses the floor function:
\[\begin{split} \begin{aligned} m &= \left \lfloor \frac{5 + 9}{2} \right \rfloor \\ &= \left \lfloor \frac{14}{2} \right \rfloor \\ &= 7. \end{aligned} \end{split}\]And in our case, our value at index \(m=7\) is \(\mathcal{A}_m=56\) and \(\mathcal{A}_m > \tau = 23\), so we discard the right half of the array \(\mathcal{A}\) and search the left half. When we discard the right half, we want to also discard the middle element \(\mathcal{A}_m\) because we know that it is not the target value \(\tau\).
So we update the right index \(r\) to be
\[r \leftarrow m - 1 = 7 - 1 = 6.\]There is no need to update the left index \(\ell\) because we know that the target value \(\tau\) is in the left half of the array \(\mathcal{A}\), if it exists.
We continue searching until we find the target value \(\tau\) or the search space \(\mathcal{S}\) is exhausted, i.e., there’s no element left to check.
In this case, our search space \(\mathcal{S}\) is now the left half of the array \(\mathcal{A}\), which is \([\ell, r] = [5, 6]\).
We now compute the middle index \(m\) of the search space \(\mathcal{S}\), which is the index of the middle element of the array.
In this case, \(m = 5\):
\[\begin{split} \begin{aligned} m &= \left \lfloor \frac{5 + 6}{2} \right \rfloor \\ &= \left \lfloor \frac{11}{2} \right \rfloor \\ &= 5. \end{aligned} \end{split}\]And in our case, our value at index \(m=5\) is \(\mathcal{A}_m=23\) and \(\mathcal{A}_m = \tau = 23\), so we return \(m=5\).
We found the target value \(\tau\) at index \(m\) in the array \(\mathcal{A}\), so we return \(m\).
Now, let’s visualize this. Here’s a table that shows the state of the variables at each step:
Step |
\(\ell\) |
\(r\) |
\(m\) |
\(\mathcal{A}_m\) |
Target \(\tau\) |
Action |
---|---|---|---|---|---|---|
1 |
0 |
9 |
4 |
16 |
23 |
\(\mathcal{A}_m < \tau\), so \(\ell = m + 1\) |
2 |
5 |
9 |
7 |
56 |
23 |
\(\mathcal{A}_m > \tau\), so \(r = m - 1\) |
3 |
5 |
6 |
5 |
23 |
23 |
\(\mathcal{A}_m = \tau\), so return \(m\) |
At the start, we’re looking at the full list and the search space \(\mathcal{S}\) is defined by the initial boundary indexes \(\ell = 0\) and \(r=9\). At each step, we adjust \(\ell\) or \(r\) to reduce the search space \(\mathcal{S}\) and close in on the target number based on whether it’s greater than or less than the middle element. Finally, when we find the target number, we return the index.
(What if we want to find 39?)
Something we did not touch on is the terminating condition. If we were to find a non-existing target \(\tau = 39\), we would eventually reach a point where \(\ell > r\). Let’s continue from the previous example.
Recall we found our index at \(\ell = 5\) and \(r = 6\) to host the target \(\tau=23\). Now assume our target is \(\tau = 39\). We would have the following:
Step |
\(\ell\) |
\(r\) |
\(m\) |
\(\mathcal{A}_m\) |
Target \(\tau\) |
Action |
Search Space \(\mathcal{S}\) |
---|---|---|---|---|---|---|---|
1 |
0 |
9 |
4 |
16 |
39 |
\(\mathcal{A}_m < \tau\), so \(\ell = m + 1\) |
\(\mathcal{S} = [0, 9]\) |
2 |
5 |
9 |
7 |
56 |
39 |
\(\mathcal{A}_m > \tau\), so \(r = m - 1\) |
\(\mathcal{S} = [5, 9]\) |
3 |
5 |
6 |
5 |
23 |
39 |
\(\mathcal{A}_m < \tau\), so \(\ell = m + 1\) |
\(\mathcal{S} = [5, 6]\) |
4 |
6 |
6 |
6 |
38 |
39 |
\(\mathcal{A}_m < \tau\), so \(\ell = m + 1\) |
\(\mathcal{S} = [6, 6]\) |
5 |
7 |
6 |
? |
? |
39 |
Terminating condition since \(\ell > r\) |
\(\mathcal{S} = [7, 6] = \emptyset\) |
We have seen that the seach space \(\mathcal{S}\) is reduced to \([6, 6]\) and we cannot reduce it further. At step 4, where the search space \(\mathcal{S}\) is reduce to \([6, 6]\), we have how many elements in the search space? Only 1! So if that element is not the target \(\tau\), then we know that the target \(\tau\) does not exist in the array \(\mathcal{A}\). This is the terminating condition.
Assumptions and Constraints#
Assumptions#
Sorted Array: We assume the input array is already sorted in ascending order, as binary search is not suitable for unsorted arrays.
Unique Elements: As mentioned, the array consists of unique elements. This implies that the return index for any valid target is unique.
Deterministic: The input array does not change during the course of the algorithm, and there are no external factors affecting its content.
Constraints#
The constaints/assumptions are made below, we follow the same set of assumptions from LeetCode’s Binary Search with some modifications (for clarity):
The array must be sorted in ascending order.
The array consists of unique elements of type
int
with base 10.Array Length: The length of the array
nums
will always be in the range1 <= len(nums) <= 10^4
.Array Content: Every element in the array, as well as the target value, is guaranteed to be in the range
-10^4 <= nums[i], target <= 10^4
.Return Value: The function should return the index of the
target
if it exists innums
and-1
if it doesn’t.
Why Left + Right // 2 May Cause Overflow?#
One of the constraints listed is that the array values are guaranteed to be in
the range -10^4 <= nums[i], target <= 10^4
. This means that the maximum value
of the array elements is \(10^4\). Why does it matter?
In programming, when two integers are added, and the result goes beyond the maximum value that can be stored in that integer type, it’s referred to as an integer overflow.
Consider a 32-bit signed integer in most languages. The largest value it can hold is \(2^{31} - 1\), because one bit is reserved for the sign. When you add two large numbers, their sum might exceed this value, causing an overflow.
Now, let’s explore the mid calculation:
If \(\ell\) and \(r\) are very large (close to the maximum value for the integer type), their sum can overflow, even though their average (mid value) might still be within the valid range.
Example: Let’s take a hypothetical situation where the maximum value an integer can store is \(15\) (in reality, this is much larger, but we’re simplifying for the sake of illustration).
Suppose \(\ell = 7\) and \(r = 14\). When you add these together, you get \(\ell + r = 21\), which exceeds our hypothetical maximum value of \(15\). Hence, an overflow occurs.
However, if we use the safer approach:
With our values, we’d get:
This approach avoids overflow because \(\ell\) and \(\left\lfloor \frac{r - l}{2} \right\rfloor\) will always be valid integers and their sum will be too.
In real-world applications, this is relevant for very large arrays when using
32-bit integers. Using 64-bit integers (like long long
in C++ or int64
in
some other languages) delays the point where overflow might occur, but the
principle remains. The alternate method for calculating mid is a safer approach
that avoids the potential overflow issue altogether.
However, for clarity, we will interchangeably use the original mid calculation in our explanation(s) below.
Test Cases#
To check the correctness of the algorithm, we will use the following test cases throughout this article to ensure that the algorithm is implemented correctly.
Standard Test Cases#
Standard Test Case
container = [1, 3, 5, 7, 9] target = 5 # Expected Output: 2 (since container[2] = 5)
Target at Start
container = [1, 3, 5, 7, 9] target = 1 # Expected Output: 0 (since container[0] = 1)
Target at End
container = [1, 3, 5, 7, 9] target = 9 # Expected Output: 4 (since container[4] = 9)
Target Not in List
container = [1, 3, 5, 7, 9] target = 4 # Expected Output: -1 (since 4 is not in the container)
Large Numbers in Array
container = [10000, 20000, 30000, 40000] target = 30000 # Expected Output: 2 (since container[2] = 30000)
Edge Cases#
Empty List
Although the problem constraints state the minimum size of the array is 1, it’s always a good idea to consider what would happen with an empty array.
container = [] target = 1 # Expected Output: -1 (since the array is empty)
Single Element Array
container = [3] target = 3 # Expected Output: 0 (since container[0] = 3)
Single Element Array, Target Not Present
container = [3] target = 4 # Expected Output: -1 (since 4 is not in the container)
Algorithm (Iterative + Exact Match)#
The algorithm discussed above is recognized as the most generic and precisely fitting version according to LeetCode. For more details, you can refer to their explanation under the “Template I” section of binary search, available at this link.
In what follows we formalize the algorithm in pseudocode and mathematical notation.
Pseudocode#
(Pseudocode)
Algorithm: binary_search_iterative(A, t)
Input: A = [a_0, a_1, ..., a_{N-1}] (sorted list of elements),
t (target value)
Output: Index of t in A or -1 if not found
1: Set l = 0 and r = N - 1 // (Initialization)
2: while l <= r (Iterative Procedure)
3: m = l + (r - l) // 2 // (Calculate mid-point)
4: if A[m] == t then
5: return m // (Target found)
6: elif A[m] < t then
7: l = m + 1 // (Discard left half)
8: else
9: r = m - 1 // (Discard right half)
10: end while
11: return -1 // (Target not found)
Mathematical Representation (Iterative)#
(Mathematical Representation (Iterative))
Given a sorted list \(\mathcal{A}\) of \(N\) elements \(\mathcal{A}_0, \mathcal{A}_1, \ldots, \mathcal{A}_{N-1}\), sorted such that \(\mathcal{A}_0 < \mathcal{A}_1 < \ldots < \mathcal{A}_{N-1}\), and a target value \(\tau\), the following subroutine uses binary search to find the index of \(\tau\) in \(\mathcal{A}\) using an iterative approach.
The search space \(\mathcal{S}\), initially the entire array \(\mathcal{A}\), is iteratively refined. At any iteration, \(\mathcal{S}\) can be defined as the subset of \(\mathcal{A}\) bounded by indices \(\ell\) (lower bound) and \(r\) (upper bound), formally we define \(\mathcal{S}\):
where \(\ell = 0\) and \(r = N - 1\) at the start of the algorithm.
Binary search progresses as follows:
Initialization:
Establish initial boundaries of the search space: Set \(\ell = 0\) (start of the array) and \(r = N - 1\) (end of the array). This initial setting implies \(\mathcal{S} = \mathcal{A}\), encompassing the entire array.
Iterative Procedure:
Execute a loop while \(\ell \leq r\), indicating that \(\mathcal{S}\) is non-empty:
Calculate the mid-point index \(m = \ell + \left \lfloor \frac{r - \ell}{2} \right \rfloor\). This divides \(\mathcal{S}\) into two halves.
If \(\mathcal{A}_m = \tau\), return \(m\). The target \(\tau\) is found at index \(m\) within \(\mathcal{S}\).
If \(\mathcal{A}_m < \tau\), redefine \(\ell = m + 1\). This action discards the left half of \(\mathcal{S}\), focusing the search on the right half. Consequently, \(\mathcal{S}\) is redefined as \(\mathcal{S} := \mathcal{A}[m + 1:r]\).
If \(\mathcal{A}_m > \tau\), redefine \(r = m - 1\). This discards the right half of \(\mathcal{S}\), focusing the search on the left half. Consequently, \(\mathcal{S}\) is redefined as \(\mathcal{S} := \mathcal{A}[\ell:m - 1]\).
In each iteration, \(\mathcal{S}\) is effectively halved, narrowing down the potential location of \(\tau\).
Termination:
The loop exits when \(\ell > r\), meaning \(\mathcal{S}\) becomes empty. At this point, if \(\tau\) has not been found, return \(-1\) to indicate that \(\tau\) is not present in \(\mathcal{A}\).
Correctness#
For the correctness proof of the iterative algorithm, we can trivially prove it via loop invariants. The loop invariant is a condition that is true before and after each iteration of a loop. One can refer to the section on Correctness from Linear Search for more details.
Implementation#
class IterativeBinarySearchExactMatch(Search):
def search(
self, container: Sequence[Real], target: Real
) -> Union[NonNegativeInt, Literal[-1]]:
"""Search for a target from a sorted array container."""
left_index = 0
right_index = len(container) - 1
while left_index <= right_index:
mid_index = self.mid_strategy(left=left_index, right=right_index)
if container[mid_index] == target:
return mid_index
elif container[mid_index] < target:
left_index = mid_index + 1
else:
right_index = mid_index - 1
return -1
def mid_strategy(
self, left: NonNegativeInt, right: NonNegativeInt
) -> NonNegativeInt:
"""Calculate the mid index of the search space."""
mid_index = left + math.floor((right - left) / 2)
return mid_index
Tests#
1strategy = IterativeBinarySearchExactMatch()
2context = SearchContext(strategy)
3result = context.execute_search([2, 5, 8, 12, 16, 23, 38, 56, 72, 91], 23)
4assert result == 5
1tf = TestFramework()
2
3binary_search = IterativeBinarySearchExactMatch().search
4
5@tf.describe("Testing Iterative Binary Search for Exact Match")
6def test_binary_search() -> None:
7
8 @tf.individual_test("Standard Test Case")
9 def _() -> None:
10 tf.assert_equals(
11 binary_search([1, 3, 5, 7, 9], 5),
12 2,
13 "Should return 2"
14 )
15
16 @tf.individual_test("Target at Start")
17 def _() -> None:
18 tf.assert_equals(
19 binary_search([1, 3, 5, 7, 9], 1),
20 0,
21 "Should return 0"
22 )
23
24 @tf.individual_test("Target at End")
25 def _() -> None:
26 tf.assert_equals(
27 binary_search([1, 3, 5, 7, 9], 9),
28 4,
29 "Should return 4"
30 )
31
32 @tf.individual_test("Target Not in List")
33 def _() -> None:
34 tf.assert_equals(
35 binary_search([1, 3, 5, 7, 9], 4),
36 -1,
37 "Should return -1"
38 )
39
40 @tf.individual_test("Large Numbers in Array")
41 def _() -> None:
42 tf.assert_equals(
43 binary_search([10000, 20000, 30000, 40000], 30000),
44 2,
45 "Should return 2"
46 )
47
48 @tf.individual_test("Empty List")
49 def _() -> None:
50 tf.assert_equals(
51 binary_search([], 1),
52 -1,
53 "Should return -1 (empty array)"
54 )
55
56 @tf.individual_test("Single Element Array")
57 def _() -> None:
58 tf.assert_equals(
59 binary_search([3], 3),
60 0,
61 "Should return 0"
62 )
63
64 @tf.individual_test("Single Element Array, Target Not Present")
65 def _() -> None:
66 tf.assert_equals(
67 binary_search([3], 4),
68 -1,
69 "Should return -1"
70 )
Time Complexity#
Recall the earlier sorted array \(\mathcal{A} = [2, 5, 8, 12, 16, 23, 38, 56, 72, 91]\)? We took only \(3\) iterations to find the target value \(23\) in the array. How did we get to this result? If we were to use a sequential search, it will take \(6\) iterations!
We will proceed to prove that the binary search algorithm takes \(\mathcal{O}(\log_2 N)\) time to search for an item in a list of \(N\) items, which means the maximum number of comparisons is in a logarithmic relationship to the number of items in the list [Runestone Interactive, 2023].
Formal Notation of Time Complexity
More formally, we aim to establish that the number of comparisons required by the binary search algorithm to find an item in a sorted list of \(N\) elements is bounded above by a function that grows logarithmically with respect to \(N\). Formally, we denote the time complexity function of binary search as \(\mathcal{T}_{\text{BS}}(N)\) and assert that \(\mathcal{T}_{\text{BS}}(N) \in \mathcal{O}(g(N))\), where \(g(N) = \log_2 N\). This inclusion in \(\mathcal{O}(g(N))\) is defined as:
In this context, it means there exist constants \(C > 0\) and \(N_0 \in \mathbb{N}\) such that for all \(N \geq N_0\), the number of comparisons made by the binary search algorithm, \(\mathcal{T}_{\text{BS}}(N)\), is bounded above by \(C \cdot \log_2 N\).
To analyze the binary search algorithm, we need to recall that each comparison eliminates about half of the remaining items from consideration. What is the maximum number of comparisons this algorithm will require to check the entire array \(\mathcal{A}\)?
Let’s say we have an array of \(N\) elements.
The first comparison eliminates about half of the remaining items from consideration. Thus, after the first comparison, we have about \(\frac{N}{2}\) elements left.
The second comparison eliminates about half of the remaining items from consideration. Thus, after the second comparison, we have about \(\frac{N}{2^2}\) elements left.
The third comparison eliminates about half of the remaining items from consideration. Thus, after the third comparison, we have about \(\frac{N}{2^3}\) elements left.
The \(k\)-th comparison eliminates about half of the remaining items from consideration. Thus, after the \(k\)-th comparison, we have about \(\frac{N}{2^k}\) elements left.
Note that we say approximately/about because the number of elements left after
the \(i\)-th comparison is not always “half”. Using back the same example
previously, if we have an array of \(10\) elements, and we want to find \(23\), then
after the first comparison, we discard the first half of the array,
[2, 5, 8, 12, 16]
, and we have [23, 38, 56, 72, 91]
left. This is indeed
\(\frac{N}{2} = \frac{10}{2} = 5\) elements left. However, after the second
comparison, we discard the second half of the array, [56, 72, 91]
, and we have
[23, 38]
left. This is now \(2\) elements left, which is not exactly half of the
remaining items.
Comparisons |
Approximate number of items left |
---|---|
\(i = 1\) |
\(\frac{N}{2}\) |
\(i = 2\) |
\(\frac{N}{2^2}\) |
\(i = 3\) |
\(\frac{N}{2^3}\) |
\(\ldots\) |
\(\ldots\) |
\(i = k\) |
\(\frac{N}{2^k}\) |
If we split the container/list enough times, eventually we will have only one item left [Runestone Interactive, 2023]. The last item is either the target value or it is not.
So our stopping condition is when the number of items left is \(1\). Consequently, we solve for \(k\) in the equation \(\frac{N}{2^k} = 1\):
This means that the maximum number of comparisons is \(\log_2 N\). In other words, after approximately \(\log_2 N\) comparisons, we can reduce the size of the list to \(1\) and since we will not be able to divide the list any further, we can conclude that the target value is either the last element or it is not in the list, ending the search.
The time complexity table is listed below, the best case is \(\mathcal{O}(1)\) for the same reason as the sequential search algorithm, where the target element \(\tau\) is in the middle, and we just need to make one comparison. For the worst case, the element is either in the first or last index, or it is not in the list at all. In this case, we need to make \(\log_2 N\) comparisons.
Space Complexity#
We break down the space complexity of binary search into three parts: input, auxiliary and total space complexity.
Input Space Complexity#
Input space complexity is the space used to store the input to the problem. For a binary search, the input is the array or list we are searching through, and a target value. We do not usually consider the space taken by the inputs when analyzing the space complexity of an algorithm, unless the algorithm modifies the input in place.
Regardless, for the sake of completeness, we say that the input space is the space taken by the inputs, which is parameterized by the length of the array \(\mathcal{A}\) and the target value \(\tau\). In this case, the input space complexity is
where \(N\) is the length of the array and \(\mathcal{O}(1)\) is the space taken by the target value.
Auxiliary Space Complexity#
Auxiliary space complexity is the extra space or the temporary space used by an
algorithm. In the case of binary search, we only need three variables to hold
the left, right and middle indices (l
, r
, m
). These variables occupy
constant space, so the auxiliary space complexity is in the order of
\(\mathcal{O}(1)\).
For the iterative approach, only one stack frame is used, contributing to the auxiliary space complexity of \(\mathcal{O}(1)\). There is no extra allocation in each iteration of the while loop in terms of variables or object instances.
Total Space Complexity#
Total space complexity is the sum of input and auxiliary space complexities. For binary search, if the input space complexity is not considered and the auxiliary space complexity is \(\mathcal{O}(1)\), the total space complexity of binary search is \(\mathcal{O}(1)\), else it is \(\mathcal{O}(N)\).
To summarize, the binary search algorithm is very space-efficient as it only requires constant auxiliary space to perform the search, and it does not modify the input array or list.
Algorithm (Recursive + Exact Match)#
We can also implement binary search recursively. The recursive version of binary search is very similar to the iterative version, except that it uses a recursive function to perform the search instead of a while loop.
Pseudocode#
(Pseudocode)
Algorithm: binary_search_recursive(A, t)
Input: A = [a_0, a_1, ..., a_{N-1}] (sorted list of elements),
t (target value)
Output: Index of t in A or -1 if not found
1: Set l = 0 and r = N - 1 // (Initialization)
2: return binary_search_recursive_helper(A, l, r, t)
Algorithm: binary_search_recursive_helper(A, l, r, t)
Input: A = [a_0, a_1, ..., a_{N-1}] (sorted list of elements),
l (left index),
r (right index),
t (target value)
Output: Index of t in A or -1 if not found
1: if l > r then
2: return -1
3: end if
4: m = l + (r - l) // 2 // (Calculate mid-point)
5: if A[m] == t then
6: return m // (Target found)
7: elif A[m] < t then
8: return binary_search_recursive_helper(A, m + 1, r, t)
9: else
10: return binary_search_recursive_helper(A, l, m - 1, t)
11: end if
Mathematical Representation (Recursive)#
The binary search algorithm is a divide-and-conquer algorithm that halves the search space at each step. It can be formally described using the notation for sequences, with \(\mathcal{A}\) representing the sequence (i.e., the array), \(N\) representing the length of the sequence, \(\ell\) and \(r\) representing the low and high indices of the search space, and \(m\) representing the midpoint.
Here’s a mathematical version for binary search:
(Mathematical Representation)
Define the recursive function \(f: (\mathcal{A}, \ell, r, \tau) \rightarrow k\) as follows:
where:
\(\mathcal{A}\) is a sequence of sorted elements (the array).
\(\ell\) and \(r\) are the leftmost and rightmost indices of the current search space, respectively.
\(m = \ell + \left\lfloor \frac{r - \ell}{2} \right\rfloor\) is the index of the middle element of the current search space, dynamically determined at each recursive call.
\(\mathcal{A}_m\) is the element at index \(m\) in the sequence \(\mathcal{A}\).
\(\tau\) is the target value.
In this recursive formulation, \(f\) operates on the subset of the search space \(\mathcal{S} := \mathcal{A}[\ell:r]\) where the target value \(\tau\) may exist. The function continually divides the problem size in half at each step, characterizing it as a divide-and-conquer algorithm.
In the recursive binary search algorithm, the function \(f(\mathcal{A}, \ell, r, \tau)\) operates based on the following logic:
Base Case for Unsuccessful Search: If \(\ell > r\), it indicates that the search space has become empty, and the target \(\tau\) is not present in \(\mathcal{A}\). In this case, the function returns \(-1\).
Successful Search: If \(\mathcal{A}_m = \tau\) (where \(m = \ell + \left\lfloor \frac{r - \ell}{2} \right\rfloor\) is the midpoint), the search is successful, and the function returns the index \(m\).
Recursive Calls for Subspace Search:
If \(\mathcal{A}_m > \tau\), the target must lie in the left half of the current search space. The function then recursively calls itself with updated parameters: \(f(\mathcal{A}, \ell, m - 1, \tau)\). This effectively narrows the search to the range from \(\ell\) to \(m-1\).
Conversely, if \(\mathcal{A}_m < \tau\), the function explores the right half by calling itself with \(f(\mathcal{A}, m + 1, r, \tau)\). This shifts the search space to between \(m+1\) and \(r\).
When a recursive function hits a base case and returns a value (like the index of the target in a binary search), this returned value is passed back up through the recursion stack. The key aspect here is that each level of the recursive call stack has its own execution context, including its own set of variables.
In the case of binary search:
When the target is found and the base case is reached (i.e., \(\mathcal{A}_m = \tau\)), the function returns the index
m
.This return value is passed back to the previous level of recursion.
At each level, if a returned value is received, the function can simply return this value again up to the next level, without any further processing. This effectively “unwinds” the recursion stack without altering the found value.
Here’s a simplified version of what happens:
Finding the Target: Suppose at some level of recursion, the target is found, and the function returns the index
m
.Unwinding the Stack: As the recursion unwinds, each recursive call returns this value
m
back up the stack.If the recursive function’s structure is like
return f(...)
, then the return valuem
is immediately passed back up to the next level.
Preventing Overwriting: Since each recursive call has its own execution context (its own local variables, parameters, etc.), the returned value does not get overwritten by other operations within those recursive calls. The value
m
simply bubbles up through the stack.Final Return: Eventually, the value reaches the initial call, and that’s the value finally returned by the recursive function.
This mechanism ensures that once the target is found and a value is returned, it effectively bypasses any further logic in the intermediate recursive calls. Each level of the stack just passes the return value up until it reaches the top level. This is a fundamental aspect of how return values are propagated in recursive functions.
We can use Python Tutor to visualize recursive calls here.
Correctness#
(Correctness of Binary Search Recursive Algorithm)
Theorem: Given a sorted array \(\mathcal{A}\) of \(N\) elements \(\mathcal{A}_0, \mathcal{A}_1, \ldots, \mathcal{A}_{N-1}\), the binary search algorithm correctly returns the index of a target value \(\tau\) if it is present in \(\mathcal{A}\), or returns -1 if \(\tau\) is not present.
The following proof follows the notes in CSE 241 Algorithm and Data Structure.
Proof. The proof uses strong mathematical induction, focusing on the invariant related to the search space \(\mathcal{S}\).
Invariant: At the beginning of each call to the binary search function, if the value \(\tau\) is in \(\mathcal{A}\), it lies within the search space \(\mathcal{S} := \mathcal{A}[\ell..r]\) where \(\ell\) and \(r\) are the current lower and upper bounds, respectively.
Base Case (Induction on the size of \(\mathcal{S}\), \(n = r - \ell + 1\)):
We first show the invariant holds for the base case, when the size of \(\mathcal{S}\) is \(n = 0\) or \(n = 1\).
When \(n = 0\) (i.e., \(\ell > r\)), \(\mathcal{S}\) is empty. The algorithm returns -1, which is correct as \(\tau\) is not present in an empty search space.
When \(n = 1\) (i.e., \(\ell = r\)), \(\mathcal{S}\) consists of a single element \(\mathcal{A}[\ell]\). If \(\mathcal{A}[\ell] = \tau\), the algorithm returns \(\ell\), which is correct. If \(\mathcal{A}[\ell] \neq \tau\), the algorithm returns -1, signifying \(\tau\) is not in \(\mathcal{S}\).
Inductive Step:
Assume for the sake of induction such that the invariant holds and the algorithm correctly returns the index of \(\tau\) or -1 for arrays of size up to \(k\) (i.e., for all \(\ell, r\) such that \(r - \ell + 1 \leq k\)).
We need to show that the algorithm also works correctly for an array of size \(k+1\).
In the algorithm, the midpoint \(m\) is calculated. Three scenarios arise:
If \(\mathcal{A}_m = \tau\), the algorithm returns \(m\), which is correct and maintains the invariant.
If \(\mathcal{A}_m < \tau\), the algorithm recurs on \(\mathcal{S} := \mathcal{A}[m+1..r]\). The size of this new search space is at most \(k\), and by the inductive hypothesis, the recursive call correctly maintains the invariant and returns the index of \(\tau\) or -1.
If \(\mathcal{A}_m > \tau\), it recurs on \(\mathcal{S} := \mathcal{A}[\ell..m-1]\), also of size at most \(k\). By the inductive hypothesis, this recursive call maintains the invariant and correctly handles the search.
Thus, for any array of size \(k+1\), the algorithm maintains the invariant and correctly reduces the problem to a smaller instance, thereby proving the correctness for arrays of size \(k+1\).
Conclusion:
By strong mathematical induction, the binary search algorithm correctly finds \(\tau\) in \(\mathcal{A}\) or determines its absence, preserving the invariant at each step. This proves the theorem.
Implementation#
class RecursiveBinarySearchExactMatch(Search):
"""Template 1 but recursive."""
def search(self, container: Sequence[Real], target: Real) -> int:
"""Search for a target from a sorted array container."""
def recursive(
l: NonNegativeInt, r: NonNegativeInt
) -> Union[NonNegativeInt, Literal[-1]]:
if l > r: # base case
return -1
mid_index = self.mid_strategy(l, r)
if container[mid_index] < target:
return recursive(l=mid_index + 1, r=r)
elif container[mid_index] > target:
return recursive(l=l, r=mid_index - 1)
else: # base case
return mid_index
l, r = 0, len(container) - 1
return recursive(l, r)
def mid_strategy(
self, left: NonNegativeInt, right: NonNegativeInt
) -> NonNegativeInt:
"""Strategy for calculating the middle index."""
mid_index = left + math.floor((right - left) / 2)
return mid_index
Tests#
1# Changing the strategy to RecursiveBinarySearchExactMatch
2context.strategy = RecursiveBinarySearchExactMatch()
3result = context.execute_search([2, 5, 8, 12, 16, 23, 38, 56, 72, 91], 23)
4assert result == 5
1tf = TestFramework()
2
3binary_search = RecursiveBinarySearchExactMatch().search
4
5@tf.describe("Testing Recursive Binary Search for Exact Match")
6def test_binary_search() -> None:
7
8 @tf.individual_test("Standard Test Case")
9 def _() -> None:
10 tf.assert_equals(
11 binary_search([1, 3, 5, 7, 9], 5),
12 2,
13 "Should return 2"
14 )
15
16 @tf.individual_test("Target at Start")
17 def _() -> None:
18 tf.assert_equals(
19 binary_search([1, 3, 5, 7, 9], 1),
20 0,
21 "Should return 0"
22 )
23
24 @tf.individual_test("Target at End")
25 def _() -> None:
26 tf.assert_equals(
27 binary_search([1, 3, 5, 7, 9], 9),
28 4,
29 "Should return 4"
30 )
31
32 @tf.individual_test("Target Not in List")
33 def _() -> None:
34 tf.assert_equals(
35 binary_search([1, 3, 5, 7, 9], 4),
36 -1,
37 "Should return -1"
38 )
39
40 @tf.individual_test("Large Numbers in Array")
41 def _() -> None:
42 tf.assert_equals(
43 binary_search([10000, 20000, 30000, 40000], 30000),
44 2,
45 "Should return 2"
46 )
47
48 @tf.individual_test("Empty List")
49 def _() -> None:
50 tf.assert_equals(
51 binary_search([], 1),
52 -1,
53 "Should return -1 (empty array)"
54 )
55
56 @tf.individual_test("Single Element Array")
57 def _() -> None:
58 tf.assert_equals(
59 binary_search([3], 3),
60 0,
61 "Should return 0"
62 )
63
64 @tf.individual_test("Single Element Array, Target Not Present")
65 def _() -> None:
66 tf.assert_equals(
67 binary_search([3], 4),
68 -1,
69 "Should return -1"
70 )
Time Complexity#
Master Theorem#
Let’s use the Master Theorem to analyze the time complexity of binary search.
We have a recurrence relation of the form (also defined in (233)) in Master Theorem:
Where:
\(N\) is the size of the input of the problem.
\(a\) is the number of subproblems in the recursion.
\(b\) is the factor by which the subproblem size is reduced in each recursive call (\(b > 1\)).
\(\frac{N}{b}\) is the size of each subproblem. (All subproblems are assumed to have the same size.)
\(f(N)\) represents the cost of the work done outside the recursive calls, which includes the cost of dividing the problem and the cost of merging the solutions.
For our binary search, we have the following information:
The size of the input is \(N\).
In each step, the problem is divided into two subproblems of size \(\frac{N}{2}\).
The immediate consequence is that \(a=1\) and \(b=2\). Why? Because we have only one subproblem, and the size of the subproblem is halved at each step. In particular \(a=1\) is because only one half is chosen for further search.
This means at each step, there exists only \(a=1\) recursive call.
The cost of the work done outside the recursive calls is \(\mathcal{O}(1)\) because we only need to calculate the middle index and make a decision based on the middle element. So \(f(N) = \mathcal{O}(1)\), which means \(f(N)\) is bounded by a constant \(C\).
With these defined, we can plug in the values into the recurrence relation:
The Master Theorem states that the solution to the recurrence relation (234) can be split into 3 cases, depending on the relationship between \(f(N)\) and \(N^{\log_b a}\). Let’s list them down briefly here.
First, define \(c_{\text{crit}}\) to be the critical exponent:
If \(f(N) = \mathcal{O}(N^c)\), where \(c < c_{\text{crit}}\), then \(\mathcal{T}(N) = \Theta(N^{c_{\text{crit}}})\).
If \(f(N) = \mathcal{O}(N^c)\), where \(c = c_{\text{crit}}\), then \(\mathcal{T}(N) = \Theta(N^{c_{\text{crit}}} \log N)\).
If \(f(N) = \mathcal{O}(N^c)\), where \(c > c_{\text{crit}}\), then \(\mathcal{T}(N) = \Theta(f(N))\).
Now what is \(c\)? \(c\) is \(0\) in our case because \(f(N) = \mathcal{O}(1) = \mathcal{O}(N^0)\). This means \(c = 0\). Now to know which case we’re in, we need to know \(c_{\text{crit}} = \log_b a = \log_2 1 = 0\).
So we’re in the second case of the Master Theorem since \(c = c_{\text{crit}}\).
According to the second case of the Master Theorem, the solution is \(\mathcal{T}(N) = \Theta(N^c \log N)\), where \(c = c_{\text{crit}} = 0\).
Therefore, the time complexity of binary search is \(\mathcal{O}(\log N)\) since \(N^c = N^0 = 1\) (slight abuse of notation here).
Repeated Substitution#
Let’s denote the time complexity of our function as \(\mathcal{T}(N)\), where \(N\) is the number of elements being considered during a given recursive call. Initially, \(N\) is the size of the entire list, but with each recursive call, it gets halved.
On every recursive call to the recursive
function, the list is divided into
two halves, and only one half is considered for further search. Hence, the size
of the problem is reduced to half its previous size.
To find the recurrence relation, let’s break down the operations:
We calculate the middle index, which takes constant time \(\mathcal{O}(1)\).
We make a decision based on the middle element, again taking \(\mathcal{O}(1)\) time.
We make a recursive call, but only on half of the current list.
Putting this into a recurrence relation:
This is a standard divide-and-conquer recurrence relation. We can solve it using the Master Theorem or repeated substitution.
Using repeated substitution:
Now, \(\mathcal{T}\left(\frac{N}{2^k}\right)\) will be \(\mathcal{T}(1)\) (i.e., a constant) when \(\frac{N}{2^k} = 1\) or \(k = \log N\).
So, the expansion becomes:
Given that \(\mathcal{T}(1)\) is a constant time, the dominating factor here is \(\log N\). Therefore, the time complexity is:
Space Complexity#
For recursive algorithms, the space complexity is often determined by the maximum depth of the recursion. In other words, it’s based on the maximum number of recursive calls that are in progress at the same time.
So the intuition is simple, we already established there can be a maximum of \(\log N\) splits, thus it follows that the recursion depth is bounded by \(\log N\).
The space complexity of a recursive binary search gets divided into three parts: input, auxiliary, and total space complexity.
Input Space Complexity#
Input space complexity is the space used to store the input to the problem. For binary search, the input is the array or list we are searching through, and a target value.
In most cases, we don’t consider the space taken by the inputs when analyzing the space complexity of an algorithm, unless the algorithm modifies the input in place. But if we do want to consider the input space, that’s the space taken by the inputs, which is \(\mathcal{O}(N) + \mathcal{O}(1) = \mathcal{O}(N)\), where \(N\) is the length of the array, and \(\mathcal{O}(1)\) is the space taken by the target value.
Auxiliary Space Complexity#
Auxiliary space complexity is the extra space or temporary space used by an
algorithm. In the case of recursive binary search, we still only need three
variables to hold the left, right, and middle indices (l
, r
, m
). These
variables occupy constant space, so the auxiliary space complexity is
\(\mathcal{O}(1)\) per recursive call.
However, because this is a recursive function, we also have to consider the space taken up by the recursion stack. In the worst-case scenario, binary search makes \(\mathcal{O}(\log N)\) recursive calls, and each call adds a level to the stack. So, the auxiliary space complexity in this case is \(\mathcal{O}(\log N)\).
Note that an iterative implementation of binary search would have a space complexity of \(\mathcal{O}(1)\), as it doesn’t need additional space that grows with the input size. It only uses a constant amount of space to store the pointers and the target element.
For the recursive binary search:
Each time we make a recursive call, we essentially halve the input size.
At most, we would need to make \(\log N\) recursive calls (since we are dividing by 2 each time) before we either find our target or exhaust the list.
Each of these calls gets pushed onto the call stack.
Therefore, the maximum height of the call stack, and hence the space complexity due to the recursive calls, is \(O(\log N)\).
Total Space Complexity#
Total space complexity is the sum of input and auxiliary space complexities. For binary search, if the input space complexity is not considered and the uses \(\mathcal{O}(\log N)\) auxiliary space, the total space complexity of recursive binary search is \(\mathcal{O}(\log N)\); otherwise, it is \(\mathcal{O}(N)\).
To summarize, the recursive binary search algorithm is still very space-efficient as it only requires logarithmic auxiliary space to perform the search, and it does not modify the input array or list.
References and Further Readings#
CS2112: Data Structures and Functional Programming - Loop Invariants
CS170: Efficient Algorithms and Intractable Problems - Tutorial 1
CSE241: Introduction to Algorithms and Data Structures - Binary Search Handout
Stack Overflow: How can we prove by induction that binary search is correct?
CS211: Data Structures and Algorithms - Lecture on Binary Search and Induction