Reference:

Binary Search is a basic technique for programmers. The problems it covers are usually examined by interviewers.

Why Binary Search?

  • Fast: $T(N) = T(N/2) + O(eval)$, where $O(eval)$ could be $O(1)$, $O(\log{N})$, $O(N)$, or $O(N\log{N})$.
    • Linear Scan: $O(range) \times O(eval)$
    • Binary Search: $O(\log{(range)}) \times O(eval)$

According to the input scale, we have the following estimates (from the video):

Range Binary (s) Linear (s) Speed Up (x)
100 7 100 14.29
10,000 (10K) 14 10,000 714.29
1,000,000 (1M) 20 1,000,000 50000.00
1,000,000,000 (1B) 30 TLE N/A

When Do We Use Binary Search?

  • The array is partially or fully sorted.
  • The upper bound of time complexity is $O(N)$ or $O(\log{N})$.

There are many variants of binary search. We should be careful of loop condition, mid/left/right update, and return value. Although we may always return lo, some problems may require lo - 1.

As you can see in 162. Find Peak Element, the loop condition and right update are not common.

Intervals

Three Common Intervals

  1. [lo, hi): Left-closed Right-open (C++ STL Style begin(), end())
  2. [lo, hi]: Left-closed Right-closed (can avoid overflow if hi is the largest integer)
  3. (lo, hi): Left-open Right-open (not common)

There is no difference in terms of their outputs if they implement the idea that:

  • Given a function g(mid), returns the smallest mid in the range such that g(mid) holds true. Don’t think in a way of going to the left or right like we do in binary search trees.

The internal representation could be off by $1$ or $2$, but the final output should be exactly the same. Check out the internal examples below.

When could we use binary search?

For function g(mid), there exists a point mid such that:

  • g(mid) holds true if x >= mid (at least one x or all xs).
  • g(mid) holds false if x < mid.

Note: In the peak example above, when x >= mid, g(mid) == true is guaranteed for at least one x.

Thus, only when g(mid) has this property can we use binary search.

The key to binary search is DON’T try to find the exact answer (e.g. don’t write code like if (mid == val) return mid;), but find a split point mid such that x >= mid holds true for all or at least one x, then mid will naturally become the answer. You can see later that it could also give us the lower bound (leftmost) index of the value.

Besides, it is a good way to avoid boundary errors. If you return the answer directly (standard binary search), your code will be erroneous with high probability.

From the video, m = mid (note that the 'g' here is different)

Left-closed Right-open [lo, hi):

1
2
3
4
5
6
7
8
9
10
11
public void binarySearch(int lo, int hi) {
while (lo < hi) { // diff
int mid = lo + (hi - lo) / 2;
if (g(mid)) {
hi = mid; // [lo, mid)
} else {
lo = mid + 1; // [mid + 1, hi)
}
}
return lo; // Returns <hi> if not found
}

Left-closed Right-closed [lo, hi]:

1
2
3
4
5
6
7
8
9
10
11
public void binarySearch(int lo, int hi) {
while (lo <= hi) {
int mid = lo + (hi - lo) / 2;
if (g(mid)) {
hi = mid - 1; // [lo, mid - 1]
} else {
lo = mid + 1; // [mid + 1, hi]
}
}
return lo; // Returns <hi + 1> if not found
}

Internal Examples

Reference: From the video

a. Search range is [10, 101], g(mid) = true if mid >= 25; Else g(mid) = false.

[lo, hi) vs. [lo, hi]:

Iteration lo hi mid g(mid) lo hi mid g(mid)
1 10 102 56 true 10 101 55 true
2 10 56 33 true 10 54 32 true
3 10 33 21 false 10 31 20 false
4 22 33 27 true 21 31 26 true
5 22 27 24 false 22 25 23 false
6 25 27 26 true 24 25 24 false
7 25 26 25 true 25 25 25 true
8 25 25 - - 25 24 - -

Notice the ending state lo == hi vs. lo > hi.

We can see that although g(mid) holds true, we don’t return immediately. We want to locate at the lower bound.

b. Search range is [10, 101], g(mid) = true if mid >= 120; Else g(mid) = false.

[lo, hi) vs. [lo, hi]:

Iteration lo hi mid g(mid) lo hi mid g(mid)
1 10 102 56 false 10 101 55 false
2 57 102 79 false 56 101 78 false
3 80 102 91 false 79 101 90 false
4 92 102 97 false 91 101 96 false
5 98 102 100 false 97 101 99 false
6 101 102 101 false 100 101 100 false
7 102 102 - - 101 101 101 false
8 - - - - 102 101 - -

The final result is the same.

Ex 1 | LC 278 First Bad Version

  1. [lo, hi):
1
2
3
4
5
6
7
8
9
10
11
12
public int firstBadVersion(int n) { // available version: [1, n]
int lo = 1, hi = n;
while (lo < hi) {
int mid = lo + (hi - lo) / 2;
if (isBadVersion(mid)) {
hi = mid;
} else {
lo = mid + 1;
}
}
return lo;
}

Note: hi should have been n + 1, but will overflow if $n = 2^{31} - 1$. This is why people might not use [lo, hi).

To handle this, we can look into the problem to see if we can omit the plus one. For this problem, it is okay because if we process [lo, n) (equivalent to [lo, n - 1]) and the key is not found, it will return n which is the answer to the question. So there is no necessity to process n with extra code. If not found in [1, n), n will be returned.

  1. [lo, hi]:
1
2
3
4
5
6
7
8
9
10
11
12
public int firstBadVersion(int n) { // version: [1, n]
int lo = 1, hi = n;
while (lo <= hi) {
int mid = lo + (hi - lo) / 2;
if (isBadVersion(mid)) {
hi = mid - 1;
} else {
lo = mid + 1;
}
}
return lo;
}

If not found in [1, n], n + 1 will be returned.

Ex 2 | LC 69 Sqrt(x)

  1. [lo, hi):
1
2
3
4
5
6
7
8
9
10
11
12
public int sqrt(int x) {
int lo = 1, hi = x + 1;
while (lo < hi) {
int mid = lo + (hi - lo) / 2;
if (mid > x / mid) { // avoid overflow
hi = mid;
} else {
lo = mid + 1;
}
}
return lo - 1;
}
  1. [lo, hi]:
1
2
3
4
5
6
7
8
9
10
11
12
public int sqrt(int x) {
int lo = 1, hi = x;
while (lo <= hi) {
int mid = lo + (hi - lo) / 2;
if (mid > x / mid) { // mid^2 > x
hi = mid - 1;
} else {
lo = mid + 1;
}
}
return lo - 1;
}

Templates & Examples

Idea: Return lo means that lo is the least index such that g(mid) is true.

Left-closed Right-open Interval [lo, hi):

1
2
3
4
5
6
7
8
9
10
11
12
public int binarySearch(int lo, int hi) {
while (lo < hi) {
int mid = lo + (hi - lo) / 2;
if (f(mid)) return mid; // optional
if (g(mid)) {
hi = mid; // left: [lo, mid)
} else {
lo = mid + 1; // right: [mid + 1, hi)
}
return lo; // or not found
}
}

*Recursion version:

1
2
3
4
5
6
7
8
9
10
11
12
public int binarySearch(int lo, int hi) {
if (lo >= hi) {
return lo;
}
int mid = lo + (hi - lo) / 2;
if (f(mid)) return mid;
if (g(mid)) {
return binarySearch(lo, mid);
} else {
return binarySearch(mid + 1, hi);
}
}

Left-closed Right-closed Interval [lo, hi]:

1
2
3
4
5
6
7
8
9
10
11
12
public int binarySearch(int lo, int hi) {
while (lo <= hi) {
int mid = lo + (hi - lo) / 2;
if (f(mid)) return mid; // optional
if (g(mid)) {
hi = mid - 1; // left: [lo, mid - 1]
} else {
lo = mid + 1; // right: [mid + 1, hi]
}
}
return lo;
}

*Recursion version:

1
2
3
4
5
6
7
8
9
10
11
12
public int binarySearch(int lo, int hi) {
if (lo > hi) {
return lo;
}
int mid = lo + (hi - lo) / 2;
if (f(mid)) return mid; // optional
if (g(mid)) {
return binarySearch(lo, mid - 1);
} else {
return binarySearch(mid + 1, hi);
}
}

Ex 1 | No Duplicates

Return the index of an element in a sorted array. Elements are unique. If not found return -1. For example, A = [1, 2, 5, 7, 8, 12] and search(8) = 4, search(6) = 1.

Usage: int result = binarySearch(A, 8, 0, A.length - 1)

1
2
3
4
5
6
7
8
9
10
11
12
public int binarySearch(int[] A, int val, int lo, int hi) {
while (lo <= hi) {
int mid = lo + (hi - lo) / 2;
if (A[mid] = val) return mid;
if (A[mid] > val) {
hi = mid - 1; // [lo, mid - 1]
} else {
lo = mid + 1; // [mid + 1, hi]
}
}
return -1; // not found
}

Or, we can modify the lower bound version to get the required result.

1
2
3
4
5
6
7
8
9
10
11
12
public int binarySearch(int[] A, int val, int lo, int hi) {
int n = A.length;
while (lo <= hi) {
int mid = lo + (hi - lo) / 2;
if (A[mid] >= val) {
hi = mid - 1;
} else {
lo = mid + 1;
}
}
return (lo >= 0 && lo <= n - 1 && A[lo] == val) ? lo : -1;
}

Ex 2 | Lower / Upper Bound

Return lower or upper bound of a value in a sorted array with duplicates. Since there are duplicates, we have two bounds for the duplicates.

  • lowerBound(A, val): First index of i such that A[i] >= val, which is the g(mid) we mentioned above.
  • upperBound(A, val): First index of i such that A[i] > val (or the element not satisfying the A[mid] == val).

For example A = [1, 2, 2, 2, 4, 4, 5]:

  • lowerBound(A, 2) = 1 (first occurrence), lowerBound(A, 3) = 4 (does not exist)
  • upperBound(A, 2) = 4, upperBound(A, 5) = 7 (does not exist)

Note: #duplicate = upperBound(A, 2) - lowerBound(A, 2)

Lower Bound:

Application scenes:

  • Array is sorted but contains duplicates.
  • Array is partially sorted but does not contain duplicates.
  • Array is partially sorted but contains duplicates.
1
2
3
4
5
6
7
8
9
10
11
public int lowerBound(int[] A, int val, int lo, int hi) {
while (lo <= hi) {
int mid = lo + (hi - lo) / 2;
if (A[mid] >= val) { // according to the definition
hi = mid - 1; // [lo, mid - 1]
} else {
lo = mid + 1; // [mid + 1, hi]
}
}
return lo; // return the least lo that satisfies A[mid] >= val
}

Note: If the value does not exist, the method will return arbitrary values (-1 or hi + 1 etc.)

Therefore, when the requirement says if the value does not exist return -1, we need to use the following code:

1
2
3
4
5
if (lo >= 0 && lo < A.length && A[lo] == val) {
return lo;
} else {
return -1;
}

Upper Bound:

1
2
3
4
5
6
7
8
9
10
11
public int upperBound(int[] A, int val, int lo, int hi) {
while (lo <= hi) {
int mid = lo + (hi - lo) / 2;
if (A[mid] > val) { // according to the definition
hi = mid - 1; // [lo, mid - 1]
} else {
lo = mid + 1; // [mid + 1, hi]
}
}
return lo; // return the first index in the sequence that satisfies A[mid] > val
}

Ex 3 | LC 69 Sqrt(x) with integer part only

sqrt(4) = 2
sqrt(8) = 2

Conversion: Find the value m such that m * m > x then return m - 1.

Note: We can also use m > x / m to avoid overflow.

1
2
3
4
5
6
7
8
9
10
11
12
public int sqrt(int x) {
int lo = 0, hi = x + 1; // maybe overflow
while (lo < hi) {
int mid = lo + (hi - lo) / 2;
if (mid > x / mid) { // if it is mid * mid >= x, x = 4 won't pass
hi = mid;
} else {
lo = mid + 1;
}
}
return lo - 1; // the first index in the sequence lo such that mid > x / mid
}

Ex 4 | LC 278 First Bad Version

It is an interactive problem. We can call isBadVersion(int version) at any time.

[lo, hi):

1
2
3
4
5
6
7
8
9
10
11
12
public int firstBadVersion(int n) {
int lo = 0, hi = n; // it should be n + 1, but n is okay
while (lo < hi) {
int mid = lo + (hi - lo) / 2;
if (isBadVersion(mid)) {
hi = mid;
} else {
lo = mid + 1;
}
}
return lo; // lo is the first index in the sequence such that isBadVersion returns true.
}

[lo, hi]:

1
2
3
4
5
6
7
8
9
10
11
12
public int firstBadVersion(int n) {
int lo = 0, hi = n;
while (lo <= hi) {
int mid = lo + (hi - lo) / 2;
if (isBadVersion(mid)) {
hi = mid - 1;
} else {
lo = mid + 1;
}
}
return lo; // lo is the first index in the sequence such that isBadVersion returns true.
}

Ex 5 | LC 875 Koko Eating Bananas

Find minimum K such that she can eat all the bananas within H hours.

Conversion: The eating speed K to the eating time H.

[lo, hi):

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
public int eat(int[] piles, int H) {
int lo = 1; // min eating speed
int hi = max(piles) + 1; // max eating speed
while (lo < hi) {
int mid = lo + (hi - lo) / 2;
// counting - O(N)
int h = 0;
for (int p : piles) {
h += (p + mid - 1) / mid;
}
// checking
if (h <= H) { // g(mid) = (h <= H)
hi = mid;
} else {
lo = mid + 1;
}
}
return lo;
}

Ex 6 | LC 378 Kth Smallest Element in a Sorted Matrix

Each row and column are sorted.

[lo, hi):

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
public int kthSmallest(int[][] matrix, int k) {
int n = matrix.length;
int lo = matrix[0][0];
int hi = matrix[n - 1][n - 1] + 1;
while (lo < hi) {
int mid = lo + (hi - lo) / 2;
// counting - O(NlogN)
int count = 0;
// for row in A:
// count += upper_bound(row, m)
// checking
if (count >= k) {
hi = mid;
} else {
lo = mid + 1;
}
}
return lo;
}