[Hard] 315. Count of Smaller Numbers After Self

Problem Statement

You are given an integer array nums and you have to return a new array counts. The array counts has the property where counts[i] is the number of smaller elements to the right of nums[i].

Examples

Example 1:

Input: nums = [5,2,6,1]
Output: [2,1,1,0]
Explanation:
To the right of 5 there are 2 smaller elements (2 and 1).
To the right of 2 there is 1 smaller element (1).
To the right of 6 there is 1 smaller element (1).
To the right of 1 there is 0 smaller elements.

Example 2:

Input: nums = [-1]
Output: [0]

Example 3:

Input: nums = [-1,-1]
Output: [0,0]

Constraints

1 <= nums.length <= 10^5
-10^4 <= nums[i] <= 10^4

Clarification Questions

Before diving into the solution, here are 5 important clarifications and assumptions to discuss during an interview:

“After self” definition: What does “smaller numbers after self” mean? (Assumption: For each element at index i, count how many elements to the right (indices > i) have smaller values)
Duplicate values: How should we handle duplicate values? (Assumption: Count only strictly smaller values - if nums[j] == nums[i] and j > i, don’t count it)
Output format: Should we return counts for each position? (Assumption: Yes - return array where result[i] = count of smaller numbers after nums[i])
Array mutability: Can the input array be modified? (Assumption: No - we need to preserve original array for counting)
Time complexity: What’s the expected time complexity? (Assumption: O(n log n) is optimal - need efficient data structure like Fenwick Tree or BST)

Interview Deduction Process (30 minutes)

Step 1: Brute-Force Approach (8 minutes)

For each element nums[i], scan all elements after it (nums[i+1] to nums[n-1]) and count how many are smaller. This straightforward approach has O(n²) time complexity, which is too slow for arrays up to 10^5 elements.

Step 2: Semi-Optimized Approach (10 minutes)

Process from right to left, maintaining a sorted data structure (like a balanced BST or sorted list) of elements seen so far. For each element, insert it into the sorted structure and count how many elements are smaller. Using a balanced BST gives O(n log n) time, but implementing a balanced BST is complex. Alternatively, use a Fenwick Tree or Segment Tree for efficient range queries.

Step 3: Optimized Solution (12 minutes)

Use merge sort with counting: during the merge process, when merging two sorted halves, count inversions. When an element from the right half is smaller than an element from the left half, all remaining elements in the left half are larger, so we can count them. Alternatively, use Fenwick Tree (Binary Indexed Tree): process from right to left, for each element, query how many smaller elements have been inserted, then insert the current element. This achieves O(n log n) time with O(n) space, which is optimal. The key insight is that we need to count elements that appear after the current element and are smaller, which can be done efficiently using merge sort inversion counting or Fenwick Tree range queries.

Solution Approach

This problem requires counting inversions (smaller elements to the right). We need an efficient data structure to track counts as we process elements from right to left.

Key Insights:

Right-to-Left Processing: Process from right to left so we can query counts of already-seen elements
Coordinate Compression: Map values to indices [1, k] for Fenwick Tree (handles negative numbers)
Fenwick Tree: Efficiently track and query counts of smaller elements
Query Before Update: Query count of elements < current, then update tree

Algorithm:

Coordinate Compression: Map distinct values to [1, k]
Process Right to Left: For each element from right to left
Query: Count how many elements < current have been seen
Update: Mark current element as seen in Fenwick Tree

Solution

Solution: Fenwick Tree (Binary Indexed Tree) with Coordinate Compression

class FenwickTree:
    def __init__(self, n):
        self.sums_ = [0] * (n + 1)

    def lowbit(self, x):
        return x & (-x)

    def update(self, i, delta):
        while i < len(self.sums_):
            self.sums_[i] += delta
            i += self.lowbit(i)

    def query(self, i):
        total = 0
        while i > 0:
            total += self.sums_[i]
            i -= self.lowbit(i)
        return total


class Solution:
    def countSmaller(self, nums):
        # Get rank order
        sorted_vals = sorted(nums)
        sorted_vals = list(dict.fromkeys(sorted_vals))  # remove duplicates

        ranks = {}
        rank = 1
        for num in sorted_vals:
            ranks[num] = rank
            rank += 1

        res = []

        # Fenwick Tree
        tree = FenwickTree(len(ranks))

        # Process from right to left
        for i in range(len(nums) - 1, -1, -1):
            res.append(tree.query(ranks[nums[i]] - 1))
            tree.update(ranks[nums[i]], 1)

        res.reverse()
        return res

Algorithm Explanation:

FrenwickTree Class:

Constructor: Initialize Fenwick Tree with size n (1-indexed array sums_)
lowbit(x): Extract lowest set bit using x & (-x)
update(i, delta): Add delta to position i (1-indexed) and all ancestors
- Traverse upward: i += lowbit(i)
- Stops when i >= sums_.size()
query(i): Get prefix sum from 1 to i (1-indexed)
- Traverse downward: i -= lowbit(i)
- Stops when i <= 0

Solution Class:

Coordinate Compression:
- Create sorted, unique array of all values
- Build unordered_map mapping each value to its rank [1, k]
- Handles negative numbers and large ranges efficiently
- Example: [5, 2, 6, 1] → sorted [1, 2, 5, 6] → ranks: {1:1, 2:2, 5:3, 6:4}
Right-to-Left Processing:
- Process from nums.size()-1 down to 0
- For each element:
  - Get rank: ranks[nums[i]]
  - Query count of elements < current: tree.query(ranks[nums[i]] - 1)
  - Push result to rtn vector
  - Update tree: mark current element as seen with tree.update(ranks[nums[i]], 1)
- Reverse result array to get correct order

How It Works:

Coordinate Compression: [5, 2, 6, 1] → sorted [1, 2, 5, 6] → ranks {1:1, 2:2, 5:3, 6:4}
Right-to-Left: Ensures we only count elements to the right
Query Before Update: Query counts elements already processed (to the right)
Update: Marks current element for future queries
Reverse Result: Results are collected in reverse order, then reversed at the end

Example Walkthrough:

Input: nums = [5, 2, 6, 1]

Step 1: Coordinate Compression
  sorted = [1, 2, 5, 6]
  ranks = {1:1, 2:2, 5:3, 6:4}

Step 2: Process from right to left
  i=3: nums[3] = 1, rank = 1
    query(0) = 0 → rtn.push_back(0)
    update(1, 1) → sums_[1] = 1
    rtn = [0]
    
  i=2: nums[2] = 6, rank = 4
    query(3) = sums_[3] + sums_[2] = 0 + 1 = 1 → rtn.push_back(1)
    update(4, 1) → sums_[4] = 1
    rtn = [0, 1]
    
  i=1: nums[1] = 2, rank = 2
    query(1) = sums_[1] = 1 → rtn.push_back(1)
    update(2, 1) → sums_[2] = 2
    rtn = [0, 1, 1]
    
  i=0: nums[0] = 5, rank = 3
    query(2) = sums_[2] = 2 → rtn.push_back(2)
    update(3, 1) → sums_[3] = 1
    rtn = [0, 1, 1, 2]

Step 3: Reverse result
  reverse(rtn) → [2, 1, 1, 0] ✓

Complexity Analysis:

Time Complexity: O(n log n)
- Coordinate compression: O(n log n) for sorting
- Binary search for each element: O(n log n)
- Fenwick Tree operations: O(n log n) for n updates + n queries
- Overall: O(n log n)
Space Complexity: O(n)
- Result array: O(n)
- Sorted array: O(n)
- Fenwick Tree: O(n)
- Overall: O(n)

Key Insights

Coordinate Compression: Essential for handling negative numbers and large ranges
- Use unordered_map for O(1) rank lookup after sorting
- Maps distinct values to consecutive ranks [1, k]
Right-to-Left Processing: Ensures we only count elements to the right
Fenwick Tree Efficiency: O(log n) per operation, better than naive O(n)
Query Before Update: Query counts already-seen elements, then mark current
Result Collection: Use push_back and reverse for cleaner code when processing backwards
Rank Mapping: unordered_map provides O(1) lookup vs O(log n) binary search

Edge Cases

Single element: nums = [5] → return [0]
All same: nums = [1, 1, 1] → return [0, 0, 0]
Negative numbers: nums = [-1, -2] → coordinate compression handles it
Descending order: nums = [5, 4, 3, 2, 1] → all counts are 0
Ascending order: nums = [1, 2, 3, 4, 5] → counts increase

Common Mistakes

Left-to-right processing: Would count elements to the left instead
Forgetting coordinate compression: BIT requires positive indices
Wrong query index: Using query(x) instead of query(x-1) for strictly smaller
Update before query: Should query first, then update
Not handling duplicates: Coordinate compression must preserve uniqueness

Alternative Approaches

Approach 2: Merge Sort (Divide and Conquer)

Count inversions during merge sort by tracking how many elements from the right subarray are smaller than each element in the left subarray.

class Solution:
    def countSmaller(self, nums):
        n = len(nums)
        res = [0] * n
        indexed = []

        for i in range(n):
            indexed.append((nums[i], i))

        self.mergeSort(indexed, 0, n - 1, res)
        return res

    def mergeSort(self, arr, l, r, res):
        if l >= r:
            return

        mid = l + (r - l) // 2

        self.mergeSort(arr, l, mid, res)
        self.mergeSort(arr, mid + 1, r, res)
        self.merge(arr, l, mid, r, res)

    def merge(self, arr, l, mid, r, res):
        temp = []

        i = l
        j = mid + 1
        rightCount = 0  # elements from right already placed

        while i <= mid and j <= r:
            if arr[i][0] > arr[j][0]:
                rightCount += 1
                temp.append(arr[j])
                j += 1
            else:
                res[arr[i][1]] += rightCount
                temp.append(arr[i])
                i += 1

        while i <= mid:
            res[arr[i][1]] += rightCount
            temp.append(arr[i])
            i += 1

        while j <= r:
            temp.append(arr[j])
            j += 1

        for k in range(len(temp)):
            arr[l + k] = temp[k]

Algorithm Explanation:

Divide: Split array into halves recursively
Conquer: Merge sorted halves while counting inversions
Key Insight: When merging, if a right element is smaller than a left element, it contributes to the count for all remaining left elements

Time Complexity: O(n log n)
Space Complexity: O(n)

Approach 3: Segment Tree with Coordinate Compression

Similar to Fenwick Tree but using explicit segment tree structure.

class SegmentTree:
    def __init__(self, size):
        self.n = size
        self.tree = [0] * (4 * size)

    def update(self, node, l, r, idx):
        if l == r:
            self.tree[node] += 1
            return

        mid = l + (r - l) // 2

        if idx <= mid:
            self.update(2 * node + 1, l, mid, idx)
        else:
            self.update(2 * node + 2, mid + 1, r, idx)

        self.tree[node] = self.tree[2 * node + 1] + self.tree[2 * node + 2]

    def query(self, node, l, r, ql, qr):
        if qr < l or r < ql:
            return 0

        if ql <= l and r <= qr:
            return self.tree[node]

        mid = l + (r - l) // 2

        return (
            self.query(2 * node + 1, l, mid, ql, qr)
            + self.query(2 * node + 2, mid + 1, r, ql, qr)
        )


class Solution:
    def countSmaller(self, nums):
        n = len(nums)
        res = [0] * n

        # Coordinate compression
        sorted_vals = sorted(nums)
        sorted_vals = list(dict.fromkeys(sorted_vals))  # remove duplicates

        def lower_bound(arr, x):
            l, r = 0, len(arr)
            while l < r:
                m = (l + r) // 2
                if arr[m] < x:
                    l = m + 1
                else:
                    r = m
            return l

        st = SegmentTree(len(sorted_vals))

        # Process from right to left
        for i in range(n - 1, -1, -1):
            rank = lower_bound(sorted_vals, nums[i])

            # Query count of elements < nums[i]
            if rank > 0:
                res[i] = st.query(0, 0, len(sorted_vals) - 1, 0, rank - 1)

            # Mark nums[i] as seen
            st.update(0, 0, len(sorted_vals) - 1, rank)

        return res

Time Complexity: O(n log n)
Space Complexity: O(4n) = O(n)

Approach 4: Binary Search Tree (BST)

Use an augmented BST that tracks the count of smaller elements.

class Node:
    def __init__(self, val):
        self.val = val
        self.count = 1
        self.left_count = 0
        self.left = None
        self.right = None

    def less_or_equal(self):
        return self.count + self.left_count


class Solution:
    def countSmaller(self, nums):
        if not nums:
            return []

        nums.reverse()

        root = Node(nums[0])
        res = [0]

        for i in range(1, len(nums)):
            res.append(self.insert(root, nums[i]))

        res.reverse()
        return res

    def insert(self, root, val):
        if root.val == val:
            root.count += 1
            return root.left_count

        elif val < root.val:
            root.left_count += 1
            if not root.left:
                root.left = Node(val)
                return 0
            return self.insert(root.left, val)

        else:
            if not root.right:
                root.right = Node(val)
                return root.less_or_equal()
            return root.less_or_equal() + self.insert(root.right, val)

Algorithm Explanation:

BST Structure: Each node stores:
- val: The value stored in the node
- count: Number of duplicates of this value
- left_count: Number of nodes in left subtree
- left, right: Pointers to children
Helper Method less_or_equal(): Returns count + left_count (elements ≤ current node)
Insert Logic:
- If value equals node: increment count, return left_count
- If value < node: increment left_count, go left (create node if needed)
- If value > node: return less_or_equal() + count from right subtree (create node if needed)
Processing Strategy:
- Reverse input array first
- Process left-to-right (which corresponds to right-to-left in original)
- Reverse result to get correct order
Memory Management: Uses unique_ptr for automatic cleanup and destructor for recursive deletion

Time Complexity:

Average: O(n log n)
Worst: O(n²) if tree becomes unbalanced

Space Complexity: O(n)

Approach 5: Binary Search with Sorted List

Maintain a sorted list and use binary search to find insertion position.

class Solution:
    def countSmaller(self, nums):
        n = len(nums)
        res = [0] * n
        sortedList = []

        # Process from right to left
        for i in range(n - 1, -1, -1):
            # Find position where nums[i] should be inserted
            import bisect
            it = bisect.bisect_left(sortedList, nums[i])

            # Count of elements smaller than nums[i]
            res[i] = it

            # Insert nums[i] at correct position
            sortedList.insert(it, nums[i])

        return res

Algorithm Explanation:

Maintain a sorted list of elements seen so far
For each element, find its insertion position using binary search
Count of smaller elements = insertion index
Insert element to maintain sorted order

Time Complexity: O(n²) - insert operation is O(n)
Space Complexity: O(n)

When to Use: Only for small inputs or when simplicity is preferred

Comparison of All Approaches

Approach	Time Complexity	Space Complexity	Code Complexity	Best For
Fenwick Tree	O(n log n)	O(n)	Simple	General purpose, space-efficient
Merge Sort	O(n log n)	O(n)	Moderate	When you need stable sort
Segment Tree	O(n log n)	O(4n)	More verbose	When you need range queries later
BST	O(n log n) avg, O(n²) worst	O(n)	Moderate	When tree structure is preferred
Binary Search + Insert	O(n²)	O(n)	Simple	Small inputs only
Naive	O(n²)	O(1)	Very simple	Not recommended for large inputs

LC 327: Count of Range Sum - Similar inversion counting
LC 493: Reverse Pairs - Count inversions with condition
LC 1649: Create Sorted Array through Instructions - Fenwick Tree for cost calculation
LC 307: Range Sum Query - Mutable - Fenwick Tree basics

This problem demonstrates the Fenwick Tree (Binary Indexed Tree) pattern for efficient inversion counting. The key insight is using coordinate compression to map values to indices and processing from right to left to count smaller elements efficiently.

Robina Li

[Hard] 315. Count of Smaller Numbers After Self

[Hard] 315. Count of Smaller Numbers After Self

Problem Statement

Examples

Constraints

Clarification Questions

Interview Deduction Process (30 minutes)

Solution Approach

Key Insights:

Algorithm:

Solution

Solution: Fenwick Tree (Binary Indexed Tree) with Coordinate Compression

Algorithm Explanation:

FrenwickTree Class:

Solution Class:

How It Works:

Example Walkthrough:

Complexity Analysis:

Key Insights

Edge Cases

Common Mistakes

Alternative Approaches

Approach 2: Merge Sort (Divide and Conquer)

Approach 3: Segment Tree with Coordinate Compression

Approach 4: Binary Search Tree (BST)

Approach 5: Binary Search with Sorted List

Comparison of All Approaches

Related Posts

Recent Posts

[Hard] 315. Count of Smaller Numbers After Self

Problem Statement

Examples

Constraints

Clarification Questions

Interview Deduction Process (30 minutes)

Solution Approach

Key Insights:

Algorithm:

Solution

Solution: Fenwick Tree (Binary Indexed Tree) with Coordinate Compression

Algorithm Explanation:

FrenwickTree Class:

Solution Class:

How It Works:

Example Walkthrough:

Complexity Analysis:

Key Insights

Edge Cases

Common Mistakes

Alternative Approaches

Approach 2: Merge Sort (Divide and Conquer)

Approach 3: Segment Tree with Coordinate Compression

Approach 4: Binary Search Tree (BST)

Approach 5: Binary Search with Sorted List

Comparison of All Approaches

Related Problems

Related Posts

Recent Posts