LeetCode 287: Find the Duplicate Number Solution in Python – A Step-by-Step Guide

Imagine you have a list of numbers, like [1, 3, 4, 2, 2], where there’s one number that sneaks in twice, and your job is to catch that sneaky duplicate—without changing the list or using extra space if possible. That’s the fun puzzle of LeetCode 287: Find the Duplicate Number, a medium-level problem that challenges you to spot the repeat in a clever way. Using Python, we’ll explore two solutions: the Best Solution, Floyd’s Cycle-Finding Algorithm (also called the "tortoise and hare" method), which is super smart and space-saving, and an Alternative Solution, a binary search approach that’s easier to follow but uses a different trick. With detailed examples, clear code, and a friendly tone—especially for the cycle-finding breakdown—this guide will help you find that duplicate, even if you’re just starting out in coding. Let’s go detective mode and track it down!

What Is LeetCode 287: Find the Duplicate Number?

Section link icon

In LeetCode 287: Find the Duplicate Number, you’re given an array nums with n+1 integers, where each number is between 1 and n (inclusive), and exactly one number appears twice (or more), while the others appear once. Your task is to find that duplicate number. The twist? You can’t modify the array (like sorting it), and ideally, you should use O(1) extra space—meaning no big helper lists or dictionaries. For example, in [1, 3, 4, 2, 2], the duplicate is 2. This problem tests your ability to think outside the box, sharing a spirit with challenges like LeetCode 141: Linked List Cycle.

Problem Statement

  • Input: An integer array nums of length n+1, with values from 1 to n.
  • Output: The integer that appears more than once.
  • Rules: Exactly one duplicate; don’t modify the array; prefer O(1) space; O(n log n) time or better.

Constraints

  • n: 1 to 10⁵ (100,000).
  • nums[i]: 1 to n.
  • One number appears at least twice; others appear once.

Examples

  • Input: nums = [1, 3, 4, 2, 2]
    • Output: 2
  • Input: nums = [3, 1, 3, 4, 2]
    • Output: 3
  • Input: nums = [1, 1]
    • Output: 1

Understanding the Problem: Spotting the Repeat

Section link icon

To solve LeetCode 287: Find the Duplicate Number in Python, we need to find the number that shows up twice in an array where every other number fits neatly from 1 to n, but there’s one extra slot (n+1 total). A simple idea might be to count each number with a dictionary, but that uses extra space, and we’re told to avoid that if we can. Plus, we can’t sort the array to make it obvious. So, we’ll use: 1. Best Solution (Floyd’s Cycle-Finding Algorithm): O(n) time, O(1) space—tricky but brilliant. 2. Alternative Solution (Binary Search): O(n log n) time, O(1) space—more straightforward.

Let’s dive into Floyd’s method with a super clear explanation for beginners.

Best Solution: Floyd’s Cycle-Finding Algorithm (Tortoise and Hare)

Section link icon

Why This Is the Best Solution

Floyd’s Cycle-Finding Algorithm is the coolest way to solve LeetCode 287 because it runs in O(n) time—super fast—and uses O(1) space, meaning no extra storage beyond a couple of variables. It treats the array like a linked list where each number points to an index, and the duplicate creates a loop we can detect. For beginners, it might sound fancy, but it’s like a game of tag with a slow and fast runner—it’s clever and fun once you see it!

How It Works—Explained for Beginners

Okay, let’s make this really simple. Imagine the array as a treasure map where each number tells you where to jump next—like “go to spot 3” or “go to spot 1.” Because there’s a duplicate, you’ll eventually loop back to a spot you’ve been before, like a circle in the path. Here’s how we find it, step-by-step:

  • Step 1: Think of the Array as a Path:
    • Each value in the array is a “pointer” to an index. For example, in [1, 3, 4, 2, 2], at index 0 is 1 (go to index 1), at index 1 is 3 (go to index 3), at index 3 is 2 (go to index 2), at index 2 is 4 (go to index 4), at index 4 is 2 (back to index 2)—whoa, a loop: 2 → 4 → 2!
    • The duplicate (2) makes this loop happen because two indices (3 and 4) point to the same spot (index 2).
  • Step 2: Send Two Runners—Tortoise and Hare:
    • Picture two friends running this path: a slow one (tortoise) moves one step at a time, and a fast one (hare) moves two steps.
    • Start them both at index 0. Tortoise takes it easy: nums[0] → nums[1] → nums[3]..., while Hare zooms: nums[0] → nums[1] → nums[3] → nums[2] → nums[4]...
    • Because there’s a loop (thanks to the duplicate), Hare will eventually catch up to Tortoise inside that loop—like they’re running in circles and bump into each other.
  • Step 3: Find Where They Meet:
    • Keep moving them until they’re at the same spot. For [1, 3, 4, 2, 2], they meet at index 2 (value 4), inside the loop 2 → 4 → 2.
  • Step 4: Find the Loop’s Start (the Duplicate):
    • Now, reset Tortoise to the start (index 0) and slow Hare down to one step at a time.
    • Move them together: when they meet again, it’s at the loop’s entrance—the duplicate number!
    • Why? Math magic says the distance from the start to the loop entrance equals the distance from their meeting point back to the entrance when paced evenly.
  • Step 5: Why It Works:
    • The duplicate creates a cycle because two indices point to the same value, forcing a repeat.
    • The runners find this cycle, and the second phase pinpoints the duplicate value (not just its index).

It’s like a treasure hunt where the loop leads you to the prize—the duplicate!

Step-by-Step Example

Example: nums = [1, 3, 4, 2, 2]

  • Path: 0 → 1 → 3 → 2 → 4 → 2 (loop: 2 → 4 → 2).
  • Phase 1: Find Meeting Point:
    • Tortoise: 0 → 1 → 3 → 2 → 4.
    • Hare: 0 → 1 → 3 → 2 → 4 → 2 (two steps: 0 → 1 → 3, 1 → 3 → 2, 3 → 2 → 4, 2 → 4 → 2).
    • Meet at index 2 (value 4).
  • Phase 2: Find Loop Entrance:
    • Tortoise: 0 → 1 → 3 → 2.
    • Hare (slowed): 2 → 4 → 2 → 2.
    • Meet at index 2, but values are nums[2] = 4, nums[3] = 2—adjust thinking: we want the value they point to.
    • Corrected: Start Hare at 0 too, use values:
      • Tortoise: 1 → 3 → 2.
      • Hare: 1 → 3 → 2.
      • Meet at value 2 (duplicate!).
  • Result: 2.

Code with Detailed Line-by-Line Explanation

Here’s the Python code, explained so any beginner can follow:

class Solution:
    def findDuplicate(self, nums: List[int]) -> int:
        # Step 1: Start two pointers at the first number
        tortoise = nums[0]
        hare = nums[0]

        # Step 2: Phase 1 - Find where they meet in the loop
        while True:
            tortoise = nums[tortoise]  # Move one step
            hare = nums[nums[hare]]    # Move two steps
            if tortoise == hare:       # They bump into each other
                break

        # Step 3: Phase 2 - Find the start of the loop (duplicate)
        tortoise = nums[0]  # Reset tortoise to start
        while tortoise != hare:
            tortoise = nums[tortoise]  # Both move one step now
            hare = nums[hare]

        # Step 4: They meet at the duplicate value
        return tortoise
  • Line 4-5: Tortoise and Hare start at nums[0]—the beginning of our “path.”
  • Line 8-12: First loop:
    • Tortoise moves one spot: nums[tortoise] is the next “jump.”
    • Hare moves two spots: nums[nums[hare]] jumps twice.
    • When they’re equal, they’ve met in the loop—stop here.
  • Line 14-18: Second loop:
    • Reset Tortoise to nums[0]; Hare stays where they met.
    • Both move one step at a time until they meet again—at the duplicate!
  • Line 20: Return the value they land on—it’s our answer.
  • Time Complexity: O(n)—they meet and find the duplicate in linear steps.
  • Space Complexity: O(1)—just two variables, no extra space.

This is like a detective chase—catch the duplicate with no mess!

Comparing the Two Solutions

Section link icon
  • Floyd’s (Best):
    • Pros: O(n) time, O(1) space, super clever.
    • Cons: Takes time to understand.
  • Binary Search (Alternative):
    • Pros: O(n log n) time, O(1) space, easier logic.
    • Cons: Slower than Floyd’s.

Floyd’s wins for speed and elegance.

Additional Examples and Edge Cases

Section link icon
  • [1, 1]: Duplicate = 1.
  • [2, 1, 3, 1]: Duplicate = 1.
  • [1, 2, 2]: Duplicate = 2.

Both work perfectly.

Complexity Breakdown

Section link icon
  • Floyd’s: Time O(n), Space O(1).
  • Binary Search: Time O(n log n), Space O(1).

Floyd’s is the champ.

Key Takeaways

Section link icon
  • Floyd’s: Loops catch duplicates—amazing!
  • Binary Search: Guess and count—simple!
  • Space: O(1) is the goal.
  • Python Tip: Loops and logic are key—see [Python Basics](/python/basics).

Final Thoughts: Catch That Duplicate

Section link icon

LeetCode 287: Find the Duplicate Number in Python is a thrilling puzzle. Floyd’s method is a speedy detective trick, while binary search is a steady guesser. Want more? Try LeetCode 141: Linked List Cycle or LeetCode 442: Find All Duplicates. Ready to hunt? Head to Solve LeetCode 287 on LeetCode and nab that duplicate today!