There are sometimes corresponding constraints in reality, such as systems with comparatively low amounts of RAM where data can be requested (e.g. embedded systems with I/O, servers with network APIs, etc) but data can not or should not be written via those channels. It doesn't make them common, but it is useful to be able to devise algorithms under these or other constraints.

[-]Brendan Long3y20

I agree that the math puzzle is interesting.

I'm still skeptical that this algorithm is useful any real-world situation, although I was hoping I might get comments with counter-examples. Even in the examples you gave, you already have another machine that clearly has far more memory than you need to implement the set algorithm but for some reason you have to write this algorithm to run on a toaster and talk to your dramatically more powerful server over the network? I'm not saying it's impossible, but I hope you can see why I'm skeptical.

Moderation Log

More from Brendan Long

Curated and popular this week

from typing import Any, Iterable # python3 -m pip install bitarray from bitarray import bitarray # We need a resizable bit set for streaming since we don't know the largest value we'll see # If we do know the largest value, we can replace this entire class with bitarray(max_value) class BitSet: """ A set wrapper around bitarray Note that the set can only contain integers, and this set only makes sense if the integers it contains are in a predictable, densely packed range (specifically, when max_value - min_value / 64 > 1). """ def __init__(self, min_value: int = 0, initial_size: int = 256) -> None: """ Initialize a bit set min_value is the smallest value that can be contained in the set (i.e. it will be mapped to index 0) """ self.min_value = min_value self._set = bitarray(initial_size) self._set.setall(0) def __contains__(self, value: Any) -> bool: return ( isinstance(value, int) and 0 <= value - self.min_value < len(self._set) and self._set[value - self.min_value] == 1 ) def ensure_size(self, min_length: int) -> None: """ Ensure the underlying bit array is at least as large the given min_length (resizing if necessary). """ starting_size = len(self._set) if min_length > starting_size: # Don't do a bunch of tiny resizes min_length = max(min_length, starting_size * 2) # Create a new array, initialize it, and copy the old data if applicable old_set = self._set self._set = bitarray(min_length) self._set[starting_size:] = 0 if old_set: self._set[:starting_size] = old_set def add(self, value: int) -> None: assert value >= self.min_value mapped_value = value - self.min_value self.ensure_size(mapped_value + 1) self._set[mapped_value] = 1 def find_first_duplicate(int_list: Iterable[int]) -> int: """ Find one of the duplicates in a list of n + 1 integers in the range 1...n """ seen = BitSet(min_value=1) for value in int_list: if value in seen: return value seen.add(value) assert False, "No duplicates found, which is impossible with conforming inputs"

LESSWRONG
LW

LESSWRONG
LW

4

Additional space complexity isn't always a useful metric

4

4