A tree set is a data structure that stores unique values and keeps them sorted. You can get check if a value exists in O(log n)
time.
Another way to implement a Set is by using a hash function, as we covered on Hash Set. There are some critical differences between the two implementations.
We can implement a map
using a balanced binary search tree or a hash function. If we use them to implement a Set
, we would have a HashSet
and TreeSet
. As with all data structures, there are trade-offs. Here are some key differences:
-
TreeSet
, would return the values sorted in ascending order. -
HashSet
, would return the values in insertion order. -
Operations on a
HashSet
would take on average O(1), and in the worst case (rehash is due), it would take O(n). -
Operation on a
TreeSet
is always O(log n).
Data Structure |
Searching By |
Insert |
Delete |
Space Complexity |
|
Index/Key |
Value |
||||
O(1) |
- |
O(1)* |
O(1) |
O(n) |
|
O(log n) |
- |
O(log n) |
O(log n) |
O(n) |
* = Amortized run time. E.g. rehashing might affect run time to O(n).
Tip
|
JavaScript only provides (Hash) Set that’s enough for most needs. But we will implement a Tree Set so it’s more clear how it works and when it should be used.
|
link:../../../src/data-structures/sets/tree-set.js[role=include]
}
-
Converts an array or any iterable data structure to a set.
An everyday use case for Sets is to remove duplicated values from an array. We can do that by bypassing them in the constructor as follows:
set = new TreeSet([1, 2, 3, 2, 1]);
expect(set.size).toBe(3);
expect(Array.from(set.keys())).toEqual([1, 2, 3]);
Ok, now let’s implement the add method.
For adding values to the set, we Tree.add
method.
link:../../../src/data-structures/sets/tree-set.js[role=include]
Our BST implementation can hold duplicated values. It has a multiplicity tally to keep track of duplicates. However, we don’t dupe in a set. For that, we check if the value is already in the tree.
Don’t worry about adding extra lookups. The
Tree.has
is also very performant O(log n).
Again, we rely on the Tree implementation to do the heavy lifting:
has
methodlink:../../../src/data-structures/sets/tree-set.js[role=include]
We delete the elements from the TreeSet using the remove method of the BST.
delete
methodlink:../../../src/data-structures/sets/tree-set.js[role=include]
Voilà! That’s it!
Another use case for a Set is to convert it to an array or use an iterator (for loops, forEach, …). Let’s provide the method for that:
link:../../../src/data-structures/sets/tree-set.js[role=include]
We are using the inOrderTraversal
method of the BST to go each key in an
ascending order.
Symbol
iteratorThe Symbol.iterator
built-in symbol specifies the default iterator for
an object. Used by for…of
, Array.from
, and others.
Now we can convert from set to array and vice versa easily. For instance:
const array = [1, 1, 2, 3, 5];
// array to set
const set = new TreeSet(array);
// set to array
Array.from(set); //↪️ (4) [1, 2, 3, 5]
No more duplicates in our array!
Check out our GitHub repo for the full TreeSet implementation.