The Day–Stout–Warren (DSW) algorithm is a method for efficiently balancing binary search trees, that is, decreasing their height to O(log n) nodes, where n is the total number of nodes. Unlike a self-balancing binary search tree, it does not do this incrementally during each operation but periodically, so that its cost can be amortized over many operations. The algorithm was designed by Quentin F. Stout and Bette Warren in a 1986 CACM paper, based on work done by Colin Day in 1976.

The algorithm requires linear (O(n)) time and is in-place. The original algorithm by Day generates as compact a tree as possible: all levels of the tree are completely full except possibly the bottom-most. It operates in two phases. First, the tree is turned into a linked list by means of an in-order traversal, reusing the pointers in the (threaded) tree's nodes. A series of left-rotations forms the second phase.

The Stout–Warren modification generates a complete binary tree, namely one in which the bottom-most level is filled strictly from left to right. This is a useful transformation to perform if it is known that no more inserts will be done. It does not require the tree to be threaded, nor does it require more than constant space to operate. Rolfe cites two main advantages: "in circumstances in which one generates an entire binary search tree at the beginning of processing, followed by item look-up access for the rest of processing" and "pedagogically within a course on data structures where one progresses from the binary search tree into self-adjusting trees, since it gives a first exposure to doing rotations within a binary search tree."

Pseudocode

The following is a presentation of the basic DSW algorithm in pseudocode, after the Stout–Warren paper. It consists of a main routine with three subroutines. The main routine is given by:

  1. Allocate a node, the "pseudo-root", and make the tree's actual root the right child of the pseudo-root.
  2. Call tree-to-vine with the pseudo-root as its argument.
  3. Call vine-to-tree on the pseudo-root and the size (number of elements) of the tree.
  4. Make the tree's actual root equal to the pseudo-root's right child.
  5. Dispose of the pseudo-root.

The subroutines are defined as follows:

routine tree-to-vine(root)
    // Convert tree to a "vine", i.e., a sorted linked list,
    // using the right pointers to point to the next node in the list
    tail ← root
    rest ← tail.right
    while rest ≠ nil
        if rest.left = nil
            tail ← rest
            rest ← rest.right
        else
            temp ← rest.left
            rest.left ← temp.right
            temp.right ← rest
            rest ← temp
            tail.right ← temp
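
As an illustration, tree-to-vine can be transcribed into Python roughly as follows. This is a minimal sketch, not the authors' code: the `Node` class, the `vine_keys` helper and the sample tree are assumptions introduced only for the example.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    # Hypothetical node type for this sketch; not part of the original pseudocode.
    key: int
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def tree_to_vine(root: Node) -> None:
    """Flatten the tree hanging off root.right into a sorted right-linked list."""
    tail = root
    rest = tail.right
    while rest is not None:
        if rest.left is None:
            # rest has no left subtree, so it is already in place on the vine.
            tail = rest
            rest = rest.right
        else:
            # Rotate rest's left child up above rest, then re-attach to the vine.
            temp = rest.left
            rest.left = temp.right
            temp.right = rest
            rest = temp
            tail.right = temp


def vine_keys(root: Node) -> List[int]:
    """Illustrative helper: walk the vine and check that no left links remain."""
    keys = []
    node = root.right
    while node is not None:
        assert node.left is None
        keys.append(node.key)
        node = node.right
    return keys


# Sample BST:        4
#                   / \
#                  2   6
#                 / \   \
#                1   3   7
tree = Node(4, Node(2, Node(1), Node(3)), Node(6, None, Node(7)))
pseudo_root = Node(0, right=tree)  # the pseudo-root's key is never inspected
tree_to_vine(pseudo_root)
print(vine_keys(pseudo_root))  # [1, 2, 3, 4, 6, 7]
```

The loop uses only the two pointers `tail` and `rest` plus one temporary, which is consistent with the constant-space claim made for the Stout–Warren variant.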

routine vine-to-tree(root, size)
    leaves ← size + 1 − 2^⌊log₂(size + 1)⌋
    compress(root, leaves)
    size ← size − leaves
    while size > 1
        compress(root, ⌊size / 2⌋)
        size ← ⌊size / 2⌋
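
To make the arithmetic in vine-to-tree concrete, the sketch below lists the successive `count` arguments that the routine passes to compress for a given size. The function name `compress_schedule` is hypothetical, and the use of `int.bit_length` to obtain ⌊log₂(size + 1)⌋ without floating-point error is a choice made for this example rather than something taken from the paper.

```python
from typing import List


def compress_schedule(size: int) -> List[int]:
    """Return the successive count arguments vine-to-tree would pass to compress."""
    # 2 ** floor(log2(size + 1)), computed with integer arithmetic only.
    full = 1 << ((size + 1).bit_length() - 1)
    leaves = size + 1 - full   # nodes that end up on the bottom-most level
    counts = [leaves]
    size -= leaves             # remaining vine length is now 2**k - 1
    while size > 1:
        counts.append(size // 2)
        size //= 2
    return counts


for n in (7, 10, 12):
    print(n, compress_schedule(n))
# 7 [0, 3, 1]
# 10 [3, 3, 1]
# 12 [5, 3, 1]
```

For size = 10, for instance, leaves = 11 − 8 = 3, so the first pass performs 3 rotations and leaves a spine of 7 nodes; the remaining passes perform 3 and then 1 rotations. After the first pass each count is at most half of the remaining spine length, which is why the total number of rotations stays linear in the size.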

routine compress(root, count)
    scanner ← root
    for i ← 1 to count
        child ← scanner.right
        scanner.right ← child.right
        scanner ← scanner.right
        child.right ← scanner.left
        scanner.left ← child
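
The second phase can be sketched in Python as below, applied to a hand-built vine of ten keys. The `Node` class and the `height` and `in_order` helpers are assumptions added for illustration; `compress` and `vine_to_tree` follow the pseudocode line by line, except that ⌊log₂(size + 1)⌋ is computed with `int.bit_length`.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    # Hypothetical node type for this sketch; not part of the original pseudocode.
    key: int
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def compress(root: Node, count: int) -> None:
    """Perform count left rotations, one at every other node of the right spine."""
    scanner = root
    for _ in range(count):
        child = scanner.right
        scanner.right = child.right
        scanner = scanner.right
        child.right = scanner.left
        scanner.left = child


def vine_to_tree(root: Node, size: int) -> None:
    """Turn the vine hanging off root.right into a complete binary tree."""
    full = 1 << ((size + 1).bit_length() - 1)  # 2 ** floor(log2(size + 1))
    leaves = size + 1 - full
    compress(root, leaves)
    size -= leaves
    while size > 1:
        compress(root, size // 2)
        size //= 2


def height(node: Optional[Node]) -> int:
    """Illustrative helper: height counted in nodes."""
    return 0 if node is None else 1 + max(height(node.left), height(node.right))


def in_order(node: Optional[Node]) -> List[int]:
    """Illustrative helper: keys in sorted (in-order) sequence."""
    if node is None:
        return []
    return in_order(node.left) + [node.key] + in_order(node.right)


# Build the vine 1 -> 2 -> ... -> 10 below a pseudo-root (main routine, step 1).
pseudo_root = Node(0)
tail = pseudo_root
for key in range(1, 11):
    tail.right = Node(key)
    tail = tail.right

vine_to_tree(pseudo_root, 10)     # main routine, step 3
balanced = pseudo_root.right      # main routine, step 4
print(balanced.key, height(balanced))  # 7 4
print(in_order(balanced))              # [1, 2, ..., 10], order is preserved
```

With ten nodes the result has four levels, the minimum possible, and the three bottom-level nodes (keys 1, 3 and 5) occupy the leftmost positions, matching the description of a complete binary tree.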
