Why is CLD2 Fast and Small?

CLD2 detects languages in UTF-8 encoded Unicode text by alternating between two steps: extracting a run of text from the input document, then scoring that run. The first step is done by getonescriptspan() and the second by scoreonescriptspan(). Each is designed for speed and small size.
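
The alternation between extracting and scoring can be pictured with a small sketch. This is not CLD2's code. The Python functions get_one_script_span, score_one_script_span, and detect_scripts are hypothetical stand-ins named after the two steps. The script label is an assumption: it is crudely taken from the first word of each character's Unicode name. The "score" is a placeholder letter count rather than real language scoring.

```python
import unicodedata


def script_of(ch):
    """Rough script label (assumption): first word of the Unicode character name."""
    if not ch.isalpha():
        return None  # digits, punctuation, and spaces carry no script here
    name = unicodedata.name(ch, "")
    return name.split(" ")[0] if name else None


def get_one_script_span(text, start):
    """Hypothetical extractor: return (script, span, next_index), or None at end of input."""
    i, n = start, len(text)
    # Skip leading characters that have no script of their own.
    while i < n and script_of(text[i]) is None:
        i += 1
    if i >= n:
        return None
    script = script_of(text[i])
    j = i
    # Extend the span until a letter of a different script appears.
    while j < n:
        s = script_of(text[j])
        if s is not None and s != script:
            break
        j += 1
    return script, text[i:j].strip(), j


def score_one_script_span(script, span, totals):
    """Hypothetical scorer: a placeholder that only counts letters per script."""
    totals[script] = totals.get(script, 0) + sum(ch.isalpha() for ch in span)


def detect_scripts(text):
    """Alternate between extracting one span and scoring it until input runs out."""
    totals = {}
    pos = 0
    while True:
        result = get_one_script_span(text, pos)
        if result is None:
            break
        script, span, pos = result
        score_one_script_span(script, span, totals)
    return totals
```

For example, detect_scripts("Hello мир, hi") walks three spans (Latin, Cyrillic, Latin) and returns {"LATIN": 7, "CYRILLIC": 3}. The point of the sketch is only the shape of the loop: each pass extracts one run and hands it directly to the scorer.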