> Will need to be Unicode-aware I suppose Not really. Both sides (string and separator) are strings, so *unicode* already. So already encoded to some bytes. Just match the bytes. The real design decision is whether the *separator* is a set of chars, or a text-fragment; and whether to *compress* / *ignore* repeated *separator*. Ideally those are options of `split()`.