Byte pair encodings are at the nucleus of representation learning and are generated for what Victoroff termed “meaningful chunks” of words or language. “‘ing [i-n-g] space’ might be a chunk, or ‘space ...