Seeing this pushback a lot - and it‘s fair!
However, these models don’t have a fixed vocabulary, i.e. there are infinitely many words the model can operate over instead of a finite set of tokens.
I wouldn't really consider these to be tokenizer-free tbh.
Unlike Hnets, these models are word level. The sequence is turned into words (this is literally called tokenization).
Then, the bytes of these words are turned into embeddings, which are then processed by a model.