TR Tokenizer trained on C4 TR