This tokenizer was created from the BrWac dataset (full version). It is been used by models originated from Flan-T5.